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To  the  Student  from  the  Authors 

Few  people  ever  read  a  preface,  and  those  who  do  often  just  glance  at  the  first  few  lines.  So  we 
begin  by  answering  the  question  most  frequently  asked  by  the  readers  of  our  manuscript:  “What 
does  [ BB]  mean?”  Like  most  undergraduate  texts  in  mathematics  these  days,  answers  to  some 
of  our  exercises  appear  at  the  back  of  the  book.  Those  which  do  are  marked  [BB]  for  “Back  of 
Book.”  In  this  book,  complete  solutions,  not  simply  answers,  are  given  to  almost  all  the  exercises 
marked  [BB].  So,  in  a  sense,  there  is  a  free  Student  Solutions  Manual  at  the  end  of  this  text. 

We  are  active  mathematicians  who  have  always  enjoyed  solving  problems.  It  is  our  hope  that 
our  enthusiasm  for  mathematics  and,  in  particular,  for  discrete  mathematics  is  transmitted  to  our 
readers  through  our  writing. 

The  word  "discrete”  means  separate  or  distinct.  Mathematicians  view  it  as  the  opposite  of 
“continuous.”  Whereas,  in  calculus,  it  is  continuous  functions  of  a  real  variable  that  are  important, 
such  functions  are  of  relatively  little  interest  in  discrete  mathematics.  Instead  of  the  real  numbers, 
it  is  the  natural  numbers  1,  2,  3,  . . .  that  play  a  fundamental  role,  and  it  is  functions  with  domain 
the  natural  numbers  that  are  often  studied.  Perhaps  the  best  way  to  summarize  the  subject  matter 
of  this  book  is  to  say  that  discrete  mathematics  is  the  study  of  problems  associated  with  the 
natural  numbers. 

You  should  never  read  a  mathematics  book  or  notes  taken  in  a  mathematics  course  the  way 
you  read  a  novel,  in  an  easy  chair  by  the  fire.  You  should  read  at  a  desk,  with  paper  and  pencil 
at  hand,  verifying  statements  which  are  less  than  clear  and  inserting  question  marks  in  margins 
so  that  you  are  ready  to  ask  questions  at  the  next  available  opportunity. 

Definitions  and  terminology  are  terribly  important  in  mathematics,  much  more  so  than  many 
students  think.  In  our  experience,  the  number  one  reason  why  students  have  difficulty  with 
“proofs”  in  mathematics  is  their  failure  to  understand  what  the  question  is  asking.  This  book 
contains  a  glossary  of  definitions  (often  including  examples)  at  the  end  as  well  as  a  summary  of 
notation  inside  the  front  and  back  covers.  We  urge  you  to  consult  these  areas  of  the  book  regularly. 

As  an  aid  to  interaction  between  author  and  student,  we  occasionally  ask  you  to  “pause  a 
moment”  and  think  about  a  specific  point  we  have  just  raised.  Our  Pauses  are  little  questions 
intended  to  be  solved  on  the  spot,  right  where  they  occur,  like  this. 

Where  will  you  find  [BB]  in  this  book  and  what  does  it  mean? 

The  answers  to  Pauses  are  given  at  the  end  of  every  section  just  before  the  exercises.  So  when 
a  Pause  appears,  it  is  easy  to  cheat  by  turning  over  the  page  and  looking  at  the  answer,  but  that, 
of  course,  is  not  the  way  to  learn  mathematics! 

We  believe  that  writing  skills  are  terribly  important,  so,  in  this  edition,  we  have  highlighted 
some  exercises  where  we  expect  answers  to  be  written  in  complete  sentences  and  good  English. 
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Discrete  mathematics  is  quite  different  from  other  areas  in  mathematics  which  you  may  have 
already  studied,  such  as  algebra,  geometry,  or  calculus.  It  is  much  less  structured;  there  are  far 
fewer  standard  techniques  to  learn.  It  is,  however,  a  rich  subject  full  of  ideas  at  least  some  of 
which  we  hope  will  intrigue  you  to  the  extent  that  you  will  want  to  learn  more  about  them.  Related 
sources  of  material  for  further  reading  are  given  in  numerous  footnotes  throughout  the  text. 


To  the  Student  from  a  Student 

I  am  a  student  at  Memorial  University  of  Newfoundland  and  have  taken  a  course  based  on  a 
preliminary  version  of  this  book.  I  spent  one  summer  working  for  the  authors,  helping  them  to 
try  to  improve  the  book.  As  part  of  my  work,  they  asked  me  to  write  an  introduction  for  the 
student.  They  felt  a  fellow  student  would  be  the  ideal  person  to  prepare  (warn?)  other  students 
before  they  got  too  deeply  engrossed  in  the  book. 

There  are  many  things  which  can  be  said  about  this  textbook.  The  authors  have  a  unique  sense 
of  humor,  which  often,  subtly  or  overtly,  plays  a  part  in  their  presentation  of  material.  It  is  an 
effective  tool  in  keeping  the  information  interesting  and,  in  the  more  subtle  cases,  in  keeping 
you  alert.  They  try  to  make  discrete  mathematics  as  much  fun  as  possible,  at  the  same  time 
maximizing  the  information  presented. 

While  the  authors  do  push  a  lot  of  new  ideas  at  you,  they  also  try  hard  to  minimize  potential 
difficulties.  This  is  not  an  easy  task  considering  that  there  are  many  levels  of  students  who  will 
use  this  book,  so  the  material  and  exercises  must  be  challenging  enough  to  engage  all  of  them. 
To  balance  this,  numerous  examples  in  each  section  are  given  as  a  guide  to  the  exercises.  Also, 
the  exercises  at  the  end  of  every  section  are  laid  out  with  easier  ones  at  the  beginning  and  the 
harder  ones  near  the  end. 

Concerning  the  exercises,  the  authors’  primary  objective  is  not  to  stump  you  or  to  test  more 
than  you  should  know.  The  purpose  of  the  exercises  is  to  help  clarify  the  material  and  to  make 
sure  you  understand  what  has  been  covered.  The  authors  intend  that  you  stop  and  think  before 
you  start  writing. 

Inevitably,  not  everything  in  this  book  is  exciting.  Some  material  may  not  even  seem  par¬ 
ticularly  useful.  As  a  textbook  used  for  discrete  mathematics  and  graph  theory,  there  are  many 
topics  which  must  be  covered.  Generally,  less  exciting  material  is  in  the  first  few  chapters  and 
more  interesting  topics  are  introduced  later.  For  example,  the  chapter  on  sets  and  relations  may 
not  captivate  your  attention,  but  it  is  essential  for  the  understanding  of  almost  all  later  topics. 
The  chapter  on  principles  of  counting  is  both  interesting  and  useful,  and  it  is  fundamental  to  a 
subsequent  chapter  on  permutations  and  combinations. 

This  textbook  is  written  to  engage  your  mind  and  to  offer  a  fun  way  to  learn  some  mathematics. 
The  authors  do  hope  that  you  will  not  view  this  as  a  painful  experience,  but  as  an  opportunity  to 
begin  to  think  seriously  about  various  areas  of  modem  mathematics.  The  best  way  to  approach 
this  book  is  with  pencil,  paper,  and  an  open  mind. 


To  the  Instructor 

Since  the  first  printing  of  this  book,  we  have  received  a  number  of  queries  about  the  existence  of 
a  solutions  manual.  Let  us  begin  then  with  the  assurance  that  a  complete  solutions  manual  does 
exist  and  is  available  from  the  publisher,  for  the  benefit  of  instructors. 
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The  material  in  this  text  has  been  taught  and  tested  for  many  years  in  two  one-semester 
courses,  one  in  discrete  mathematics  at  the  sophomore  level  (with  no  graph  theory)  and  the  other 
in  applied  graph  theory  (at  the  junior  level).  We  believe  this  book  is  more  elementary  and  written 
with  a  far  more  leisurely  style  than  comparable  books  on  the  market.  For  example,  since  students 
can  enter  our  courses  without  calculus  or  without  linear  algebra,  this  book  does  not  presume  that 
students  have  backgrounds  in  either  subject.  The  few  places  where  some  knowledge  of  a  first 
calculus  or  linear  algebra  course  would  be  useful  are  clearly  marked.  With  one  exception,  this 
book  requires  virtually  no  background.  The  exception  is  Section  10.3,  on  the  adjacency  matrix 
of  a  graph,  where  we  assume  a  little  linear  algebra.  If  desired,  this  section  can  easily  be  omitted 
without  consequences. 

The  material  for  our  first  course  can  be  found  in  Chapters  1  through  7,  although  we  find  it 
impossible  to  cover  all  the  topics  presented  here  in  the  thirty-three  50-minute  lectures  available 
to  us.  There  are  various  ways  to  shorten  the  course.  One  possibility  is  to  omit  Chapter  4  (The 
Integers),  although  it  is  one  of  our  favorites,  especially  if  students  will  subsequently  take  a 
number  theory  course.  Another  solution  is  to  omit  all  but  the  material  on  mathematical  induction 
in  Chapter  5,  as  well  as  certain  other  individual  topics,  such  as  partial  orders  (Section  2.5)  and 
derangements  (Section  7.4). 

Graph  theory  is  the  subject  of  Chapters  9  through  15,  and  again  we  find  that  there  is  more 
material  here  than  can  be  successfully  treated  in  thirty-three  lectures.  Usually,  we  include  only  a 
selection  of  the  various  applications  and  algorithms  presented  in  this  part  of  the  text.  We  do  not 
always  discuss  the  puzzles  in  Section  9.1,  scheduling  problems  (Section  1 1.5),  applications  of  the 
Max  Flow-Min  Cut  Theorem,  or  matchings  (Sections  15.3  and  15.4).  Chapter  13  (Depth-First 
Search  and  Applications)  can  also  be  omitted  without  difficulty.  In  fact,  most  of  the  last  half  of 
this  book  is  self-contained  and  can  be  treated  to  whatever  extent  the  instructor  may  desire. 

Chapter  8,  which  introduces  the  concepts  of  algorithm  and  complexity,  seems  to  work  best  as 
the  introduction  to  the  graph  theory  course. 

Wherever  possible,  we  have  tried  to  keep  the  material  in  various  chapters  independent  of  mate¬ 
rial  in  earlier  chapters.  There  are,  of  course,  obvious  situations  where  this  is  simply  not  possible. 
It  is  necessary  to  understand  equivalence  relations  (Section  2.4),  for  example,  before  reading 
about  congruence  in  Section  4.4,  and  one  must  study  Hamiltonian  graphs  (Section  10.2)  before 
learning  about  facilities  design  in  Section  14.3.  For  the  most  part,  however,  the  graph  theory 
material  can  be  read  independently  of  earlier  chapters.  Some  knowledge  of  such  basic  notions  as 
function  (Chapter  3)  and  equivalence  relation  is  needed  in  several  places  and,  of  course,  many 
proofs  in  graph  theory  require  mathematical  induction  (Section  5.1). 

On  the  other  hand,  we  have  deliberately  included  in  most  exercise  sets  some  problems  which 
relate  to  material  in  earlier  sections,  as  well  as  some  which  are  based  solely  on  the  material  in  the 
given  section.  This  opens  a  wide  variety  of  possibilities  to  instructors  as  to  the  kind  of  syllabus 
they  wish  to  follow  and  to  the  level  of  exercise  that  is  most  appropriate  to  their  students.  We 
hope  students  of  our  book  will  appreciate  the  complete  solutions,  not  simply  answers,  provided 
for  many  of  the  exercises  at  the  back.  By  popular  demand,  we  have  increased  the  number  of 
[BB]s  in  this  second  edition  by  over  60%. 

One  of  the  main  goals  of  this  book  is  to  introduce  students  in  a  rigorous,  yet  friendly,  way  to 
the  “mysteries”  of  theorem  proving.  Sections  1.1  and  1.2  are  intended  as  background  preparation 
for  this  often  difficult  journey.  Because  many  instructors  wish  to  include  more  formal  topics 
in  logic,  this  edition  includes  sections  on  truth  tables,  the  algebra  of  propositions,  and  logical 
arguments  (Sections  1.3,  1.4,  1.5,  respectively). 


Supplements 

There  is  a  full  Instructor’s  Solutions  Manual  (0130920126)  free  to  faculty,  available  only  through 
Prentice  Hall's  sales  reps  and  home  offices.  In  addition,  there  is  a  student  website  of  activities 
available  by  November  1,  2001  at  the  following  address:  www.prenhall.com/goodaire.  This  site 
is  free  to  all  users/purchasers  of  this  text. 


New  in  the  Second  Edition 

The  most  common  (negative)  criticism  of  our  first  edition  was  the  short  treatment  of  logic  and  the 
absence  of  truth  tables.  This  problem  has  been  remedied  with  Chapter  1  (previously  Chapter  0) 
completely  rewritten  and  expanded  significantly  to  include  new  sections  on  truth  tables,  the 
algebra  of  propositions,  and  logical  arguments.  The  text  now  includes  more  than  enough  material 
for  instructors  who  wish  to  include  a  substantial  unit  on  formal  logic,  while  continuing  to  permit 
a  shorter  treatment  dealing  exclusively  with  the  major  points  and  jargon  of  proofs  in  mathematics. 

The  second  most  common  complaint — and  every  student’s  favorite — was  the  shortage  of 
answers  in  the  back  of  the  book.  In  fact,  the  first  edition  contained  over  500  solved  problems. 
For  the  second  edition,  however,  this  number  has  been  increased  to  over  800. 

Other  features  of  the  second  edition  include  the  following: 

•  A  new  section  (12.5)  on  acyclic  graphs  and  an  algorithm  of  Bellman 

•  A  shortest  path  algorithm  due  to  Bellman  and  Ford  which  permits  negative  weight  arcs,  in 
the  section  (11.2)  on  digraphs 

•  Algorithms  rewritten  in  a  less  casual  way  so  as  to  more  closely  resemble  computer  code 

•  Review  exercises  at  the  end  of  every  chapter 

•  Nonmathematical  exercises,  often  requiring  some  research  on  behalf  of  the  reader,  asking  that 
answers  be  written  in  good  clear  English,  in  order  to  encourage  the  development  of  sound 
writing  and  expository  skills 

•  A  new  numbering  scheme  which  will  make  searching  much  easier  for  our  readers 
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1 

There  Are  Proofs! 

An  Introduction  to  Logic 


“How  many  dots  are  there  on  a  pair  of  dice?”  The  question  once  popped  out  of 
the  box  in  a  game  of  trivia  in  which  one  of  us  was  a  player.  A  long  pause  and 
much  consternation  followed  the  question.  After  the  correct  answer  was  finally 
given,  the  author  (a  bit  smugly)  pointed  out  that  the  answer  “of  course”  was  6x7, 
twice  the  sum  of  the  integers  from  1  to  6.  “This  is  because,”  he  declared,  “the 
sum  of  the  integers  from  1  to  n  is  \n(n  +  1),  so  twice  this  sum  is  n(n  +  1)  and, 
in  this  case,  n  =  6.” 

“What?”  asked  one  of  the  players. 

At  this  point,  the  game  was  delayed  for  a  considerable  period  while  the  author 
found  pencil  and  paper  and  made  a  picture  like  that  in  Fig  1.1.  If  we  imagine  the 
dots  on  one  die  to  be  solid  and  on  the  other  hollow,  then  the  sum  of  the  dots 
on  two  dice  is  just  the  number  of  dots  in  this  picture.  There  are  seven  rows  of 
six  dots  each — 42  dots  in  all.  What  is  more,  a  similar  picture  could  be  drawn  for 
seven-sided  dice  showing  that 


and  generally, 

(*)  2(1 +2 +  3 -I - \-n)  —  n  x  (n  +  1). 


Figure  1.1 


o  o  o  o  o  o 

•  o  o  o  o  o 

•  •  o  o  o  o 


•  o  o  o 

•  •  o  o 

•  •  •  o 


Sadly,  that  last  paragraph  is  fictitious.  Everybody  was  interested  in,  and  most 
experimented  with,  the  general  observation  in  equation  (*),  but  nobody  (except 
an  author)  cared  why.  Everybody  wanted  to  resume  the  game! 


1 


2  Chapter  1  Yes,  There  Are  Proofs! 

What  is  the  sum  1  +  2  +  3  +  •  •  •  +  100  of  the  integers  from  1  to  100? 

“Are  there  proofs?”  This  is  one  of  the  first  questions  students  ask  when  they 
enter  a  course  in  analysis  or  algebra.  Young  children  continually  ask  “why”  but, 
for  whatever  reason,  as  they  grow  older,  most  people  only  want  the  facts.  We 
take  the  view  that  intellectual  curiosity  is  a  hallmark  of  advanced  learning  and 
that  the  ability  to  reason  logically  is  an  increasingly  sought-after  commodity  in 
the  world  today.  Since  sound  logical  arguments  are  the  essence  of  mathematics, 
the  subject  provides  a  marvellous  training  ground  for  the  mind.  The  expectation 
that  a  math  course  will  sharpen  the  powers  of  reason  and  provide  instruction 
in  clear  thinking  surely  accounts  for  the  prominence  of  mathematics  in  so  many 
university  programs  today.  So  yes,  proofs — reasons  or  convincing  arguments  prob¬ 
ably  sound  less  intimidating — will  form  an  integral  part  of  the  discussions  in  this 
book. 

In  a  scientific  context,  the  term  “statement”  means  an  ordinary  English  state¬ 
ment  of  fact  (subject,  verb,  and  predicate  in  that  order)  which  can  be  assigned 
a  “truth  value,”  that  is,  which  can  be  classified  as  being  either  true  or  false.  We 
occasionally  say  “mathematical”  statement  to  emphasize  that  a  statement  must 
have  this  characteristic  property  of  being  true  or  false.  The  following  are  all 
mathematical  statements: 

“There  are  168  primes  less  than  1000.” 

“Seventeen  is  an  even  number.” 

is  a  rational  number.” 

“Zero  is  not  negative.” 

Each  statement  is  certainly  either  true  or  false.  (It  is  not  necessary  to  know  which!) 
On  the  other  hand,  the  following  are  not  mathematical  statements: 

“What  are  irrational  numbers?” 

"Suppose  every  positive  integer  is  the  sum  of  three  squares.” 

The  first  is  a  question  and  the  second  a  conditional;  it  would  not  make  sense  to 
classify  either  as  true  or  false. 


1.1  Compound  Statements 

"And"  and  "Or" 

A  compound  statement  is  a  statement  formed  from  two  other  statements  in  one 
of  several  ways,  for  example,  by  linking  them  with  “and”  or  “or.”  Consider 

“9  =  32  and  3.14  <  n.” 

This  is  a  compound  statement  formed  from  the  simpler  statements  “9  =  32  ”  and 
“3.14  <  7i .”  How  does  the  truth  of  an  “and”  compound  statement  depend  upon 
the  truth  of  its  parts?  The  rule  is 

"p  and  q”  is  true  if  both  p  and  q  are  true;  it  is  false 
if  either  p  is  false  or  q  is  false. 
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Thus,  22  =  -4  and  5  <  100”  is  true  while  “22  +  32  =  42  and  3.14  <  7 r” 
is  false. 

In  the  context  of  mathematics,  just  as  in  everyday  English,  one  can  build  a 
compound  statement  by  inserting  the  word  “or”  between  two  other  statements.  In 
everyday  English,  “or”  can  be  a  bit  problematic  because  sometimes  it  is  used  in 
an  inclusive  sense,  sometimes  in  an  exclusive  sense,  and  sometimes  ambiguously, 
leaving  the  listener  unsure  about  just  what  was  intended.  We  illustrate  with  three 
sentences. 

“To  get  into  that  college,  you  have  to  have  a  high  school  diploma  or  be  over 
25.”  (Both  options  are  allowed.) 

“That  man  is  wanted  dead  or  alive.”  (Here  both  options  are  quite  impossible.) 
“I  am  positive  that  either  blue  or  white  is  in  that  team's  logo.”  (Might  there 
be  both?) 

Since  mathematics  does  not  tolerate  ambiguities,  we  must  decide  precisely  what 
“or”  should  mean.  The  decision  is  to  make  “or”  inclusive:  “or”  always  includes 
the  possibility  of  both. 

“p  or  q"  is  true  if  p  is  true  or  q  is  true  or  both  p 
and  q  are  true;  it  is  false  only  when  both  p  and  q 
are  false. 

Thus, 

“7  +  5  =  12  or  571  is  the  125th  prime”  and  “25  is  less  than  or  equal  to  25” 
are  both  true  sentences,  while 

“5  is  an  even  number  or  \/8  >  3” 
is  false. 

implication 

Many  mathematical  statements  are  implications;  that  is,  statements  of  the  form 
“p  implies  q,"  where  p  and  q  are  statements  called,  respectively,  the  hypothesis 
and  conclusion.  The  symbol  — >  is  read  “implies,”  so 

Statement  1:  “2  is  an  even  number  ->  4  is  an  even  number” 

is  read  “2  is  an  even  number  implies  4  is  an  even  number.” 

In  Statement  1,  “2  is  an  even  number”  is  the  hypothesis  and  “4  is  an  even 
number”  is  the  conclusion. 

Implications  often  appear  without  the  explicit  use  of  the  word  “implies.”  To 
some  ears,  Statement  1  might  sound  better  as 

“If  2  is  an  even  number,  then  4  is  an  even  number” 


or 

“2  is  an  even  number  only  if  4  is  an  even  number.” 

Whatever  wording  is  used,  common  sense  tells  us  that  this  implication  is  true. 
Under  what  conditions  will  an  implication  be  false?  Suppose  your  parents  tell 


you 
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Statement  2:  “If  it  is  sunny  tomorrow,  you  may  go  swimming.” 

If  it  is  sunny,  but  you  are  not  allowed  to  go  swimming,  then  clearly  your  parents 
have  said  something  which  is  false.  However,  if  it  rains  tomorrow  and  you  are  not 
allowed  to  go  swimming,  it  would  be  unreasonable  to  accuse  them  of  breaking 
their  word.  This  example  illustrates  an  important  principle. 

The  implication  “p  -»  q ”  is  false  only  when  the  hypothesis  p  is 
true  and  the  conclusion  q  is  false.  In  all  other  situations,  it  is  true. 

In  particular,  Statement  1  is  true  since  both  the  hypothesis,  “2  is  an  even  num¬ 
ber,”  and  the  conclusion,  “4  is  an  even  number,”  are  true.  Note,  however,  that  an 
implication  is  true  whenever  the  hypothesis  is  false  (no  matter  whether  the  con¬ 
clusion  is  true  or  false).  For  example,  if  it  were  to  rain  tomorrow,  the  implication 
contained  in  Statement  2  is  true  because  the  hypothesis  is  false.  For  the  same 
reason,  each  of  the  following  implications  is  true. 

“If  —1  is  a  positive  number,  then  2  +  2  =  5.” 

“If  —  1  is  a  positive  number,  then  2  +  2  =  4.” 

Think  about  the  implication,  “If  42  =  16,  then  —  l2  =  1.”  Is  it  true  or  false? 

The  Converse  of  an  implication 

The  converse  of  the  implication  p  ->  q  is  the  implication  q  — >•  p.  For  example, 
the  converse  of  Statement  1  is 

“If  4  is  an  even  number,  then  2  is  an  even  number.” 

Write  down  the  converse  of  the  implication  given  in  Pause  2.  Is  this  true  or  false? 

Double  Implication 

Another  compound  statement  which  we  will  use  is  the  double  implication,  p  **  q, 
read  “p  if  and  only  if  q."  As  the  notation  suggests,  the  statement  “p  -o-  q ”  is 
simply  a  convenient  way  to  express 

“p  — »  q  and  p  <—  q." 

(We  would  be  more  likely  to  write  “q  —>  p”  than  “p  <—  q .”) 

Putting  together  earlier  observations,  we  conclude  that 

The  double  implication  “p  -o-  q"  is  true  if  p  and  q  have  the  same 
truth  values;  it  is  false  if  p  and  q  have  different  truth  values. 

For  example,  the  statement 

“2  is  an  even  number  4  is  an  even  number” 

is  true  since  both  “2  is  an  even  number”  and  “4  is  an  even  number”  are  true. 
However, 

“2  is  an  even  number  if  and  only  if  5  is  an  even  number" 
is  false  because  one  side  is  true  while  the  other  is  false. 
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Determine  whether  each  of  the  following  double  implications  is  true  or  false. 

(a)  “42  =  16  **  -l2  =  -1.” 

(b)  “42  =  16  if  and  only  if  (— l)2  =  —1.” 

(c)  “42  =  15  if  and  only  if  -l2  =  -1.” 

(d)  “42  =  15  (-1)2  =  -1.” 

Negation 

The  negation  of  the  statement  p  is  the  statement  which  asserts  that  p  is  not  true. 
We  denote  the  negation  of  p  by  “—•p”  and  say  “not  p.”  The  negation  of  “x  equals 
4”  is  the  statement  “x  does  not  equal  4.”  In  mathematical  writing,  a  slash  (!) 
through  a  symbol  is  used  to  express  the  negation  of  that  symbol.  So,  for  instance, 
7^  means  “not  equal.”  Thus,  the  negation  of  “ x  =  4”  is  “x  ^  4.”  In  succeeding 
chapters,  we  shall  meet  other  symbols  like  e,  C,  and  I,  each  of  which  is  negated 
with  a  slash,  <g,  Jf. 

Some  rules  for  forming  negations  are  a  bit  complicated  because  it  is  not 
enough  just  to  say  “not  p”:  We  must  also  understand  what  is  being  said!  To 
begin,  we  suggest  that  the  negation  of  p  be  expressed  as 
“It  is  not  the  case  that  p." 

Then  we  should  think  for  a  minute  or  so  about  precisely  what  this  means.  For 
example,  the  negation  of  “25  is  a  perfect  square”  is  the  statement  “It  is  not  the 
case  that  25  is  a  perfect  square,”  which  surely  means  "25  is  not  a  perfect  square.” 
To  obtain  the  negation  of 
“n  <  10  or  n  is  odd,” 

we  begin 

“It  is  not  the  case  that  n  <  10  or  n  is  odd.” 

A  little  reflection  suggests  that  this  is  the  same  as  the  more  revealing  “ n  >  10 
and  n  is  even.” 

The  negation  of  an  “or”  statement  is  always  an  “and”  statement  and  the 
negation  of  an  "and”  is  always  an  “or.”  The  precise  rules  for  expressing  the 
negation  of  compound  statements  formed  with  “and”  and  “or”  are  due  to  Augustus 
De  Morgan  (whom  we  shall  meet  again  in  Chapter  2). 

The  negation  of  “p  and  q”  is  the  assertion  “~>p  or  -*q” 

The  negation  of  “p  or  q ”  is  the  assertion  “~>p  and  -<q.” 

For  example,  the  negation  of  “n2  +  b2  =  c2  and  a  >  0”  is  “Either  a2  +  b 2  ^  o2 
or  a  <  0.”  The  negation  of  “x  +  y  =  6  or  2x  +  3y  <  7”  is  “x  +  y  ^  6  and 
2x  +  3v  >  7.” 

The  Contrapositive 

The  contrapositive  of  the  implication  “p  — >  q ”  is  the  implication  “( — )  —*■  (_,p).” 
For  example,  the  contrapositive  of 

“If  2  is  an  even  number,  then  4  is  an  even  number” 
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is 

“If  4  is  an  odd  number,  then  2  is  an  odd  number.” 

flHO  Write  down  the  contrapositive  of  the  implications  in  Pauses  2  and  3.  In  each 
case,  state  whether  the  contrapositive  is  true  or  false.  How  do  these  truth  values 
compare,  respectively,  with  those  of  the  implications  in  these  Pauses? 


Quantifiers 


The  expressions  “there  exists”  and  “for  all,”  which  quantify  statements,  figure 
prominently  in  mathematics.  The  universal  quantifier  for  all  (and  equivalent 
expressions  such  as  for  every,  for  any,  and  for  each)  says,  for  example,  that  a 
statement  is  true  for  all  integers  or  for  all  polynomials  or  for  all  elements  of  a 
certain  type.  The  following  statements  illustrate  its  use.  (Notice  how  it  can  be 
disguised;  in  particular,  note  that  “for  any”  and  “all”  are  synonymous  with  “for 
all.”) 

“x2  +  x  +  1  >  0  for  all  real  numbers  x.” 

“All  polynomials  are  continuous  functions.” 

“For  any  positive  integer  n,  2(1  +  2  +  3  + - 1-  n)  —  n  x  (n  +  1).” 

“(A6)C  —  A(BC)  for  all  square  matrices  A,  B,  and  C.” 

Rewrite  “All  positive  real  numbers  have  real  square  roots,”  making  explicit  use 
of  a  universal  quantifier. 


The  existential  quantifier  there  exists  stipulates  the  existence  of  a  single  ele¬ 
ment  for  which  a  statement  is  true.  Here  are  some  assertions  in  which  it  is 
employed. 

“There  exists  a  smallest  positive  integer.” 

“Two  sets  may  have  no  element  in  common.” 

"Some  polynomials  have  no  real  zeros.” 


Again,  we  draw  attention  to  the  fact  that  the  ideas  discussed  in  this  chapter  can 
arise  in  subtle  ways.  Did  you  notice  the  implicit  use  of  the  existential  quantifier 
in  the  second  of  the  preceding  statements? 

“There  exists  a  set  A  and  a  set  B  such  that  A  and  B  have  no  element  in 
common.” 

Rewrite  “Some  polynomials  have  no  real  zeros”  making  use  of  the  existential 
quantifier. 


Here  are  some  statements  which  employ  both  types  of  quantifiers. 

“There  exists  a  matrix  0  with  the  property  that  A  +  0  =  0  +  A  for  all  matri¬ 
ces  A.” 

“For  any  real  number  x,  there  exists  an  integer  n  such  that  n  <  x  <  n  +  1 .” 
“Every  positive  integer  is  the  product  of  primes.” 

“Every  nonempty  set  of  positive  integers  has  a  smallest  element.” 

Sometimes  it  requires  real  thought  to  express  in  a  useful  way  the  negation  of 
a  statement  which  involves  one  or  more  quantifiers.  It’s  usually  helpful  to  begin 
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with  “It  is  not  the  case.”  Consider,  for  instance,  the  assertion 
“For  every  real  number  x,  x  has  a  real  square  root.” 

A  first  stab  at  its  negation  gives 

“It  is  not  the  case  that  every  real  number  x  has  a  real  square  root.” 

Now  we  must  think  about  what  this  really  means.  Surely, 

“There  exists  a  real  number  which  does  not  have  a  real  square  root.” 

It  is  helpful  to  observe  that  the  negation  of  a  statement  involving  one  type  of 
quantifier  is  a  statement  which  uses  the  other.  Specifically, 

The  negation  of  “For  all  something,  p”  is  the  statement  “There  exists 
something  such  that  ->p.n 

The  negation  of  “There  exists  something  such  that  p ”  is  the  state¬ 
ment  “For  all  something,  -> p 

For  example,  the  negation  of 

“There  exist  a  and  b  for  which  ab  ^  ba” 

is  the  statement 

“For  all  a  and  b,  ab  =  ba .” 

The  symbols  V  and  3  are  commonly  used  for  the  quantifiers  “for  all”  and  “there 
exists”,  respectively.  For  example,  we  might  encounter  the  statement 

V.r,  3n  such  that  n  >  x 

in  a  book  in  real  analysis.  We  won't  use  this  notation  in  this  book,  but  it  is  so 
common  that  you  should  know  about  it. 

What  may  l  assume? 

In  our  experience,  when  asked  to  prove  something,  students  often  wonder  just 
what  they  are  allowed  to  assume.  For  the  rest  of  this  book,  the  answer  is  any  fact, 
including  the  result  of  any  exercise,  stated  earlier  in  the  book.  This  chapter  is 
somewhat  special  because  we  are  talking  “about”  mathematics  and  endeavoring  to 
use  only  “familiar”  ideas  to  motivate  our  discussion.  In  addition  to  basic  college 
algebra,  here  is  a  list  of  mathematical  definitions  and  facts  which  we  assume  and 
which  the  student  is  free  also  to  assume  in  this  chapter’s  exercises. 

•  The  product  of  nonzero  real  numbers  is  nonzero. 

•  The  square  of  a  nonzero  real  number  is  a  positive  real  number. 

•  An  even  integer  is  one  which  is  of  the  form  2k  for  some  integer  k\  an  odd 
integer  is  one  which  is  of  the  form  2 k  +  1  for  some  integer  k. 

•  The  product  of  two  even  integers  is  even;  the  product  of  two  odd  integers  is 
odd;  the  product  of  an  odd  integer  and  an  even  integer  is  even. 

•  A  real  number  is  rational  if  it  is  a  common  fraction,  that  is,  the  quotient  f 
of  integers  m  and  n  with  n  /0. 
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Answers  to  Pauses 


•  A  real  number  is  irrational  if  it  is  not  rational.  For  example,  n  and  V 5  are 
irrational  numbers. 

•  An  irrational  number  has  a  decimal  expansion  which  neither  repeats  nor  ter¬ 
minates. 

•  A  prime  is  a  positive  integer  p  >  1  which  is  divisible  evenly  only  by  ±1  and 
±p,  for  example,  2,  3  and  5. 

1.  By  equation  (*),  twice  the  sum  of  the  integers  from  1  to  100  is  100  x  101. 
So  the  sum  itself  is  50  x  101  =  5050. 

2.  This  is  false.  The  hypothesis  is  true  but  the  conclusion  is  false:  —  l1 2  =  —  1, 
not  1. 

3.  The  converse  is  “If  —  l2  =  1,  then  42  =  16.”  This  is  true,  because  the 
hypothesis,  l2  =  1”,  is  false. 

4.  (a)  This  is  true  because  both  statements  are  true. 

(b)  This  is  false  because  the  two  statements  have  different  truth  values. 

(c)  This  is  false  because  the  two  statements  have  different  truth  values. 

(d)  This  is  true  because  both  statements  are  false. 

5.  The  contrapositive  of  the  implication  in  Pause  2  is  “If  —  l2  ^  1,  then  42  ^ 
16.”  This  is  false  because  the  hypothesis  is  true,  but  the  conclusion  is  false. 
The  contrapositive  of  the  implication  in  Pause  3  is  “If  42  ^  16,  then  -l2  ^ 

1.”  This  is  true  because  the  hypothesis  is  false. 

These  answers  are  the  same  as  in  Pauses  2  and  3.  This  is  always  the  case,  as 
we  shall  see  in  Section  1.2. 

6.  “For  all  real  numbers  x  >  0,  x  has  a  real  square  root.” 

7.  “There  exists  a  polynomial  with  no  real  zeros.” 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 

You  are  urged  to  read  the  final  paragraph  of  this  section — What  may  I  assume? — before  attempting  these  exercises. 


1.  Classify  each  of  the  following  statements  as  true  or  false 
and  explain  your  answers. 


(a)  [BB]  “4  =  2  +  2  and  7  <  V50.” 

(b)  “4^2  +  2  and  7  <  V50.” 

(c)  [BB]  “4  =  2  +  2  — ►  7  <  V50  ” 

(d)  “4  =  2  +  2  o  7  <  V50.” 

(e)  [BB]  "4  #  2  +  2  ->  7  <  V50.” 

(f)  “4  #  2  +  2  7  <  V50.” 

(g)  [BB]  “4  =  2  +  2  — »  7  >  V50 .” 

(h)  “The  area  of  a  circle  of  radius  r  is  2jrr  or  its  cir¬ 
cumference  is  7 rr2." 

(i)  “2  +  3  =  5  ->  5  +  6  =  10.” 

2.  Classify  each  of  the  following  statements  as  true  or  false 
and  explain  your  answers. 


(a)  [BB]  If  a  and  b  are  integers  with  a  —  b  >  0  and 
b  —  a  >  0,  then  a  =  b. 

(b)  If  a  and  b  are  integers  with  a—b>  0  and  b  —  a  >  0, 
then  a  —  b. 

3.  Write  down  the  negation  of  each  of  the  following  state¬ 
ments  in  clear  and  concise  English.  Do  not  use  the 

expression  “It  is  not  the  case  that”  in  your  answers. 

(a)  [BB]  Either  a2  >  0  or  a  is  not  a  real  number. 

(b)  x  is  a  real  number  and  x2  +  \  =  0. 

(c)  [BB]  x  =  ±\. 

(d)  Every  integer  is  divisible  by  a  prime. 

(e)  [BB]  For  every  real  number  x,  there  is  an  integer  n 
such  that  n  >  x. 

(f>  There  exist  a,  b,  and  c  such  that  ( ab)c  /  a(bc). 

(g)  [BB]  There  exists  a  planar  graph  which  cannot  be 
colored  with  at  most  four  colors. 


1.2  Proofs  in  Mathematics  9 


(h)  For  every  x  >  0,  x2  +  y2  >  0  for  all  y. 

(i)  [BB]  For  all  integers  a  and  b,  there  exist  integers  q 
and  r  such  that  b  =  qa  +  r . 

(j)  There  exists  an  infinite  set  whose  proper  subsets  are 
all  finite. 

4.  Write  down  the  converse  and  the  contrapositive  of  each 
of  the  following  implications. 

(a)  [BB]  If  |  and  *  are  integers,  then  |  is  an  integer. 

(b)  x2  =  1  — >•  x  =  ±1. 

(c)  [BB]  Every  Eulerian  graph  is  connected. 

(d)  ab  =  0  — ►  a  =  0  or  b  =  0. 


(e)  [BB]  A  square  is  a  four-sided  figure. 

(f)  If  ABAC  is  a  right  triangle,  then  a2  =  b2  +  c2. 

5.  Rewrite  each  of  the  following  statements  using  the  quan¬ 
tifiers  “for  all”  and  “there  exists”  as  appropriate. 

(a)  [BB]  Not  all  continuous  functions  are  differentiable. 

(b)  For  real  x,  2X  is  never  negative. 

(c)  [BB]  There  is  no  largest  real  number. 

(d)  There  are  infinitely  many  primes. 

(e)  [BB]  Every  positive  integer  is  the  product  of  primes. 

(f)  All  positive  real  numbers  have  real  square  roots. 

6.  Is  it  possible  for  both  an  implication  and  its  converse  to 
be  false?  Explain  your  answer. 


1.2  Proofs  in  Mathematics 

Many  mathematical  theorems  are  statements  that  a  certain  implication  is  true.  A 
simple  result  about  real  numbers  says  that  if  x  is  between  0  and  1,  then  x2  <  1. 
In  other  words,  for  any  choice  of  a  real  number  between  0  and  1 ,  it  is  a  fact  that 
the  square  of  the  number  will  be  less  than  1 .  We  are  asserting  that  the  implication 

Statement  3:  “0  <  x  <  1  ->•  x2  <  1” 

is  true.  In  Section  1.1,  the  hypothesis  and  conclusion  of  an  implication  could  be 
any  two  statements,  even  statements  completely  unrelated  to  each  other.  In  the 
statement  of  a  theorem  or  a  step  of  a  mathematical  proof,  however,  the  hypothesis 
and  conclusion  will  be  statements  about  the  same  class  of  objects  and  the  statement 
(or  step)  is  the  assertion  that  an  implication  is  always  true.  The  only  way  for  an 
implication  to  be  false  is  for  the  hypothesis  to  be  true  and  the  conclusion  false. 
So  the  statement  of  a  mathematical  theorem  or  a  step  in  a  proof  only  requires 
proving  that  whenever  the  hypothesis  is  true,  the  conclusion  must  also  be  true. 

When  the  implication  “A  — *■  B”  is  the  statement  of  a  theorem,  or  one  step 
in  a  proof,  A  is  said  to  be  a  sufficient  condition  for  B  and  B  is  said  to  be  a 
necessary  condition  for  A.  For  instance,  the  implication  in  Statement  3  can  be 
restated  “0  <  x  <  1  is  sufficient  for  x2  <  1”  or  “x2  <  1  is  necessary  for 
0  <  x  <  1.” 

^ j  Rewrite  the  statement  “A  matrix  with  determinant  1  is  invertible”  so  that  it 
becomes  apparent  that  this  is  an  implication.  What  is  the  hypothesis?  What  is  the 
conclusion?  This  sentence  asks  which  is  a  necessary  condition  for  what?  What  is 
a  sufficient  condition  for  what? 

To  prove  that  Statement  3  is  true,  it  is  not  enough  to  take  a  single  example, 

i  i  2 

x  =  ^  for  instance,  and  claim  that  the  implication  is  true  because  (J>  <  L 

It  is  not  better  to  take  ten,  or  even  ten  thousand,  such  examples  and  verify  the 
implication  in  each  special  case.  Instead,  a  general  argument  must  be  given 
which  works  for  all  x  between  0  and  1 .  Here  is  such  an  argument. 

Assume  that  the  hypothesis  is  true;  that  is,  x  is  a  real  number  with 
0  <  x  <  1.  Since  x  >  0,  it  must  be  that  x  •  x  <  1  •  x  because 
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multiplying  both  sides  of  an  inequality  such  as  x  <  1  by  a  positive 
number  such  as  x  preserves  the  inequality.  Hence  x2  <  x  and,  since 
x  <  1,  x2  <  1  as  desired. 

Now  let  us  consider  the  converse  of  the  implication  in  Statement  3: 
Statement  4:  “x2  <  1  — >•  0  <  x  <  1 .” 

This  is  false.  For  example,  when  x  =  —  j,  the  left-hand  side  is  true,  since  (—  j)2  = 
\  <  1,  while  the  right-hand  side  is  false.  So  the  implication  fails  when  x  = 

It  follows  that  this  implication  cannot  be  used  as  part  of  a  mathematical  proof. 
The  number  x  =  —  ^  is  called  a  counterexample  to  Statement  4;  that  is,  a  specific 
example  which  proves  that  an  implication  is  false. 

There  is  a  very  important  point  to  note  here.  To  show  that  a  theorem,  or  a 
step  in  a  proof,  is  false,  it  is  enough  to  find  a  single  case  where  the  implication 
does  not  hold.  However,  as  we  saw  with  Statement  3,  to  show  that  a  theorem  is 
true,  we  must  give  a  proof  which  covers  all  possible  cases. 

State  the  contrapositive  of  “0  <  x  <  1  — >  x2  <  1 .”  Is  this  true  or  false? 

Theorems  in  mathematics  don’t  have  to  be  about  numbers.  For  example,  a 
very  famous  theorem  proved  in  1976  asserts  that  if  Q  is  a  planar  graph,  then 
Q  can  be  colored  with  at  most  four  colors.  (The  definitions  and  details  are  in 
Chapter  14.)  This  is  an  implication  of  the  form  “A  ->  B,”  where  the  hypothesis 
A  is  the  statement  that  Q  is  a  planar  graph  and  the  conclusion  B  is  the  statement 
that  Q  can  be  colored  with  at  most  four  colors.  The  Four-Color  Theorem  states 
that  this  implication  is  true. 

(For  students  who  have  studied  linear  algebra.)  The  statement  given  in  Pause  8  is 
a  theorem  in  linear  algebra;  that  is,  the  implication  is  true.  State  the  converse  of 
this  theorem.  Is  this  also  true? 

A  common  expression  in  scientific  writing  is  the  phrase  if  and  only  if  denoting 
both  an  implication  and  its  converse.  For  example, 

Statement  5:  “x2  4-  y2  =  0  -o-  (x  =  0  and  y  =  0).” 

As  we  saw  in  Section  1.1,  the  statement  “A  <-»  B”  is  a  convenient  way  to  express 
the  compound  statement 
“A  ->•  B  and  B  ->  A.” 

The  sentence  “A  is  a  necessary  and  sufficient  condition  for  B”  is  another 
way  of  saying  “A  -«■>  B.”  The  sentence  “A  and  B  are  necessary  and  sufficient 
conditions  for  C”  is  another  way  of  saying 
“(A  and  B)  ^  C.” 

For  example,  “a  triangle  has  three  equal  angles”  is  a  necessary  and  sufficient 
condition  for  “a  triangle  has  three  equal  sides.”  We  would  be  more  likely  to  hear 
“In  order  for  a  triangle  to  have  three  equal  angles,  it  is  necessary  and  sufficient 
that  it  have  three  equal  sides.” 

To  prove  that  “A  -o-  B”  is  true,  we  must  prove  separately  that  “A  — »■  B” 
and  “B  -»  A”  are  both  true,  using  the  ideas  discussed  earlier.  In  Statement  5,  the 
implication  “(x  =  0  and  y  =  0)  -*■  x2  +  y2  =  0”  is  easy. 
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Prove  that  “x2  +  y1  =  0  -»■  (x  =  0  and  y  =  0).” 

As  another  example,  consider 

Statement  6:  “0  <  x  <  1  -o-  x2  <  1 

Is  this  true?  Well,  we  saw  earlier  that  “0  <  a:  <  1  — >  x2  <  1”  is  true,  but  we 
also  noted  that  its  converse,  “x1  <  1  — ¥  0  <  Jt  <  1,”  is  false.  It  follows  that 
Statement  6  is  false. 

Determine  whether  “—1  <  x  <  1  -o-  x2  <  1"  is  true  or  false. 

Sometimes,  a  theorem  in  mathematics  asserts  that  three  or  more  statements 
are  equivalent,  meaning  that  all  possible  implications  between  pairs  of  statements 
are  true.  Thus 

“The  following  are  equivalent: 

1.  A 

2.  B 

3.  6” 

means  that  each  of  the  double  implications  A  <h>-  B,  23  **  C,  A  C  is  true. 
Instead  of  proving  the  truth  of  the  six  implications  here,  it  is  more  efficient  just 
to  establish  the  truth,  say,  of  the  sequence 

A  -+  3  6  -»•  A 

of  three  implications.  It  should  be  clear  that  if  these  implications  are  all  true,  then 
any  implication  involving  two  of  A,  23,  C  is  also  true;  for  example,  the  truth  of 
23  — >  A  would  follow  from  the  truth  of  23  — >  G  and  C  ->  A. 

Alternatively,  to  establish  that  A,  13,  and  C  are  equivalent,  we  could  establish 
the  truth  of  the  sequence 

B  -»•  A-+  G->  3 

of  implications.  Which  of  the  two  sequences  a  person  chooses  is  a  matter  of 
preference  but  is  usually  determined  by  what  appears  to  be  the  easiest  way  to 
argue.  Here  is  an  example. 


PROBLEM  1.  Let  x  be  a  real  number.  Show  that  the  following  are  equivalent. 

(1)  x  =  ±1. 

(2)  x2  =  1. 

(3)  If  a  is  any  real  number,  then  ax  =  ±a. 

Solution.  To  show  that  these  statements  are  equivalent,  it  is  sufficient  to  establish 
the  truth  of  the  sequence 

(2)  -+  (1)  ->  (3)  (2). 

(2)  ->  (1):  The  notation  means  “assume  (2)  and  prove  (1).”  Since  x2  =  1, 
0  =  x2  —  1  =  (x  +  l)(x  —  1).  Since  the  product  of  real  numbers  is  zero  if  and 
only  if  one  of  the  numbers  is  zero,  either  x  +  1  =  0  or  x  —  1  =0;  hence  x  =  —  1 
or  x  =  + 1 ,  as  required. 
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(1)  ->  (3):  The  notation  means  “assume  (1)  and  prove  (3).”  Thus  either 
x  =  +1  or  x  =  —1.  Let  a  be  a  real  number.  If  x  —  +1,  then  a  ■  x  =  a  ■  1  =  a 
while  if  x  =  —  1,  then  ax  =  -a.  In  every  case,  ax  =  ±a  as  required. 

(3)  -*  (2):  We  assume  (3)  and  prove  (2).  We  are  given  that  ax  =  ±a  for 
any  real  number  a.  With  a  =  1,  we  obtain  x  =  ±1  and  squaring  gives  x2  =  1,  as 
desired.  H 


The  symbol  I  marks  the  end  of  a  proof.  Some  authors  write  “Q.E.D.”  from  the 
Latin  quod  erat  demonstrandum  (which  was  to  be  proved)  for  the  same  purpose. 

In  the  index  (see  equivalent ),  you  are  directed  to  other  places  in  this  book 
where  we  establish  the  equivalence  of  a  series  of  statements. 

Direct  Proof 

Most  theorems  in  mathematics  are  stated  as  implications:  “A  —>  3.”  Sometimes, 
it  is  possible  to  prove  such  a  statement  directly,  that  is,  by  establishing  the  validity 
of  a  sequence  of  implications: 

A  — >•  A\  — >  A2  —*■  ■■■—*■  3. 


PROBLEM  2.  Prove  that  for  all  real  numbers  x,  x2  —  Ax  +  17  7^  0. 

Solution.  We  observe  that  x2  —  4x  +  17  =  (x  —  2)2  +  13  is  the  sum  of  13  and 
a  number,  ( x  -  2)2,  which  is  never  negative.  So  x2  -  Ax  +  17  >  13  for  any  x;  in 
particular,  x2  —  4x  +  17  ^  0.  I 


PROBLEM  3.  Suppose  that  x  and  y  are  real  numbers  such  that  2x  +  y  —  1  and 
x  —  y  =  —4.  Prove  that  x  =  —  1  and  y  =  3. 

Solution.  (2x  +  y  =  1  and  x  —  y  —  —4)  ->  (2x  +  y)  +  {x  —  y)  =  1  —  4 

—*■  3x  =  -  3  ->  x  =  —  1 . 

Also, 

(x  =  —  1  and  x  —  y  =  —4)  — ►  (—  1  —  y  —  —4)  -»  (y  =  —  1  +  4  =  3).  I 


Many  of  the  proofs  in  this  book  are  direct.  In  the  index  (under  direct ),  we 
guide  you  to  several  of  these. 

Proof  by  Cases 

Sometimes  a  direct  argument  is  made  simpler  by  breaking  it  into  a  number  of 
cases,  one  of  which  must  hold  and  each  of  which  leads  to  the  desired  conclusion. 

PROBLEM  4.  Let  n  be  an  integer.  Prove  that  9n 2  +  3n  —  2  is  even. 

Solution.  Case  1:  n  is  even. 

The  product  of  an  even  integer  and  any  integer  is  even.  Since  n  is  even,  9 n2 
and  3 n  are  even  too.  Thus  9 n2  +  372—2  is  even  because  it  is  the  sum  of  three 
even  integers. 
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Case  2:  n  is  odd. 

The  product  of  odd  integers  is  odd.  In  this  case,  since  n  is  odd,  9 n2  and  3 n 
are  also  odd.  The  sum  of  two  odd  integers  is  even.  Thus  9 n2  +  3 n  is  even. 
The  sum  of  even  integers  is  even,  so  9 n2  +  3n  —  2  =  (9 n2  +  3 n)  +  (—2)  is 
even.  I 

In  the  index,  we  guide  you  to  other  places  in  this  book  where  we  give  proofs 
by  cases.  (See  cases.) 


Prove  the  Contrapositive 


A  very  important  principle  of  logic,  foreshadowed  by  Pause  5,  is  summarized  in 
the  next  theorem. 


“A  — >•  3’’  is  true  if  and  only  if  its  contrapositive  “->3  -*  ->A”  is  true. 


Proof  “A  3"  is  false  if  and  only  if  A  is  true  and  3  is  false;  that  is,  if  and  only  if 
->A  is  false  and  -'3  is  true;  that  is,  if  and  only  if  “->3  — >  ->A”  is  false.  Thus  the 
two  statements  “ A  —*■  3 ”  and  “—*3  — ►  —A”  are  false  together  (and  hence  true 
together);  that  is,  they  have  the  same  truth  values.  The  result  is  proved.  I 


PROBLEM  5.  If  the  average  of  four  different  integers  is  10,  prove  that  one  of 
the  integers  is  greater  than  1 1 . 

Solution.  Let  A  and  3  be  the  statements 

A:  “The  average  of  four  integers,  all  different,  is  10.” 

3:  “One  of  the  four  integers  is  greater  than  1 1.” 

We  are  asked  to  prove  the  truth  of  “A  ->  3.”  Instead,  we  prove  the  truth  of 
the  contrapositive,  "-’'B  — >  -’A”,  from  which  the  result  follows  by  Theorem  1.2.1. 

Call  the  given  integers  a ,  b ,  c ,  d.  If  3  is  false,  then  each  of  these  numbers  is 
at  most  1 1  and,  since  they  are  all  different,  the  biggest  value  for  a  +  b  +  c  +  d  is 

to 

1 1  +  10  +  9  +  8  =  38.  So  the  biggest  possible  average  would  be  which  is  less 
than  10,  so  A  is  false.  I 


Proof  by  Contradiction 

Sometimes  a  direct  proof  of  a  statement  A  seems  hopeless:  We  simply  do  not 
know  how  to  begin.  In  this  case,  we  can  sometimes  make  progress  by  assuming 
that  the  negation  of  A  is  true.  If  this  assumption  leads  to  a  statement  which  is 
obviously  false  (an  “absurdity”)  or  to  a  statement  which  contradicts  something 
else,  then  we  will  have  shown  that  ->A  is  false.  So,  A  must  be  true. 

PROBLEM  6.  Show  that  there  is  no  largest  integer. 

Solution.  Let  A  be  the  statement  “There  is  no  largest  integer.”  If  A  is  false, 
then  there  is  a  largest  integer  N.  This  is  absurd,  however,  because  N  +  1  is  an 
integer  larger  than  N.  Thus  ->A  is  false,  so  A  is  true.  I 
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Remember  that  rational  number  just  means  common  fraction,  the  quotient 
™  of  integers  m  and  n  with  n  0.  A  number  which  is  not  rational  is  called 
irrational. 


PROBLEM  7.  Suppose  that  a  is  a  nonzero  rational  number  and  that  b  is  an 
irrational  number.  Prove  that  ab  is  irrational. 

Solution.  We  are  asked  to  prove  the  truth  of  the  implication  “A  -*■  B,”  where 
A  and  B  are  the  statements 

A:  “ a  is  a  nonzero  rational  and  b  is  irrational.” 

3:  “ab  is  irrational.” 

Suppose  that  this  implication  is  false  and  remember  that  this  occurs  only  when  A 
is  true  and  B  is  false;  that  is,  a  is  rational,  b  is  irrational,  and  ab  is  rational.  So 
we  know  that  ab  =  ^  for  integers  m  and  n,  n  /  0  and  a  =  |  for  integers  k  and 
l,  l  7^0  and  k  ^  0  (because  a  ^  0).  Thus, 

m  ml 
na  nk 

with  nk  f  0,  so  h  is  rational.  This  contradicts  the  fact  that  b  is  irrational.  So 
“A  -r  B”  is  true.  I 


Here  is  a  well-known  but  nonetheless  beautiful  example  of  a  proof  by  con¬ 
tradiction. 

PROBLEM  8.  Prove  that  V2  is  an  irrational  number. 

Solution.  If  the  statement  is  false,  then  there  exist  integers  m  and  n  such  that 
V2  =  j.  If  both  m  and  n  are  even,  we  can  cancel  2’s  in  numerator  and  denomi¬ 
nator  until  at  least  one  of  them  is  odd.  Thus,  without  loss  of  generality,  we  may 
assume  that  not  both  m  and  n  are  even. 

Square  both  sides  of  y/2  =  We  get  m2  =  2 n2,  so  m2  is  even.  Since  the 
square  of  an  odd  integer  is  odd,  m  must  be  even.  Since  not  both  m  and  n  are 
even,  it  follows  that  n  must  be  odd.  Write  m  =  2k  and  n  =  21  +  1.  Then  from 
m2  =  2 n2  we  get 

4k2  =  2(4f2  +  M  +  1)  =  U1  +  U  +  2 

and,  dividing  by  2, 

2k2  =4l2  +  4l  +  1. 

This  is  an  absurdity  because  the  left  side  is  even  and  the  right  is  odd.  I 


Answers  to  Pauses 


8.  “A  matrix  A  has  determinant  1  — >  A  is  invertible.” 
Hypothesis:  A  matrix  A  has  determinant  1. 
Conclusion:  A  is  invertible. 


The  invertibility  of  a  matrix  is  a  necessary  condition  for  its  determinant  being 
equal  to  1;  determinant  1  is  a  sufficient  condition  for  the  invertibility  of  a 
matrix. 
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9.  The  contrapositive  is  “a1 2 3 4 5  >  1  — >  (a  <  0  or  a  >  1).”  This  is  true,  and  here  is 
a  proof. 

Assume  x2  >  1.  If  a  <  0,  we  have  the  desired  result,  so  assume  a  >  0.  In 
that  case,  if  a  <  1 ,  we  have  seen  that  x2  <  1 ,  which  is  not  true,  so  we  must 
have  a  >  1,  again  as  desired. 

10.  The  converse  is  “An  invertible  matrix  has  determinant  1.”  This  is  false:  for 
example,  the  matrix  A  =  [  ®  ^  ]  is  invertible  (the  inverse  of  A  is  A  itself),  but 
det  A  —  -1. 

11.  Assume  that  a2  +  y2  =0.  Since  the  square  of  a  real  number  cannot  be 
negative  and  the  square  of  a  nonzero  real  number  is  positive,  if  either  a2  jz  0 
or  y 2  7^  0,  the  sum  a2  +  v2  would  be  positive,  which  is  not  true.  This  means 
a2  =  0  and  y2  =  0,  so  a  =  0  and  y  =  0,  as  desired. 

12.  This  statement  is  true.  To  prove  it,  we  must  show  that  two  implications  are 
true. 

(— »)  First  assume  that  —  1  <  a  <  1.  If  0  <  a  <  1,  then  we  saw  in  the  text 
that  a2  <  1  while,  if  a  =  0,  clearly  a2  =  0  <  1.  If  —1  <  a  <0,  then 
0  <  — a  <  1  (multiplying  an  inequality  by  a  negative  number  reverses  it)  so, 
by  the  argument  in  the  text  (—a)2  <  1,  that  is,  a2  <  1.  In  all  cases,  we  have 
a2  <  1.  Thus  —  1  <  a  <  1  a2  <  1  is  true. 

(«— )  Next  we  prove  that  a2  <  1  — ►  —  1  <  a  <  1  is  true.  Assume  a2  <  1.  If 

a  >  1,  then  we  would  also  have  a2=aa>a-1=a>1,soa2>1,  which 
is  not  true.  If  we  had  a  <  — 1,  then  — a  >  1  and  so  a2  =  (—a)2  >  1,  which, 

again,  is  not  true.  We  conclude  that  —  1  <  a  <  1,  as  desired. 


EXERCISES 


The  symbol  IBB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 

You  are  urged  to  read  the  final  paragraph  of  Section  1.1 — What  may  I  assume? — before  attempting  these  exercises. 


1.  What  is  the  hypothesis  and  what  is  the  conclusion  in 
each  of  the  following  implications? 

(a)  [BB]  The  sum  of  two  positive  numbers  is  positive. 

(b)  The  square  of  the  length  of  the  hypotenuse  of  a 
right-angled  triangle  is  the  sum  of  the  squares  of 
the  lengths  of  the  other  two  sides. 

(c)  All  primes  are  even. 

2.  [BB]  Determine  whether  or  not  the  following  implica¬ 
tion  is  true. 

“a  is  an  even  integer  -»  a  +  2  is  an  even  integer.” 

3.  State  the  converse  of  the  implication  in  Exercise  2  and 
determine  whether  or  not  it  is  true. 

4.  Answer  Exercise  2  with  replaced  by  •**.  [Hint:  Exer¬ 
cises  2  and  3.] 

5.  Let  n  be  an  integer  greater  than  1  and  consider  the  state¬ 
ment  “A:  2"  —  1  prime  is  necessary  for  n  to  be  prime.” 


(a)  Write  A  as  an  implication. 

(b)  Write  A  in  the  form  “p  is  sufficient  for  q.” 

(c)  Write  the  converse  of  A  as  an  implication. 

(d)  Determine  whether  the  converse  of  A  is  true  or 
false. 

6.  [BB]  A  theorem  in  calculus  states  that  every  differen¬ 
tiable  function  is  continuous.  State  the  converse  of  this 
theorem. 

(For  students  who  have  taken  calculus)  Is  the  converse 
true  or  false?  Explain. 

7.  Let  n  be  an  integer,  n  >  3.  A  certain  mathematical 

theorem  asserts  that  n  statements  ...  ,  An  are 

equivalent. 

(a)  A  student  proves  this  by  showing  that  A\  o  Aj, 
A.2  A3, . . .  ,  A„^i  -o-  A„  are  all  true.  How  many 

implication  proofs  did  the  student  write  down? 
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(b)  Another  student  proves  the  truth  of  A\  — >  A2, 
A2  — >  ./I ,  ...  ,  An—i  — A n,  and  An  — >  ,/l  1 . 
How  many  implication  proofs  did  this  student  write 
down? 

(c)  A  third  student  wishes  to  find  a  proof  which  is  dif¬ 
ferent  from  that  in  7(b)  but  uses  the  same  number 
of  implication  proofs  as  in  7(b).  Outline  a  possible 
proof  for  this  student. 

The  next  three  exercises  illustrate  that  the  position  of  a 
quantifier  is  very  important. 

8.  [BB]  Consider  the  assertions 

A:  “For  every  real  number  x,  there  exists  an  integer  n 
such  that  n  <  x  <  n  +  1 .” 

23:  “There  exists  an  integer  n  such  that  n  <  x  <  n  +  1 
for  every  real  number  x." 

One  of  these  assertions  is  true.  The  other  is  false.  Which 
is  which?  Explain. 

9.  Answer  Exercise  8  with  A  and  23  as  follows. 

A:  “There  exists  a  real  number  y  such  that  y  >  x  for 
every  real  number  x.” 

23:  “For  every  real  number  x,  there  exists  a  real  number 
y  such  that  y  >  x.” 

10.  Answer  true  or  false  and  supply  a  direct  proof  or  a  coun¬ 
terexample  to  each  of  the  following  assertions. 

(a)  There  exists  an  integer  n  such  that  nq  is  an  integer 
for  every  rational  number  q. 

(b)  For  every  rational  number  q.  there  exists  an  integer 
n  such  that  nq  is  an  integer. 

11.  (a)  [BB]  Let  a  be  an  integer.  Show  that  either  a  or  a+  1 

is  even. 

(b)  [BB]  Show  that  n2  +n  is  even  for  any  integer  n. 

12.  Provide  a  direct  proof  that  n2  —  n  +  5  is  odd,  for  all 
integers  n. 

13.  [BB]  Prove  that  2x2  —  4,v  +  3  >  0  for  any  real  number  x. 

14.  Let  a  and  b  be  integers.  By  examining  the  four  cases 

i .  a.  b  both  even, 

ii.  a,  b  both  odd, 

iii.  a  even,  b  odd, 

iv.  a  odd,  b  even. 

find  a  necessary  and  sufficient  condition  for  a 2  —  b2  to 
be  odd. 

15.  [BB]  Let  n  be  an  integer.  Prove  that  n2  is  even  if  and 
only  if  n  is  even. 

16.  Let  x  be  a  real  number.  Find  a  necessary  and  sufficient 

I 

condition  for  x  4 —  >  2.  Prove  your  answer. 


17.  [BB]  Prove  that  if  n  is  an  odd  integer  then  there  is  an 
integer  m  such  that  n  =  4 m  +  1  or  n  =  4 m  +  3.  [Hint: 
Consider  a  proof  by  cases.] 

18.  Prove  that  if  n  is  an  odd  integer,  there  is  an  integer  m 
such  that  n  =  8 m  +1  or  n  =  8m  +  3  or  n  =  8 m  +  5  or 
n  —  8 m  +  7.  (You  may  use  the  result  of  Exercise  17.) 

19.  Prove  that  there  exists  no  smallest  positive  real  number. 
| Hint :  Find  a  proof  by  contradiction.] 

20.  [BB]  (For  students  who  have  studied  linear  algebra) 
Suppose  0  is  an  eigenvalue  of  a  matrix  A.  Prove  that 
A  is  not  invertible.  [Hint:  There  is  a  short  proof  by 
contradiction.] 

21.  Let  n  =  ab  be  the  product  of  positive  integers  a  and  b. 
Prove  that  either  a  <  J n  or  b  <  Tt . 

22.  [BB]  Suppose  a  and  b  are  integers  such  that  a+b+ab  = 
0.  Prove  that  a=b=0  or  a=b  =  —2.  Give  a  direct 
proof. 

23.  Suppose  that  a  is  a  rational  number  and  that  b  is  an 
irrational  number.  Prove  that  a  +  b  is  irrational. 

24.  [BB]  Prove  that  the  equations 

2x  +  3y  —  z  =  5 
x  -  2y  +  3z  =  7 
jc  +  5y  -  4z  =  0 

have  no  solution.  (Give  a  proof  by  contradiction.) 

25.  Find  a  proof  or  exhibit  a  counterexample  to  each  of  the 
following  statements. 

(a)  [BB]  2x2  +  3y2  >  0  for  all  real  numbers  x  and  y. 

(b)  a  an  even  integer  — »  \a  an  even  integer. 

(c)  [BB]  For  each  real  number  x,  there  exists  a  real 
number  y  such  that  xy  =  1 . 

(d)  If  a  and  b  are  real  numbers  with  a+b  rational,  then 
a  and  b  are  rational. 

(e)  [BB]  a  and  b  real  numbers  with  ab  rational  — »  a 
and  b  rational. 

26.  Suppose  ABC  and  A'B'C1  are  triangles  with  pairwise 
equal  angles;  that  is,  LA  =  LA',  LB  =  LB',  and 
LC  =  LC' .  Then  it  is  a  well-known  result  in  Euclidean 
geometry  that  the  triangles  have  pairwise  proportional 
sides  (the  triangles  are  similar ).  Does  the  same  property 
hold  for  polygons  with  more  than  three  sides?  Give  a 
proof  or  provide  a  counterexample. 

27.  (a)  [BB]  Suppose  m  and  n  are  integers  such  that  n2  + 

1  =  2m.  Prove  that  m  is  the  sum  of  squares  of  two 
integers. 

(b)  [BB]  Given  that  (4373)2  +  1  =  2(9,561.565),  write 
9,561,565  as  the  sum  of  two  squares. 
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28.  Observe  that  for  any  real  number  a, 

4a4  -T  1  =  (2  a  “  T"  2a  I  1)(2a2  —  2a  -T  1). 

(a)  Use  this  identity  to  express  24,i+2  +  1  (n  a  positive 
integer)  and  218  +  1  as  the  product  of  two  integers 
each  greater  than  1 . 


(b)  Express  236  —  1  as  the  product  of  four  integers  each 
of  which  is  larger  than  1 . 

29.  Prove  that  one  of  the  digits  1,2 . 9  occurs  infinitely 

often  in  the  decimal  expansion  of  n . 

30.  Prove  that  there  exist  irrational  numbers  a  and  b  such 
that  ab  is  rational. 


1.3  Truth  Tables 

In  previous  sections,  we  presented  a  rather  informal  introduction  to  some  of  the 
basic  concepts  of  mathematical  logic.  There  are  times,  however,  when  a  more 
formal  approach  can  be  useful.  We  begin  to  look  at  such  an  approach  now. 

Let  p  and  q  be  statements.  For  us,  remember  that  statement  means  a  statement 
of  fact  which  is  either  true  or  false.  The  compound  statements  “p  or  q ”  and  “p 
and  q which  were  introduced  in  Section  1.1,  will  henceforth  be  written  “pvq” 
and  “p  A  q,”  respectively. 


p  V  q :  p  or  q 
p  A  q :  p  and  q . 

The  way  in  which  the  truth  values  of  these  compound  statements  depend  upon 
those  of  p  and  q  can  be  neatly  summarized  by  tables  called  truth  tables.  Truth 
tables  for  p  v  q  and  p  a  q  are  shown  in  Fig  1.2. 


p 

q 

pvq 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

F 

F 

P 

q 

pAq 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

F 

Figure  1 .2  Truth  tables  for  p  v  q  (p  or  q )  and  p  Aq  (p  and  q). 

In  each  case,  the  first  two  columns  show  all  possible  truth  values  for  p  and  q — 
each  is  either  true  ( T )  or  false  (F) — and  the  third  column  shows  the  corresponding 
truth  value  for  the  compound  statement. 

The  truth  table  for  the  implication  p  — >•  q,  introduced  in  Section  1.1,  is  shown 
on  the  left  in  Fig  1.3.  On  the  right,  we  show  the  particularly  simple  truth  table 
for  “-■p,”  the  negation  of  p. 


P 

q 

p  q 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

T 

Figure  1.3 


Truth  tables  for  p  ->  q  ( p  implies  q)  and  ->/?  (not  p). 


18  Chapter  1  Yes,  There  Are  Proofs! 


Truth  tables  for  more  complicated  compound  statements  can  be  constructed 
using  the  truth  tables  we  have  seen  so  far.  For  example,  the  statement  “p  4*  q 
defined  in  Section  1.1  as  “(p  — ►  q)  and  ( q  —*■  p),”  is  “(p  ->■  q)  A  (q  —>■  p).” 
The  truth  values  for  p  —*■  q  and  q  p  are  shown  in  Fig  1.4.  Focusing  on 

columns  3  and  4  and  remembering  that  r  As  is  true  if  and  only  if  both  r  and  s 
are  true — see  the  truth  table  for  “A”  shown  in  Fig  1.2 — we  obtain  the  truth  table 
for  (p  — »  q)  A  (q  — »•  p),  that  is,  for  p  <+  q. 


p 

q 

q 

q  p 

(p  ^  q)  A(q  ->  p) 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

F 

T 

T 

F 

F 

F 

F 

T 

T 

T 

Figure  1.4  The  truth  table  for  p  <+  q,  p  if  and  only  if  q. 

It  is  the  first  two  columns  and  the  last  column  which  are  the  most  important, 
of  course,  so  in  future  applications,  we  remember  p  4*  q  with  the  simple  truth 
table  shown  in  Fig  1.5. 


P 

q 

p  +*  q 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

T 

Figure  1.5  The  truth  table  for  p  <-»  q. 

Here  is  another  demonstration  of  how  to  analyze  complex  compound  state¬ 
ments  with  truth  tables. 


EXAMPLE  9  Suppose  we  want  the  truth  table  for  p  -+  ->(q  V  p). 


P 

q 

?vp 

(q  v  p) 

P  ->  ~‘{q  V  p) 

T 

T 

T 

F 

F 

T 

F 

T 

F 

F 

F 

T 

T 

F 

T 

F 

F 

F 

T 

T 

Although  the  answer  is  presented  as  a  single  truth  table,  the  procedure  is  to  con¬ 
struct  appropriate  columns  one  by  one  until  the  answer  is  reached.  Here,  columns 
1  and  2  are  used  to  form  column  3 — q  V  p.  Then  column  4  follows  from  column 
3  and,  finally,  columns  1  and  4  are  used  to  construct  column  5,  using  the  truth 
table  for  an  implication  in  Fig  1.3.  A 

When  three  statements  p,  q ,  and  r  are  involved,  eight  rows  are  required  in  a 
truth  table  since  it  is  necessary  to  consider  the  two  possible  truth  values  for  r  for 
each  of  the  four  possible  truth  values  of  p  and  q. 
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PROBLEM  10.  Construct  a  truth  table  for  (pv  q)  [((->/?)  A  r)  —*■  (q  A  r)]. 

Solution. 


p 

q 

r 

- P 

(-‘P)  A  r 

q  A  r 

((— ’P)  Ar)->(?Ar) 

pvq 

T 

T 

T 

F 

F 

T 

T 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

F 

T 

F 

F 

F 

T 

T 

T 

F 

F 

F 

F 

F 

T 

T 

F 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

T 

F 

F 

T 

T 

F 

F 

T 

T 

T 

F 

F 

F 

F 

F 

F 

T 

F 

F 

T 

F 

(pVq)  +*  [((—■/?)  A  r)  -»  (q  A  r)] 

T 

T 

T 

T 

T 

T 

T 


Of  course,  it  is  only  necessary  to  construct  an  entire  truth  table  if  a  complete 
analysis  of  a  certain  compound  statement  is  desired.  We  do  not  need  to  construct 
all  32  rows  of  a  truth  table  to  do  the  next  problem. 


PROBLEM  11.  Find  the  truth  value  of 

[p  ((q  A  (—■/-))  V  5)]  a  [(— -f )  <+  (s  A  r )], 
where  p,  q,  r,  and  s  are  all  true  while  t  is  false. 

Solution.  We  evaluate  the  expression  step  by  step,  showing  just  the  relevant  row 
of  the  truth  table. 


P 

q 

r 

s 

t 

— «r 

q  A  (—>/') 

(q  A  {-')■))  V  s 

T 

T 

T 

T 

F 

F 

F 

T 

p  -»•  [(q  a  (—■/-))  v  s] 

->t 

s  A  r 

(—>t)  e(sAr) 

T 

T 

T 

T 

[P  ->•  [((<7  A  C-1''))  v £)]  A  ^  O  A  r>l 

T 


The  truth  value  is  true. 


I 
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EXAMPLE  13 


1.3.1  DEFINITIONS 


^hszxei 


Answers  to  Pauses 


A  notion  that  will  be  important  in  later  sections  is  that  of  logical  equivalence. 
Formally,  statements  A  and  13  are  logically  equivalent  if  they  have  identical  truth 
tables. 


PROBLEM  12.  Show  that  A :  p  ->•  ( — ■<?)”  and  “13 :  ->(p  a  q )”  are  logically 
equivalent. 

Solution.  We  simply  observe  that  the  final  columns  of  the  two  truth  tables  are 
identical. 


p 

<7 

^q 

P  i^q) 

T 

T 

F 

F 

T 

F 

T 

T 

F 

T 

F 

T 

F 

F 

T 

T 

P 

q 

p  Aq 

(P  A  q) 

T 

T 

T 

F 

T 

F 

F 

T 

F 

T 

F 

T 

F 

F 

F 

T 

In  Section  1.1,  we  defined  the  contrapositive  of  the  statement  "p  ->  q"  as  the 
statement  “(-></)  — >  (->/>).”  In  Theorem  1.2.1,  we  proved  that  these  implications 
are  logically  equivalent  without  actually  introducing  the  terminology.  Here  is  how 
to  establish  the  same  result  using  truth  tables. 


P 

q 

^q 

- P 

(-’<?)  (~'P) 

T 

r 

F 

F 

T 

T 

F 

T 

F 

F 

F 

T 

F 

T 

T 

F 

F 

T 

T 

T 

P 

q 

p  q 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

A  compound  statement  that  is  always  true,  regardless  of  the  truth  values  assigned 
to  its  variables,  is  a  tautology.  A  compound  statement  that  is  always  false  is  a 
contradiction. 

Show  that  “(p  A  q)  —>  (pv  q )”  is  a  tautology,  while  “(( — ■  /?)  A  q)  A  (p  V  {—>q))" 
is  a  contradiction. 

The  truth  table  for  p  -o-  q  appears  in  Fig  1.5.  From  this,  we  see  immediately 
that  two  statements  A  and  13  are  logically  equivalent  precisely  when  the  statement 
A  13  is  a  tautology. 


P 

q 

p  Aq 

pvq 

(p  Aq)  -*  (pvq) 

T 

T 

T 

T 

T 

T 

F 

F 

T 

T 

F 

T 

F 

T 

T 

F 

F 

F 

F 

T 

The  final  column  shows  that  (p  A  q)  ->  (p  v  q)  is  true  for  all  values  of  p 
and  q,  so  this  statement  is  a  tautology. 
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p 

<7 

(—■/»)  a  q 

-^q 

P  V  (-‘q) 

> 

a. 

< 

< 

r 

T 

T 

F 

F 

F 

T 

F 

T 

F 

F 

F 

T 

T 

F 

F 

T 

T 

T 

F 

F 

F 

F 

F 

T 

F 

T 

T 

F 

The  final  column  shows  that  {(~'p)  A  q)  A  (p  v  (-ig))  is  false  for  all  values 
of  p  and  q,  so  this  statement  is  a  contradiction. 


EXERCISES 


The  symbol  [ BB ]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Construct  a  truth  table  for  each  of  the  following  com¬ 
pound  statements. 

(a)  [BB]  p  A  ((-?)  v  p) 

(b)  (p  Aq)  V  ((—■/?)  -+  q) 

(c)  — ■  (p  A  (q  V  p))  p 

(d)  [BB]  (-  (p  V  (-9)))  A  ((-v>)  vr) 

(e)  (p  ->■  (q  ->  ;•))  ((/?  A  q)  V  r) 

2.  (a)  If  p  —»■  q  is  false,  determine  the  truth  value  of 

(p  A  (—■<?))  v  ((-■/>)  -»■  <?). 

(b)  Is  it  possible  to  answer  2(a)  if  p  —*■  q  is  true  instead 
of  false?  Why  or  why  not? 

3.  [BB]  Determine  the  truth  value  for 

Ip  —*  (q  A  (--/•))]  v  [/•  <->■  ((-■«)  v  q)} 
when  p,  q,  r  and  .v  are  all  true. 

4.  Repeat  Exercise  3  in  the  case  where  p.  q,  r,  and  s  are 
all  false. 

5.  (a)  Show  that  q  (p  q)  is  a  tautology. 

(b)  Show  that  [pAq]  a[(->/?)v  (-■</)]  is  a  contradiction. 

6.  (a)  [BB]  Show  that  [{p  —>  q)  A  (q  —*■  r)]  ->  (p  —*■  r) 

is  a  tautology. 

(b)  [BB]  Explain  in  plain  English  why  the  answer  to 
6(a)  makes  sense. 


7.  Show  that  the  statement 

[p  v  ((— -r)  -►  (-■$))]  v  [(s  -»  ((-T)  v  p)) 

V  ((- q )  -*  r)] 

is  neither  a  tautology  nor  a  contradiction. 

8.  Given  that  the  compound  statement  A  is  a  contradiction, 
establish  each  of  the  following. 

(a)  [BB]  If  3  is  any  statement,  A  -*■  3  is  a  tautology. 

(b)  If  3  is  a  tautology,  3  ->  A  is  a  contradiction. 

9.  (a)  Show  that  the  statement  p  -*■  [q  -*■  r)  is  not  logi¬ 

cally  equivalent  to  the  statement  (p  —>■  q)  —>  r. 

(b)  What  can  you  conclude  from  9(a)  about  the 
compound  statement  [p  ->  (q  — ►  r )]  <->  [(p  —* 
q)  -*•  r]? 

10.  If  p  and  q  are  statements,  then  the  compound  statement 
p  v  q  (often  called  the  exclusive  or)  is  defined  to  be 
true  if  and  only  if  exactly  one  of  p,  q  is  true;  that  is, 
either  p  is  true  or  q  is  true,  but  not  both  p  and  q  are 
true. 

(a)  [BB]  Construct  a  truth  table  for  p  v_q. 

(b)  Construct  a  truth  table  for  (p  v  ((— -p)  A  q))  V  q. 

(c)  [BB]  Show  that  (p  V  q)  ->  (p  V  q)  is  a  tautology. 

(d)  Show  that  p  v  q  is  logically  equivalent  to 

-'{p  q)- 


1.4  The  Algebra  of  Propositions 

In  Section  1.3,  we  discussed  the  notion  of  logical  equivalence  and  noted  that  two 
statements  A  and  23  are  logically  equivalent  precisely  when  the  statement  A  ■ o-  23 
is  a  tautology. 

More  informally,  when  statements  A  and  23  are  logically  equivalent,  we  often 
think  of  statement  23  as  just  a  rewording  of  statement  A.  Clearly  then,  it  is  of 
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interest  to  be  able  to  determine  in  an  efficient  manner  when  two  statements  are 
logically  equivalent  and  when  they  are  not.  Truth  tables  will  do  this  job  for  us, 
but,  as  you  may  already  have  noticed,  they  can  become  cumbersome  rather  easily. 
Another  approach  is  first  to  gather  together  some  of  the  fundamental  examples  of 
logically  equivalent  statements  and  then  to  analyze  more  complicated  situations 
by  showing  how  they  reduce  to  these  basic  examples.  Henceforth,  we  shall  write 
A  <==>  '23  to  denote  the  fact  that  A  and  23  are  logically  equivalent. 

The  word  “proposition”  is  a  synonym  for  “(mathematical)  statement.”  Just 
as  there  are  rules  for  addition  and  multiplication  of  real  numbers — commutativity 
and  associativity,  for  instance — there  are  properties  of  A  and  V  which  are  helpful 
in  recognizing  that  a  given  compound  statement  is  logically  equivalent  to  another, 
often  more  simple,  one. 

Some  Basic  Logical  Equivalences 

1.  Idempotence: 

(i)  (p  v  p )  p 

(ii)  (p  A  p)  p 

2.  Commutativity: 

(i)  (p  V  q)  ( q  V  p) 

(ii)  (p  A  q)  (q  a  p) 

3.  Associativity: 

(i)  ((pv<7)vr)  <=►  (pv(qvr)) 

(ii)  ((pAq)Ar)  <=>  {p  A  (q  A  r)) 

4.  Distributivity: 

(i)  (pv(^Ar))  <=>  ((p  v  q)  A  (p  v  r)) 

(ii)  (pA(gVr))  <=>■  {(p  A  q)  V  (p  A  r)) 

5.  Double  Negation:  ->  (->/?)  «=>  p 

6.  De  Morgan’s  Laws: 

(i)  -•  (/?  v  q)  ((-> p )  A  (—><?)) 

(ii)  ->  (p  A  q)  <=»  ((->/?)  v  (->q) 

Property  6  was  discussed  in  a  less  formal  manner  in  Section  1.1. 

It  is  clear  that  any  two  tautologies  are  logically  equivalent  and  that  any  two 
contradictions  are  logically  equivalent.  Letting  1  denote  a  tautology  and  0  a  con¬ 
tradiction,  we  can  add  the  following  properties  to  our  list. 

7.  (i)  (pvl)  <=►  1 

(ii)  (p  A  1)  p 

8.  (i)  (pvO)  <=>  p 

(ii)  (pAO)  «=>  0 

9.  (i)  (p  v  (  'p) )  1 

(ii)  (p  A  (-■/?))  0 
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10.  (i)  -1  0 

(ii)  -.0  «=»  1 

We  can  also  add  three  more  properties  to  our  list. 

11.  (p^q)  <=>  [(- q )  -*  (-/>)] 

12.  ( p  q)  <=>  [(p  q)  A  (q  -a  p)} 

13.  (p  q)  <==$  [ (~'P)Vq] 

Property  11  simply  restates  the  fact,  proven  in  Theorem  1.2.1,  that  a  statement 
and  its  contrapositive  are  logically  equivalent.  The  definition  of  “4>”  gives  Prop¬ 
erty  12  immediately.  Property  13  shows  that  any  implication  is  logically  equiva¬ 
lent  to  a  statement  which  does  not  use  the  symbol  — 


Given  the  statement  p  —*■  q,  show  that  its  converse,  q  —*■  p,  and  its  inverse, 
t( — ■/?)  -*■  (_,g)],  are  logically  equivalent. 


Show  that  [-*(p  q )]  [(p  A  (-’<?))  V  (q  A  (—■/»))]. 


PROBLEM  14.  Show  that  (->/?)  -*■  (p  -»  q)  is  a  tautology. 
Solution.  Using  Property  13,  we  have 


[(—■/?)  (P 


<?)]  <*=>  [(—■/?)  ->  ((—■/?)  v  <?)] 

<=>  [H-p))v((-p)v?)] 
4=*  p  V  [(-ip)  v  q] 

«=>■  [fVU/))]Vg  <=>•  lv<7 


1  I 


In  the  exercises,  we  ask  you  to  verify  all  the  properties  of  logical  equivalence 
which  we  have  stated.  Some  are  absurdly  simple.  For  example,  to  see  that  (p  v 
p)  •<=>  p,  we  need  only  observe  that  p  and  p  V  p  have  the  same  truth  tables: 


p 

pv  p 

T 

T 

F 

F 

Others  require  more  work.  To  verify  the  second  distributive  property,  for  example, 
we  would  construct  two  truth  tables. 


P 

q 

r 

q  V  r 

P  A  (q  V  r) 

T 

T 

T 

T 

T 

T 

T 

F 

T 

T 

T 

F 

T 

T 

T 

T 

F 

F 

F 

F 

F 

T 

T 

T 

F 

F 

T 

F 

T 

F 

F 

F 

T 

T 

F 

F 

F 

F 

F 

F 
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p 

q 

r 

pAq 

p  A  r 

(P  A  q)  v  (p  A  r ) 

T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

T 

F 

T 

T 

T 

F 

F 

F 

F 

F 

F 

T 

T 

F 

F 

F 

F 

T 

F 

F 

F 

F 

F 

F 

T 

F 

F 

F 

F 

F 

F 

F 

F 

F 

PROBLEM  15.  Express  the  statement  (~<(p  V  q))  v  (( ->p )  A  q)  in  simplest 
possible  form. 

Solution.  De  Morgan’s  first  law  says  that  —>  (pv  q)  5=4  (->p)  A  (~iq).  Thus 

(*)  [(~ 1  (p  V  q))  V  ((—■/>)  A  q)]  5=4  [((—■/>)  A  (—■<7))  V  ((~>p)  A  <?)], 

since  adjoining  the  same  term  to  statements  which  have  the  same  truth  tables  will 
produce  statements  with  the  same  truth  values.  Using  distributivity,  the  right  side 
of  (*)  is 

[((—■/>)  a  (—<?))  V  ((-'P)  A  q)]  5=4>  I(-'P)  A  ((^q)  V  q)] 

4=4>  [(_,P)  A  1]  5=4>  ->p, 

so  the  given  statement  is  logically  equivalent  simply  to  -•p.  I 


In  the  solution  to  Problem  15,  we  said  “adjoining  the  same  term  to  statements 
which  have  the  same  truth  tables  will  produce  statements  with  the  same  truth 
values.”  Make  this  claim  into  a  formal  assertion  and  verify  it. 

In  Problems  14  and  15  we  used,  in  a  sneaky  way,  a  very  important  principle 
of  logic,  which  we  now  state  as  a  theorem.  The  fact  that  we  didn't  really  think 
about  this  at  the  time  tells  us  that  the  theorem  is  easily  understandable  and  quite 
painless  to  apply  in  practice. 


1.4.1  THEOREM 


Suppose  A  and  3  are  logically  equivalent  statements  involving  variables 

pi ,  p2 . pn ■  Suppose  that  61,62,...  ,  6„  are  statements.  If,  in  A  and  3,  we 

replace  p\  by  6| ,  pi  by  62  and  so  on  until  we  replace  pn  by  6„,  then  the  resulting 
statements  will  still  be  logically  equivalent. 


Explain  how  Theorem  1.4.1  was  used  in  Problem  15. 


PROBLEM  16.  Show  that 

[(p  V  q)  V  ((q  v  (— >r ))  A  (p  V  r))J  5=4  -■  [(—■/?)  A  (-■<?)]. 

Solution.  The  left  hand  side  is  logically  equivalent  to 

[(p  V  q)v  (q  V  (— -r))]  A  [(p  V  q)  V  (p  V  r)] 
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by  distributivity.  Associativity  and  idempotence,  however,  say  that 
[( P  V  <?)  V  (qv  (--r))]  «=>  [/>v(gv(?v  (-v)))] 

<=>•  [ P  V  ((q  Vq)V  (-'/-))] 

[pv(?v  (t))]  «=>  [( p  vq)w  (->r)], 

while  we  also  have 

lip  V  4)  v  (p  V  r)]  \(q  V  p)  v  (p  V  r)] 

[<?  v  (p  v  (p  v  r ))] 

[?v((pvp)vr)] 

[^v(pv  r )] 

«=>  [(?vp)vr]  [(pv^)vr]. 

Hence  our  expression  is  logically  equivalent  to 

[((p  V  q)  V  (-.r))  A  ((p  V  q)  V  r)]  [(p  V  q)  v  ((->r)  A  r)] 

<t=»  [(pv?)v0]  <=>  p  v  q . 

Finally,  with  the  help  of  double  negation  and  one  of  the  laws  of  De  Morgan,  we 
obtain 

pvq  «==>•  — '  1  (p  V  q))  4=»  1  [ (  1  p )  A  (-’<?)].  1 

The  next  problem  illustrates  clearly  why  employing  the  basic  logical  equiva¬ 
lences  discussed  in  this  section  is  often  more  efficient  than  working  simply  with 
truth  tables. 

PROBLEM  17.  Show  that  [x  -*  (((->p)A^) Ar)]  <=>■  -,[(pv(-’  (<?  Ar)))As]. 

Solution. 

M(((-,p)A?)Ar)]  <=>  [(->5)  V  (((— >p)  A  q)  A  r)] 

«=>■  [(— ’-y)  V  ((— *p)  A  (q  A  r))] 

[(—•■*’)  V  (— '(P  V  (-'(q  A  r))))] 

— '[(^  A  (p  V  (— >  {q  A  r))))] 
-[((pV(-.((jiAr)))As)].  I 

A  primary  application  of  the  work  in  this  section  is  reducing  statements  to 
logically  equivalent  simpler  forms.  There  are  times,  however,  when  a  different 
type  of  logically  equivalent  statement  is  required. 
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1.4.2  DEFINITION 


Let  n  >  1  be  an  integer.  A  compound  statement  based  on  variables  x\,x2, . . .  ,  xn 
is  said  to  be  in  disjunctive  normal  form  if  it  looks  like 


(an  Aa)2  a  •  •  •  Aain)  v  (a2i  A  a22  A  •  •  •  A  a2n)  V  •  •  •  v  (ami  A  am2  A  •  •  •  A  amn) 


where,  for  each  i  and  j,l<i<m,l<j<n,  either  an  —  xj  or  atj  =  — >jc j  and 
all  minterms  a ,•  [  A  al2  A  •  •  •  A  are  distinct. 


EXAMPLE  18  The  statement  (xi  A  jc2  A  JC3)  v  (jci  A  (-oc2)  A  (-VC3))  is  in  disjunctive  normal  form 
(on  the  variables  x\,  x2,  x2).  a 

EXAMPLE  19  The  statement  ((p  Aq)  Vr)  A  ({p  Aq)  v  (->«?))  is  not  in  disjunctive  normal  form. 

One  reason  is  that  the  minterms,  (p  A  q)  v  /•  and  {p  A  q)  v  (--9),  involve  the 
symbol  V.  This  statement  is  logically  equivalent  to  (p  A  q)  V  {r  A  (-•q)),  which 
is  still  not  in  disjunctive  normal  form  because  the  minterms,  p  Aq  and  r  A  (^q), 
don’t  contain  all  the  variables.  Continuing,  however,  our  statement  is  logically 
equivalent  to 

(p  A  q  A  r)  v  (p  a  q  A  (— >r))  V  (p  A  r  A  (->(?))  V  ((—•/?)  A  r  A  ( — ■<?)), 
which  is  in  disjunctive  normal  form  (on  the  variables  p,  q,  r).  A 

As  shown  in  Example  19,  when  writing  a  statement  in  disjunctive  normal 
form,  it  is  very  useful  to  note  that 

(1)  .v  <=>  [(.v  A  y)  V  (.v  A  ( — ■  y))] 

for  any  statements  .r  and  y.  This  follows  from 

X  {x  A  1)  [x  A  (y  V  (— ’ v))]  [(*  A  y)  V  (x  A  (—>>?))]. 

PROBLEM  20.  Express  p  -»  (q  A  r)  in  disjunctive  normal  form. 

Solution.  Method  1:  We  construct  a  truth  table. 


p 

<7 

r 

q  A  r 

\  p  ^  (q  A  r) 

T 

T 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

T 

F 

F 

F 

T 

T 

T 

T 

T 

F 

F 

F 

F 

F 

T 

F 

F 

T 

F 

F 

T 

F 

T 

F 

F 

F 

F 

T 

Now  focus  attention  on  the  rows  for  which  the  statement  is  true — each  of 
these  will  contribute  a  minterm  to  our  answer.  For  example,  in  row  1,  P,  q,  and 
r  are  all  T,  so  p  A  q  A  r  agrees  with  the  T  in  the  last  column.  In  row  4,  p  is  F, 
while  q  and  r  are  both  7  .  This  gives  the  minterm  (—>/?)  A  q  A  r.  In  this  way,  we 
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obtain 

(P  A  q  A  r)  V  ((—■/?)  A  q  A  r)  V  ((->p)  A  q  A  (--r)) 

V  ((-'/?)  A  (-<7)  A  r)  v  ((—*/?)  A  (-^)  A  (->r )). 


Method  2:  We  have 
[p  — ^  A  r)] 

[(—■/»)  v  (?Ar)] 

<=>  [((—■/?)  A  q)y  ((-•p)  A  (-■<?))  V  (q  A  r)] 

<==>  [((— '/?)  A  q  A  r)  V  ((—<p)  A  q  A  (— T ))  V  ((-'p)A(^)Ar) 

V  ((—■/?)  A  (-’<?)  A  (-t))  V  (p  A  q  A  r)  V  ((->p)  A  A  r)] 

<=f[((-,/’)A?Ar)  V  ((— ’p)  A  q  A  (— >r ))  V  ((— ’p)  A  (—•q)  A  r) 

V  ((->p)  A  (-■<?)  A  (->r))  V  (p  A  q  A  r)], 

omitting  the  second  occurrence  of  (—>p)  A  q  A  r  at  the  last  step.  We  leave  it  to 
you  to  decide  for  yourself  which  method  you  prefer.  I 


Answers  to  Pauses 


Disjunctive  normal  form  is  useful  in  applications  of  logic  to  computer  science, 
particularly  in  the  construction  of  designs  for  logic  circuits. 

14.  This  is  just  a  restatement  of  Property  11,  writing  p  instead  of  q  and  q  instead 
of  p. 

15.  This  could  be  done  with  truth  tables.  Alternatively,  we  note  that 


-•  ( p  q) 


[— >((P  ->  q)  A  {q  p))\ 

P)  V?)  A  ((-<?)  V  p)] 

“•  [((—■/?)  V  q)  V  (—■((—■<?)  V  p)>] 
I(p  A  (—■<3'))  v  (q  A  (— -p))]. 


16.  The  claim  is  that  if  A  4==>  23,  then  ( A  V  C)  •<==>■  (23  V  C)  for  any 

other  statement  6.  Since  A  23,  A  and  23  have  the  same  truth  tables. 

Establishing  (A  v  6)  «=>  (23  v  C)  requires  four  rows  of  a  truth  table. 


A 

23 

e 

.a  v  e 

b  v  e 

T 

T 

r 

T 

T 

T 

T 

F 

T 

T 

F 

F 

T 

T 

T 

F 

F 

F 

F 

The  last  two  columns  establish  our  claim.  A  similar  argument  shows  that  if 
A  23,  then  (A  a  23)  <t==>  (23  A  C)  for  any  statement  6. 

17.  In  applying  the  distributive  property,  we  are  using  ->p,  — ■ q  and  q  in  place  of 
p,  r/  and  r.  Also,  when  applying  Property  7,  we  use  —‘p  instead  of  p. 
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EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Verify  each  of  the  13  properties  of  logical  equiva-  5. 
lence  which  appear  in  this  section  [BB;  1,3, 5,7, 9, 

11,13], 

2.  Simplify  each  of  the  following  statements. 

(a)  [BB]  (pAq)V  (-((-p)  v  q)) 

(b)  (p  V  r)  — »•  [(q  V  (— -r))  ((-p)  -►  r)] 

(c)  [( p  -+  q)V  {q  ^  r)]  A  (r  s) 

3.  Using  truth  tables,  verify  the  following  absorption  prop¬ 
erties. 

(a)  [BB]  (pV(pAq))  <=>  p 

(b)  (pA(pVq))  <=>  p 

4.  Using  the  properties  in  the  text  together  with  the  absorp¬ 
tion  properties  given  in  Exercise  3,  establish  each  of  the 
following  logical  equivalences. 

(a)  [BB]  [(pV^)A(-p)]  <=>  [(-p)Ap] 

(b)  [p  -+  (q  r)]  [(p  A  (->r))  (-> q )] 

(c)  [~>(p  -w  q)]  «=»  [p  -w  (->?)] 

(d)  [BB]  -[(p  **  q)V(p  A  (-<?))]  <=►  [(p 

(—•«))  A  ((-■/»)  V  q)] 

(e)  [(p  A  (-<q))  A  ((p  A  (-'<?))  V  {q  A  (~>r )))] 

[p  A  (-<?)] 


Prove  that  the  statements  (p  A  (~^q))  — *  q  and  (p  A 
(-•q))  -a-  -<p  are  logically  equivalent.  What  simpler 
statement  is  logically  equivalent  to  both  of  them? 

In  Exercise  10  of  Section  1.3  we  defined  the  exclusive 
or  “p  v  q”  to  be  true  whenever  either  p  or  q  is  true, 
but  not  both.  For  each  of  the  properties  discussed  in 
this  section  (including  those  of  absorption  given  in 
Exercise  3)  determine  whether  or  not  the  property  holds 
with  v  replacing  v  wherever  it  occurs  [BB;  1,3,7,9,13]. 

7.  Which  of  the  following  are  in  disjunctive  normal  form 
(on  the  appropriate  set  of  variables)? 

(a)  (p  V  q)  A  ((— p)  v  (->?)) 

(b)  [BB]  (p  a  q)  v  ((— -p)  A  (-<?)) 

(c)  [BB]  p  v  ((->p)  A  q) 

(d)  (p  A  q)  V  ((— -p)  A  (rq)  A  r) 

(e)  (p  A  q  A  r)  V  ((— >p)  A  (-q)  A  (->/•)) 

8.  Express  each  of  the  following  statements  in  disjunctive 
normal  form. 

(a)  [BB]  p  A  q 

(b)  [BB]  (p  A  q)  V  (-((-p)  V 

(c)  p  ->  q 

(d)  (p  q)  A  (q  A  r ) 

(£3  9.  Find  out  what  you  can  about  Augustus  De  Morgan  and 
write  a  paragraph  or  two  about  him,  in  good  English,  of 
course! 


1.5  Logical  Arguments 

Proving  a  theorem  in  mathematics  involves  drawing  a  conclusion  from  some  given 
information.  The  steps  required  in  the  proof  generally  consist  of  showing  that  if 
certain  statements  are  true,  then  the  truth  of  other  statements  must  follow.  Taken 
in  its  entirety,  the  proof  of  a  theorem  demonstrates  that  if  an  initial  collection  of 
statements — called  premises  or  hypotheses — are  all  true,  then  the  conclusion  of 
the  theorem  is  also  true. 

Different  methods  of  proof  were  discussed  informally  in  Section  1.2.  Now 
we  relate  these  ideas  to  some  of  the  more  formal  concepts  introduced  in  Sections 
1.3  and  1.4.  First,  we  define  what  is  meant  by  a  valid  argument. 


1.5.1  DEFINITIONS 


An  argument  is  a  finite  collection  of  statements  A\,  A2,  ■  ■ .  ,An  called  premises 
(or  hypotheses )  followed  by  a  statement  23  called  the  conclusion.  Such  an  argument 
is  valid  if,  whenever  A\,  A2,  ...  ,  An  are  all  true,  then  23  is  also  true. 


1.5  Logical  Arguments  29 
It  is  often  convenient  to  write  an  elementary  argument  in  column  form,  like 

this. 

Ai 
■A  2 


PROBLEM  21.  Show  that  the  argument 

p  ->  -'q 

r  ->  q 

r 

-'P 

is  valid. 

Solution.  We  construct  a  truth  table. 


p 

<7 

r 

-’<? 

<3< 

r 

t 

a. 

r  q 

^P 

T 

T 

T 

F 

F 

T 

F 

T 

T 

F 

F 

F 

T 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

T 

F 

T 

T 

T 

T 

F 

F 

T 

T 

T 

F 

F 

T 

F 

F 

T 

T 

T 

F 

F 

T 

T 

T 

F 

T 

F 

F 

F 

T 

T 

T 

T 

Observe  that  row  4 — marked  with  the  star  (★} — is  the  only  row  where  the  premises 
p  ->q,  r  ->  q,  r  are  all  marked  T.  In  this  row,  the  conclusion  ->p  is  also  T . 
Thus  the  argument  is  valid.  I 

In  Problem  21,  we  were  a  bit  fortunate  because  there  was  only  one  row  where 
all  the  premises  were  marked  T.  In  general,  in  order  to  assert  that  an  argument  is 
valid  when  there  are  several  rows  with  all  premises  marked  T,  it  is  necessary  to 
check  that  the  conclusion  is  also  T  in  every  such  row. 

Arguments  can  be  shown  to  be  valid  without  the  construction  of  a  truth  table. 
For  example,  here  is  an  alternative  way  to  solve  Problem  21. 

Assume  that  all  premises  are  true.  In  particular,  this  means  that  r  is 
true.  Since  r  ->  q  is  also  true,  q  must  also  be  true.  Thus  ~<q  is  false 
and,  because  p  — >  {—iq)  is  true,  p  is  false.  Thus  — -p  is  true  as  desired. 

PROBLEM  22.  Determine  whether  or  not  the  following  argument  is  valid. 

If  I  like  biology,  then  I  will  study  it. 

Either  I  study  biology  or  I  fail  the  course. 

If  I  fail  the  course,  then  I  do  not  like  biology. 
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Yes,  There  Are  Proofs! 


1.5.2  THEOREM 


1.5.3  THEOREM 


Solution.  Let  p  be  “I  like  biology,”  q  be  “I  study  biology,”  and  r  be  “I  fail  the 
course.”  In  symbols,  the  argument  we  are  to  check  becomes 

P  <7 
q  V  r 

r  -►  (rp). 

This  can  be  analyzed  by  a  truth  table. 


★ 

★ 

★ 

★ 

★ 


The  rows  marked  *  are  those  in  which  the  premises  are  true.  In  row  1,  the  premises 
are  true,  but  the  conclusion  is  F.  The  argument  is  not  valid.  I 

The  theorem  which  follows  relates  the  idea  of  a  valid  argument  to  the  notions 
introduced  in  Sections  1 .3  and  1 .4. 

An  argument  with  premises  A\ ,  A2,  ■  ■  ■  ,  An  and  conclusion  53  is  valid  precisely 
when  the  compound  statement  A\  A  A2  A  •  •  ■  A  An  -»•  53  is  a  tautology. 

Surely,  this  is  not  hard  to  understand.  In  order  for  the  implication  A\  A  A2  A 
•  ■  -  A An  —>■  51  to  be  a  tautology,  it  must  be  the  case  that  whenever  A  \  hAi/x  -  ■  -  A An 
is  true,  then  53  is  also  true.  But  A\  A  Ai  A  ■  ■  ■  A  An  is  true  precisely  when  each 
of  A\,A2,..-  , An  is  true,  so  the  result  follows  from  our  definition  of  a  valid 
argument. 

In  the  same  spirit  as  Theorem  1.4.1,  we  have  the  following  important  substi¬ 
tution  theorem. 

[Substitution]  Assume  that  an  argument  with  premises  A\,  A2,  . . .  ,  An  and  con¬ 
clusion  53  is  valid,  and  that  all  these  statements  involve  variables  p\,  P2,  •  ■  ■  ,  Pm- 
If  P\,  P2,  ■■■  >  Pm  are  replaced  by  statements  61,62,...  ,  6m,  the  resulting  argu¬ 
ment  is  still  valid. 


p 

<7 

r 

P  <7 

qVr 

“■ P 

r  (-p) 

T 

T 

T 

T 

T 

F 

F 

T 

T 

F 

T 

T 

F 

T 

T 

F 

T 

F 

T 

F 

F 

F 

T 

T 

T 

T 

T 

T 

T 

F 

F 

F 

F 

F 

T 

F 

T 

F 

T 

T 

T 

T 

F 

F 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

T 

T 

Rules  of  inference 

Because  of  Theorem  1.5.3,  some  very  simple  valid  arguments  which  regularly 
arise  in  practice  are  given  special  names.  Here  is  a  list  of  some  of  the  most 
common  rules  of  inference. 
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1.  Modus  ponens:  p 

p  ->  q 
q 

2.  Modus  tollens:  p  —>■  q 

^q 

“■ P 

3.  Disjunctive  syllogism:  /?v^ 

-1 P 


4.  Chain  rule:  p  — >  q 


5.  Resolution:  p  V  r 

q  v  (-t) 
pv  q 

|  Verify  modus  tollens. 

We  illustrate  how  the  rules  of  inference  can  be  applied. 

PROBLEM  23.  Show  that  the  following  argument  is  valid. 

(P  v  q)  — >  (5  A  t) 

[-■  ((— ‘-y)  v  (-0)1  -»  [(-t)  v  q] 

(/)V?)->(r->  q) 

Solution.  One  of  the  laws  of  De  Morgan  and  the  principle  of  double  negation — 
see  Section  1.4 — tell  us  that 


[-*  ((“•*)  v  (-f))]  <=>  [(-’  (-«))  A  (->(-r))]  (s  A  t). 

Property  13  of  logical  equivalence  as  given  in  Section  1.4  says  that  (->r)  Vq  <=> 
(r  — >  q).  Thus  the  given  argument  can  be  rewritten  as 

(pVq)  ^  (s  A  t) 

(5  A  t)  -»  (r  — »  g) 

(?v?)^(r->  ^). 

The  chain  rule  now  tells  us  that  our  argument  is  valid.  I 


If  a  truth  table  were  used  to  answer  Problem  23,  how  many  rows  would  be 
required? 


Sometimes,  rules  of  inference  need  to  be  combined. 
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PROBLEM  24.  Determine  the  validity  of  the  following  argument. 

If  I  study,  then  I  will  pass. 

If  I  do  not  go  to  a  movie,  then  I  will  study. 

I  failed. 

Therefore,  I  went  to  a  movie. 

Solution.  Let  p,  q  and  r  be  the  statements 

p :  “I  study.” 

q :  “I  pass.” 

r :  “I  go  to  a  movie.” 

The  given  argument  is 

P  <7 

(-'>■)  -»  p 

-'q 

r 

The  first  two  premises  imply  the  truth  of  (~v)  ->  q  by  the  chain  rule.  Since 
(t)  -*■  q  and  ->q  imply  ->  (~>r)  by  modus  tollens,  the  validity  of  the  argument 
follows  by  the  principle  of  double  negation:  -■  (~>r)  <=>  r.  I 


Answers  to  Pauses 


18.  While  this  can  be  shown  with  a  truth  table,  we  prefer  an  argument  by  words. 
Since  -<q  is  true,  q  is  false.  Since  p  — >  q  is  true,  p  must  also  be  false.  Hence 
-<p  is  true  and  we  are  done. 

19.  There  are  five  variables,  each  of  which  could  be  T  or  F,  so  we  would  need 
2s  =  32  rows. 


EXERCISES 


The  symbol  [ BB ]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 

You  are  encouraged  to  use  the  result  of  any  exercise  in  this  set  to  assist  with  the  solution  of  any  other. 


1.  Determine  whether  or  not  each  of  the  following  argu¬ 
ments  is  valid. 

(a)  [BB]  p  ->  (q  ->■  r) 

q 

p  — >  r 

(b)  [BB]  p  q 

q  V  r 
r  ->  (-•q) 

(c)  p  ->  q 

r  zL  1 

r  —r  p 


(d)  p  q 

(q  v  (—■/•))  -»  (p  A  s) 
s  — (r  V  q) 

2.  Verify  that  each  of  the  five  rules  of  inference  given  in 
this  section  is  a  valid  argument. 

3.  Verify  that  each  of  the  following  arguments  is  valid. 

(a)  [BB]  p  ->  r 
q  — >  r 
(pV  q)  r 
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(b)  p  ->  r 
q  s 

(p  Aq)-+  (r  As) 

(c)  pvq 
(~<p)  V  r 
(-r)  v  5 

q  V  s 

(d)  p  v  ((—><7)  A  r) 

^(p  as) 

~ '  G  A  (q  V  (-T ))) 

4.  Test  the  validity  of  each  of  the  following  arguments. 

(a)  [BB]  p^q 

(— r)  v  (-9) 

r 

~^P 

(b)  [BB]  p  V  (-<?) 

(tvs)-*  (pv  r) 

(~r)  V  (t  v  s) 
p  +*  (tv  s) 

( pvr )-+  (qvr) 

(c)  p  v  (  ‘<7) 

(tvs)-*  (pVr) 

(— -r)  v  ((  v  j) 

P  **  (t  V  S) 

(qv  r)  -*■  (pv  r) 

(d)  [{p  Aq)Vr]—*(qAr  A  s) 

[(— ’P)  A  (“■? )]  ->(tV  p) 

[p  V  (->q)  Vr]->  (q  As) 

(P  A  <?)  4*  [(<?  A  r)  V  .?] 

5.  Determine  the  validity  of  each  of  the  following  argu¬ 
ments.  If  the  argument  is  one  of  those  listed  in  the  text, 
name  it. 

(a)  [BB]  If  I  stay  up  late  at  night,  then  I  will  be 

tired  in  the  morning. 

I  stayed  up  late  last  night. 

I  am  tired  this  morning. 

(b)  [BB]  If  I  stay  up  late  at  night,  then  I  will  be 

tired  in  the  morning. 

I  am  tired  this  morning. 

I  stayed  up  late  last  night. 

(c)  If  I  stay  up  late  at  night,  then  I  will  be 
tired  in  the  morning. 

I  am  not  tired  this  morning. 

I  did  not  stay  up  late  last  night. 


(d)  If  I  stay  up  late  at  night,  then  I  will  be 
tired  in  the  morning. 

I  did  not  stay  up  late  last  night. 

I  am  not  tired  this  morning. 

(e)  [BB]  Either  I  wear  a  red  tie  or  I  wear  blue  socks. 

I  am  wearing  pink  socks. 

I  am  wearing  a  red  tie. 

(f)  Either  I  wear  a  red  tie  or  I  wear  blue  socks. 

I  am  wearing  blue  socks. 

I  am  not  wearing  a  red  tie. 

(g)  [BB]  If  I  work  hard,  then  I  earn  lots  of  money. 

If  I  earn  lots  of  money,  then  I  pay  high 
taxes. 

If  I  pay  high  taxes,  then  I  have  worked 
hard. 

(h)  If  I  work  hard,  then  I  earn  lots  of  money. 

If  I  earn  lots  of  money,  then  I  pay  high 
taxes. 

If  I  work  hard,  then  I  pay  high  taxes. 

(i)  If  I  work  hard,  then  I  earn  lots  of  money. 

If  I  earn  lots  of  money,  then  I  pay  high 
taxes. 

If  I  do  not  work  hard,  then  I  do  not  pay 
high  taxes. 

(j)  If  I  like  mathematics,  then  I  will  study. 

I  will  not  study. 

Either  I  like  mathematics  or  I  like  football. 

I  like  football. 

(k)  Either  1  study  or  I  like  football. 

If  I  like  football,  then  I  like  mathematics. 

If  I  don’t  study,  then  I  like  mathematics. 

(l)  [BB]  If  I  like  mathematics,  then  I  will  study. 

Either  I  don't  study  or  I  pass  mathematics. 

If  I  don’t  graduate,  then  I  didn't  pass 
mathematics. 

If  I  graduate,  then  I  studied. 

(m)  If  I  like  mathematics,  then  I  will  study. 

Either  I  don’t  study  or  I  pass  mathematics. 

If  I  don’t  graduate,  then  I  didn’t  pass 
mathematics. 

If  I  like  mathematics,  then  I  will  graduate. 

6.  [BB]  Given  the  premises  p  -*  (~>r)  and  r  v  q,  either 
write  down  a  valid  conclusion  which  involves  p  and 
q  only  and  is  not  a  tautology  or  show  that  no  such 
conclusion  is  possible. 

7.  [BB]  Repeat  Exercise  6  with  the  premises  (~<p)  -*■  r 
and  rVq. 
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8.  (a)  [BB]  Explain  why  two  premises  p  and  q  can  always 

be  replaced  by  the  single  premise  p  A  q  and  vice 
versa. 

(b)  Using  8(a),  verify  that  the  following  argument  is 
valid. 

p  Aq 
p  — >  r 

(->s)  A  r 

9.  Let  n  be  an  integer  greater  than  1.  Show  that  the  fol¬ 
lowing  argument  is  valid. 


P  i  -*■  (q  t  n) 
P2  ->  (qi  ->  n) 


Pn  ^  0?rt  *  A; ) 

_ q\  Aq2A---  Aq„ _ 

(pi  ->  ri)  A  (p2  ->  n)  A  ■  •  •  A  (p„  r„) 

[Hint:  Exercise  8(a).] 

10.  [BB]  What  language  is  being  used  when  we  say  "modus 
ponens”  or  “modus  tollens”?  Translate  these  expressions 
into  English  and  explain. 


REVIEW  EXERCISES  FOR  CHAPTER  1 


1.  State,  with  a  reason,  whether  each  of  the  following  state¬ 
ments  is  true  or  false. 

(a)  If  a  and  b  are  integers  with  a  —  b>  0  and  b—  a  >  0, 
then  a  b. 

(b)  If  a  and  b  are  integers  with  a  —  b>  0  and  b  —  a>  0, 
then  a  #  b. 

2.  Write  down  the  negation  of  each  of  the  following  state¬ 
ments  in  clear  and  concise  English. 

(a)  Either  x  is  not  a  real  number  or  x  >  5. 

(b)  There  exists  a  real  number  a:  such  that  n  >  x  for 
every  integer  n. 

(c)  If  x,  y,  z  are  positive  integers,  then  a;3  -4-  y3  ^  z3. 

(d)  If  a  graph  has  n  vertices  and  n  +  1  edges,  then  its 
chromatic  number  is  at  most  3. 

3.  Write  down  the  converse  and  contrapositive  of  each  of 
the  following  implications. 

(a)  If  a  and  b  are  integers,  then  ab  is  an  integer. 

(b)  If  ar  is  an  even  integer,  then  ,v2  is  an  even  integer. 

(c)  Every  planar  graph  can  be  colored  with  at  most  four 
colors. 

4.  Rewrite  each  of  the  following  statements  using  the  quan¬ 
tifiers  "for  all”  and  “there  exists"  as  appropriate. 

(a)  Not  all  countable  sets  are  finite. 

(b)  1  is  the  smallest  positive  integer. 


5.  (a)  Determine  whether  or  not  the  following  implication 

is  true. 

“x  is  a  positive  odd  integer 

->  x  +  2  is  a  positive  odd  integer.” 

(b)  Repeat  (a)  with  -*■  replaced  by  -o-. 

6.  Let  n  be  an  integer.  Prove  that  n3  is  odd  if  and  only  if 
n  is  odd. 

7.  With  a  proof  by  contradiction,  show  that  there  exists  no 
largest  negative  rational  number. 

8.  Construct  a  truth  table  for  the  compound  statement  [p  A 
(q  (“'f))]  -*■  IH)Vr]. 

9.  Determine  the  truth  value  of  [p  V  (q  — >  ((~r)  A.?))] 

(r  A  t),  where  p ,  q,  r,  .v  and  t  are  all  true. 

10.  Two  compound  statements  A  and  23  have  the  property 
that  A  ->  23  is  logically  equivalent  to  23  ->  A.  What 
can  you  conclude  about  A  and  23? 

11.  (a)  Suppose  A,  23,  and  6  are  compound  statements 

such  that  A  <=>  23  and  23  <==>  C.  Explain 

why  A  «=>  C. 

(b)  Give  a  proof  of  Property  1 1  which  uses  the  result 
of  Property  13. 

12.  Establish  the  logical  equivalence  [(p  — >  q)  — >  r ]  <=> 
[(p  v  r)  A  (-(</  A  (^r)))]. 

13.  Express  ((p  v  q)  a  r)  v  ((p  v  q)  a  (->/?))  in  disjunctive 
normal  form. 
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14.  Determine  whether  or  not  each  of  the  following  argu¬ 
ments  is  valid. 

(a)  -((-/>)  a  <7) 

"'(pAr) 
r  V  s 
q  -»  s 

(b)  p  V  (- <q ) 

(t  v  j)  (p  v  r) 

(-r)  v  (r  v  j) 
p  •<->  (t  v  s) 

(p  A  r)  — *  (q  A  r) 

15.  Discuss  the  validity  of  the  argument 

P  a  q 

_ (~'P)Ar _ 

Purple  toads  live  on  Mars. 


16.  Determine  the  validity  of  each  of  the  following  argu¬ 
ments.  If  the  argument  is  one  of  those  listed  in  the  text, 
name  it. 

(a)  Either  I  wear  a  red  tie  or  I  wear  blue  socks. 

Either  I  wear  a  green  hat  or  I  do  not  wear 
blue  socks. 

Either  I  wear  a  red  tie  or  I  wear  a  green 
hat. 

(b)  If  I  like  mathematics,  then  I  will  study. 

Either  I  don’t  study  or  I  pass  mathematics. 

If  I  don’t  pass  mathematics,  then  I  don’t 
graduate. 

If  I  graduate,  then  I  like  mathematics. 
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2.1  Sets 

Any  branch  of  science,  like  a  foreign  language,  has  its  own  terminology.  “Iso¬ 
morphism,”  “cyclotomic,”  and  “cosef  ’  are  words  one  is  unlikely  to  hear  except  in 
a  mathematical  context.  On  the  other  hand,  quite  a  number  of  common  English 
words — field,  complex,  function — have  precise  mathematical  meanings  quite  dif¬ 
ferent  from  their  usual  ones.  Students  of  French  or  Spanish  know  that  mem¬ 
ory  work  is  a  fundamental  part  of  their  studies;  it  is  perfectly  obvious  to  them 
that  without  a  firm  grasp  of  the  meaning  of  words,  their  ability  to  learn  gram¬ 
mar  and  to  communicate  will  be  severely  hindered.  It  is,  however,  not  always 
understood  by  science  students  that  they  must  memorize  the  terminology  of  their 
discipline  with  the  same  diligence  that  students  of  Russian  memorize  vocabu¬ 
lary.  Without  constant  review  of  the  meanings  of  words,  one’s  understanding  of 
a  paragraph  of  text  or  the  words  of  a  teacher  is  very  limited.  We  advise  read¬ 
ers  of  this  book  to  maintain  and  constantly  review  a  mathematical  vocabulary 
list. 

What  would  it  be  like  to  delve  into  a  dictionary  if  you  didn't  already  know 
the  meanings  of  some  of  the  words  in  it?  Most  people,  at  one  time  or  another, 
have  gone  to  a  dictionary  in  search  of  a  word  only  to  discover  that  the  definition 
uses  another  unfamiliar  word.  Some  reflection  indicates  that  a  dictionary  can  be 
of  no  use  unless  there  are  some  words  which  are  so  basic  that  we  can  understand 
them  without  definitions.  Mathematics  is  the  same  way.  There  are  a  few  basic 
terms  which  we  accept  without  definitions. 

Most  of  mathematics  is  based  upon  the  single  undefined  concept  of  set,  which 
we  think  of  as  just  a  collection  of  things  called  elements  or  members.  Primitive 
humans  discovered  the  set  of  natural  numbers  with  which  they  learned  to  count. 
The  set  of  natural  numbers,  which  is  denoted  with  a  capital  boldface  N  or,  in  hand¬ 
writing,  with  this  symbol,  N,  consists  of  the  numbers  1, 2,  3, . . .  (the  three  dots 
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meaning  “and  so  on”).1  The  elements  of  N  are,  of  course,  just  the  positive  inte¬ 
gers.  The  full  set  of  integers,  denoted  Z  or  Z,  consists  of  the  natural  numbers,  their 
negatives,  and  0.  We  might  describe  this  set  by  ...  ,  —3,  —2,  -1,  0,  1,  2,  3,  ... . 
Our  convention,  which  is  not  universal,  is  that  0  is  an  integer,  but  not  a  natural 
number. 

There  are  various  ways  to  describe  sets.  Sometimes  it  is  possible  to  list  the 
elements  of  a  set  within  braces. 

•  {eggl,  egg2}  is  a  set  containing  two  elements,  eggl  and  egg2. 

•  {x}  is  a  set  containing  one  element,  x. 

•  N  =  (1,  2,  3, . . . }  is  the  set  of  natural  numbers. 

•  Z  =  {. . .  ,  -3,  —2,  —  1,  0,  1,  2,  3, . . . )  is  the  set  of  integers. 

On  other  occasions,  it  is  convenient  to  describe  a  set  with  so-called  set  builder 
notation.  This  has  the  format 

{x  |  x  has  certain  properties}, 

which  is  read  “the  set  of  x  such  that  x  has  certain  properties.”  We  read  “such 
that”  at  the  vertical  line,  “|.” 

More  generally,  we  see 

{some  expression  |  the  expression  has  certain  properties). 

Thus,  the  set  of  odd  natural  numbers  could  be  described  as 
{n  |  n  is  an  odd  integer,  n  >  0} 


or  as 


{2k  -  1  |  k  =  1,2,3, ...} 


or  as 


{2k  -  1  |  k  e  N}. 

The  expression  “k  e  N”  is  read  “k  belongs  to  N,”  the  symbol  e  denoting  set 
membership.  Thus,  “m  e  Z”  simply  records  the  fact  that  m  is  an  integer.  Recall 
that  a  slash  (/)  written  over  any  mathematical  symbol  negates  the  meaning  of  that 
symbol.  So,  in  the  same  way  that  n  ^  3.14.  we  have  0  £  N. 

The  set  of  common  fractions — numbers  like  |,  yy,  and  5  (=  y),  which  are 
ratios  of  integers  with  nonzero  denominators — is  more  properly  called  the  set  of 
rational  numbers  and  is  denoted  Q  or  Q.  Formally, 

Q  =  {f  \m,n  el,n  ^  0}. 

The  set  of  all  real  numbers  is  denoted  R  or  R.  To  define  the  real  numbers 
properly  requires  considerable  mathematical  maturity.  For  our  purposes,  we  think 


1  Since  the  manufacture  of  boldface  symbols  such  as  N  is  a  luxury  not  afforded  users  of  chalk  or  pencil, 
it  has  long  been  traditional  to  use  N  on  blackboards  or  in  handwritten  work  as  the  symbol  for  the  natural 
numbers  and  to  call  N  a  blackboard  bold  symbol. 
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of  real  numbers  as  numbers  which  have  decimal  expansions  of  the  form  a.a\ci2.  ■  ■ 
where  a  is  an  integer  and  a\,a2,  ■  ■  ■  are  integers  between  0  and  9  inclusive.  In 
addition  to  the  rational  numbers,  whose  decimal  expansions  terminate  or  repeat, 
the  real  numbers  include  numbers  like  V2,  V 17,  e,  ? r,  In  5,  and  cos  f  whose  dec¬ 
imal  expansions  neither  terminate  nor  repeat.  Such  numbers  are  called  irrational. 
An  irrational  number  is  a  number  which  cannot  be  written  in  the  form  —  with  m 
and  n  both  integers.  Incidentally,  it  can  be  very  difficult  to  decide  whether  or  not 
a  given  real  number  is  irrational.  For  example,  it  is  unknown  whether  or  not  such 
numbers  as  e  +  n  or  ~  are  irrational. 

The  complex  numbers,  denoted  C  or  C,  have  the  form  a  +  hi  where  a  and  b 
are  real  numbers  and  r  =  —1;  that  is, 

C  =  [a  +  bi  |  a,  b  €  R,  i 2  =  —  1). 


Sometimes  people  are  surprised  to  discover  that  a  set  can  be  an  element  of 
another  set.  For  example,  {{a,b},c}  is  a  set  with  two  elements,  one  of  which  is 
{a,  b}  and  the  other  c. 


Let  S  denote  the  set  {{a},b,c}.  True  or  false? 


(a)  a  i  S. 

(b) 


2.1.1  DEFINITION 


Equality  of  Sets 

Sets  A  and  B  are  equal,  and  we  write  A  —  B,  if  and  only  if  A  and  B  contain  the 
same  elements  or  neither  set  contains  any  element. 


EXAMPLES  1 


{1,2,1}  =  {1,2}  =  {2,1}; 

ll  1  =1  2Ll — (ip 
i  2  ’  4  ’  — 6  ’  2n‘  1  2  *’ 

[t  |  t  =  r  —  s,  r,  s  e  (0, 1,  2}}  = 


-2, -1,0,  1,2}. 


The  Empty  Set 

One  set  which  arises  in  a  variety  of  different  guises  is  the  set  which  contains  no 
elements.  Consider,  for  example,  the  set  small  of  people  less  than  1  millimeter  in 
height,  the  set  LARGE  of  people  taller  than  the  Eiffel  Tower,  the  set 

fpSaiEOSEH  =  {«  <e  N  |  5n  =  2}, 


and  the  set 


5  =  [n  e  N  |  n2  +  1  =  0}. 

These  sets  are  all  equal  since  none  of  them  contains  any  elements.  The  unique  set 
which  contains  no  elements  is  called  the  empty  set.  Set  theorists  originally  used 
0  (zero)  to  denote  this  set,  but  now  it  is  customary  to  use  a  0  with  a  slash  through 
it,  0,  to  avoid  confusion  between  zero  and  a  capital  “Oh.” 


True  or  false?  {0}  =  0. 
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2.1.2  DEFINITION 


Subsets 

A  set  A  is  a  subset  of  a  set  R,  and  we  write  A  c  B,  if  and  only  if  every  element 
of  A  is  an  element  of  B.  If  A  C  B  but  A  ^  B,  then  A  is  called  a  proper  subset 
of  B  and  we  write  A  ^  B. 


When  A  c  B,  it  is  common  to  say  “A  is  contained  in  B ”  as  well  as  “A 
is  a  subset  of  B.”  The  notation  A  c  B,  which  is  common,  unfortunately  means 
A  C  b  to  some  people  and  A  C  B  to  others.  For  this  reason,  we  avoid  it,  while 
reiterating  that  it  is  present  in  a  lot  of  mathematical  writing.  Upon  encountering 
it,  the  reader  should  make  an  effort  to  discover  what  the  intended  meaning  is. 

We  occasionally  see  “B  3  A,”  read  “fi  is  a  superset  of  A.”  This  is  an 
alternative  way  to  express  “A  C  B”  A  is  a  subset  of  B,  just  as  >  x”  is  an 
alternative  way  to  express  “x  <  y”  We  generally  prefer  the  subset  notation. 


EXAMPLES  2 


•  {a,  b }  C  {a,  b,  c } 

•  { a ,  b)  C  { a ,  b,  c } 

•  la,  b}  C  { a ,  b,  {a,  b}} 

•  {a,  fi}  €  (a,  b,  {a,  b}} 

.  NCZCQCRCC 


Note  the  distinction  between  A  ^  B  and  A  %  B,  the  latter  expressing  the 
negation  of  A  C  6;  for  example. 


{a,b}  g  { a,b ,  c)  g  {«,£,  x). 


2.1.3  PROPOSITION 


For  any  set  A,  A  c  A  and  0c  A. 


Proof  If  a  g  A,  then  a  €  A,  so  A  c  A.  The  proof  that  0  C  A  is  a  classic  model  of 
proof  by  contradiction.  If  0  C  A  is  false,  then  there  must  exist  some  x  e  0  such 
that  x  <£  A.  This  is  an  absurdity  since  there  is  no  x  €  0.  I 


True  or  false? 


(a)  {0}  €  {{0}} 

(b)  0  c  {{0}} 

(c)  {0}C{{0}} 

(As  Shakespeare  once  wrote,  “Much  ado  about  nothing.”) 

The  following  proposition  is  an  immediate  consequence  of  the  definitions  of 
“subset”  and  “equal  sets,”  and  it  illustrates  the  way  we  prove  two  sets  are  equal 
in  practice. 


2.1.4  PROPOSITION 


If  A  and  B  are  sets,  then  A  =  B  if  and  only  if  A  c  B  and  B  c  A. 


There  are  two  assertions  being  made  here. 

(-»)  If  A  =  B,  then  A  is  a  subset  of  B  and  B  is  a  subset  of  A. 
(<— )  If  A  is  a  subset  of  B  and  I?  is  a  subset  of  A,  then  A  =  B. 
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Remember  that  another  way  to  state  Proposition  2. 1 .4  is  to  say  that,  for  two 
sets  to  be  equal,  it  is  necessary  and  sufficient  that  each  be  a  subset  of  the  other. 

Note  the  distinction  between  membership,  a  e  b,  and  subset,  a  c  b.  By  the 
former  statement,  we  understand  that  a  is  an  element  of  the  set  b\  by  the  latter, 
that  a  is  a  set  each  of  whose  elements  is  also  in  the  set  b.2 


EXAMPLES  3  Each  of  the  following  assertions  is  true. 

.  {a}  €  [x,y,  {a}} 

.  {a}  C  { x,y,a } 

•  {«}  %  \.x,y,  [a}} 

•  {a,  b }  c  {a,  b } 

.  0  €  {x ,y,0} 

•  0  C  {x,  y,  0} 

•  {0}  ^  [x,  y,  0} 

The  Power  Set 


2.1.5  DEFINITION 


An  important  example  of  a  set,  all  of  whose  elements  are  themselves  sets,  is  the 
power  set  of  a  set. 

The  power  set  of  a  set  A,  denoted  V(A),  is  the  set  of  all  subsets  of  A: 

V{A)  =  {B  |  B  c  A). 


EXAMPLES  4 


Answers  to  Pauses 


•  If  A  =  (a},  then  V(A)  =  {0,  {a}) 

.  If  A  =  [a,  b],  then  P(A)  =  {0,  {a},  {b),  {a,  b }} 

.  V({a,  b,  c})  =  {0,  {a},  {b},  {c},  {a,  b],  {a,  c],  {b,  c),  [a,  b,  c}}  A 

1.  Both  statements  are  true.  The  set  S  contains  the  set  {a}  as  one  of  its  elements, 
but  not  the  element  a. 

2.  This  statement  is  false:  {0}  is  not  the  empty  set  for  it  contains  one  element, 
namely,  the  set  0. 

3.  (a)  True:  {{0}}  is  a  set  which  contains  the  single  element  {0}. 

(b)  True:  The  empty  set  is  a  subset  of  any  set. 

(c)  False:  There  is  just  one  element  in  the  set  {0}  (namely,  0)  and  this  is  not 
an  element  of  the  set  {{0}},  whose  only  element  is  {0}. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 

1.  List  the  (distinct)  elements  in  each  of  the  following  sets:  (c)  [BB]  [x  e  Q  |  x(x2  —  2)(2x  +  3)  =  0} 

(a)  [BB]  [x  e  R  |  x2  =  5)  (d)  {x  +  y  \  x  e  [-1,0,  1],  y  e  [0,  1,2}} 

(b)  {x  e  Z  |  xy  =  15  for  some  y  e  Z)  (e)  [a  e  N  |  a  <  — 4  and  a  >  4} 


2Note  the  use  of  lowercase  letters  for  sets,  which  is  not  common  but  certainly  permissible. 
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2.  List  five  elements  in  each  of  the  following  sets: 

(a)  [BB]  [a+bi  \  a,b  &1J2  =  -1} 

(b)  { a  +  bs/2  |  a  e  N,  -b  e  {2,  5,  7}} 

(c)  {i  \x,y  eR.x2  +  y2  =  25} 

(d)  [n  e  N  |  n2  +  n  is  a  multiple  of  3} 

3.  Let  A  =  (1,2,  3,4}. 

(a)  [BB]  List  all  the  subsets  B  of  A  such  that 
(1,2}  c  B. 

(b)  List  all  the  subsets  B  of  A  such  that  B  c  (1,2). 

(c)  List  all  the  subsets  B  of  A  such  that  {1,2}  B. 

.  (d)  List  all  the  subsets  B  of  A  such  that  B  (1,2). 

(e)  List  all  the  subsets  B  of  A  such  that  {1,2}  C  B. 

(f)  List  all  the  subsets  B  of  A  such  that  B  L  {1,2}. 

4.  [BB]  Let  A  =  {{a,b}}.  Are  the  following  statements 
true  or  false?  Explain  your  answer. 

(a)  a  e  A. 

(b)  A  e  A. 

(c)  {<3,  b }  €  A. 

(d)  There  are  two  elements  in  A. 

5.  Determine  which  of  the  following  are  true  and  which 
are  false.  Justify  your  answers. 

(a)  [BB]  3  e  {1,3,5} 

(b)  {3}  €  (1,3,5} 

(c)  (3}  C  {1,3,5} 

(d)  [BB]  {3,5}  %  {1,3,5} 

(e)  {1,3,5}  C  (1,3,5} 

(f)  1  €  [a  +  2b  |  a,  b  even  integers} 

(g)  0  €  {a  +  by/ 2  |  a,  b  e  Q,  b  #  0} 

6.  Find  the  power  sets  of  each  of  the  following  sets: 

(a)  [BB]  0 

(b)  {0} 

(c)  {0,{0}} 

7.  Determine  whether  each  of  the  following  statements  is 
true  or  false.  Justify  your  answers. 

(a)  [BB]  0  c  0 

(b)  0  c  {0} 

(c)  0  €  0 

(d)  0  e  {0} 

(e)  |BB]  {1,2}  %  {1,2,3,{I,2,3}} 

(f)  {1,2}  6  {1,2,3,  {1,2,3}} 

(g)  {1,2}  C  {1,2,  {{1,2}}} 

(h)  [BB]  {1,2}  €{1,2,  {{1,2}}} 

(i)  {{1,2}}  c  {1,2,  {1,2}} 


8.  [BB]  Let  A  be  a  set  and  suppose  x  e  A.  Is  x  C  A  also 
possible?  Explain. 

9.  (a)  List  all  the  subsets  of  the  set  [a,b,c,d]  which 

contain: 

i.  four  elements 

ii.  [BB]  three  elements 

iii.  two  elements 

iv.  one  element 

v.  no  elements 

(b)  How  many  subsets  of  { a ,  b ,  c,  d)  are  there  alto¬ 
gether? 

10.  (a)  How  many  elements  are  in  the  power  set  of  the 

power  set  of  the  empty  set? 

(b)  Suppose  A  is  a  set  containing  one  element.  How 
many  elements  are  in  P(P(A))1 

11.  (a)  [BB]  If  A  contains  two  elements,  how  many  ele¬ 

ments  are  there  in  the  power  set  of  A? 

(b)  [BB]  If  A  contains  three  elements,  how  many  ele¬ 
ments  are  there  in  the  power  set  of  A? 

(c)  [BB]  If  a  set  A  contains  n  >  0  elements,  guess  how 
many  elements  are  in  the  power  set  of  A. 

12.  Suppose  A,  B ,  and  C  are  sets.  For  each  of  the  following 
statements  either  prove  it  is  true  or  give  a  counterexam¬ 
ple  to  show  that  it  is  false. 

(a)  [BB]  A  €  B,  B  G  C  ->  A  €  C. 

(b)  A  C  B,  B  C  C  ->•  A  C  C. 

(c)  A  C  b,  B  C  C  -+  A  g  c. 

(d)  [BB]  A  €  B,  B  CC  — ►  A  €  C. 

(e)  A  e  B,  B  c  C  ->  A  C  C. 

(f)  A  c  B,  B  €  C  ->  A  e  C. 

(g)  A  c  B,  B  g  C  ->  A  C  C. 

13.  Suppose  A  and  B  are  sets. 

(a)  Answer  true  or  false  and  explain:  A  g  B  — > 
B  c  A. 

(b)  Is  the  converse  of  the  implication  in  (a)  true  or  false? 
Explain. 

14.  Suppose  A,  B,  and  C  are  sets.  Prove  or  give  a  counter¬ 
example  which  disproves  each  of  the  following  asser¬ 
tions. 

(a)  [BB]  C  €  P(A)  «Ccj, 

(b)  A  c  B  <+  P(A)  c  P(B). 

(c)  A  =  0  -o-  P(A)  =  0. 
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2.2  Operations  on  Sets 

In  this  section,  we  discuss  ways  in  which  two  or  more  sets  can  be  combined  in 
order  to  form  a  new  set. 


2.2.1  DEFINITIONS 


Union  and  intersection 

The  union  of  sets  A  and  B,  written  A  U  B,  is  the  set  of  elements  in  A  or  in  B 
(or  in  both).  The  intersection  of  A  and  B,  written  A  fl  B,  is  the  set  of  elements 
which  belong  to  both  A  and  B. 


EXAMPLES  5  •  If  A  =  [a,  b ,  c }  and  B  =  {a,  x,  y,  b ),  then 

A  U  B  =  {a,  b,  c,  x,  _y},  A  (1  B  =  {a,  bj, 

AU(0)  =  {a,  b,  c,  0}  and  B  n  (0)  =  0. 

•  For  any  set  A,  A  U  0  =  A  and  4  0  0  =  0.  A 


As  with  addition  and  multiplication  of  real  numbers,  the  union  and  intersection 
of  sets  are  associative  operations.  To  say  that  set  union  is  associative  is  to  say 
that 


(A,  UA2)UA3  =  4,  U(42U43) 
for  any  three  sets  A|,  A2,  A3.  It  follows  that  the  expression 

A]  UA2UA3 

is  unambiguous.  The  two  different  interpretations  (corresponding  to  different  inser¬ 
tions  of  parentheses)  agree.  The  union  of  n  sets  A\,  A2,  ....  A„  is  written 

n 

(1)  A|  U  A2  U  A3  U  •  •  •  U  An  or  (J  A, 

!  =  1 

and  represents  the  set  of  elements  which  belong  to  one  or  more  of  the  sets  A,  . 
The  intersection  of  A 1,  A2,  . . .  ,  A„  is  written 

n 

(2)  a  \  n  a2  n  a3  n  •  •  •  n  a„  or  P)  a,- 

i= 1 

and  denotes  the  set  of  elements  which  belong  to  all  of  the  sets. 

Do  not  assume  from  the  expression  A 1  U  A2  U  A3  U  •  •  •  U  A„  that  n  is  actually 
greater  than  3  since  the  first  part  of  this  expression — Ai  UA2U  A3 — is  present 
only  to  make  the  general  pattern  clear;  a  union  of  sets  is  being  formed.  The 
last  term — A„ — indicates  that  the  last  set  in  the  union  is  An.  If  n  =  2,  then 
A[  U  A2  U  A3  U  •  •  •  U  An  means  A]  U  A2.  Similarly,  if  n  =  1,  the  expression 
A 1  n  A2  n  •  •  •  n  A„  simply  means  A 1 . 

While  parentheses  are  not  required  in  expressions  like  (1)  or  (2),  they  are 
mandatory  when  both  union  and  intersection  are  involved.  For  example,  An(fiUC) 
and  (AnS)UC  are,  in  general,  different  sets.  This  is  probably  most  easily  seen 
by  the  use  of  the  Venn  diagram  shown  in  Fig  2. 1 . 
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Figure  2.1  A  Venn  diagram. 

The  diagram  indicates  that  A  consists  of  the  points  in  the  regions  labeled  1, 
2,  3,  and  4;  B  consists  of  those  points  in  regions  3,  4,  5,  and  6  and  C  of  those 
in  2,  3,  5,  and  7.  The  set  B  U  C  consists  of  points  in  the  regions  labeled  3,  4,  5, 
6,  2,  and  7.  Notice  that  A  (T  (B  U  C)  consists  of  the  points  in  regions  2,  3,  and  4. 
The  region  An  B  consists  of  the  points  in  regions  3  and  4;  thus,  (A  n  B)  U  C  is 
the  set  of  points  in  the  regions  labeled  3,  4,  2,  5,  and  7.  The  diagram  enables  us 
to  see  that,  in  general,  An(BUC)/(AnS)UC  and  it  shows  how  we  could 
construct  a  specific  counterexample.  Namely,  we  could  let  A,  B,  and  C  be  the 
sets 

A  =  (1,  2,  3, 4},  B  =  {3,4.  5.  6),  C  =  {2.  3,  5,  7} 
as  suggested  by  the  diagram  and  then  calculate 

An(BUC)  =  {2,  3,  4}  ^  {2,  3,  4,  5,  7}  =  (A  n  B)  U  C. 

There  is  a  way  to  rewrite  Afl(BUC).  In  Fig  2.1,  we  see  that  AD  B  consists 
of  the  points  in  the  regions  labeled  3  and  4  and  that  ADC  consists  of  the  points 
in  2  and  3.  Thus,  the  points  of  (A  fl  B)  U  (A  fl  C)  are  those  of  2,  3,  and  4.  These 
are  just  the  points  of  A  fl  (B  U  C)  (as  observed  previously),  so  the  Venn  diagram 
makes  it  easy  to  believe  that,  in  general, 

(3)  A  n  (B  UC)  =  (A  n  B)  U  (A  HC). 

While  pictures  can  be  helpful  in  making  certain  statements  seem  plausible,  they 
should  not  be  relied  upon  because  they  can  also  mislead.  For  this  reason,  and 
because  there  are  situations  in  which  Venn  diagrams  are  difficult  or  impossible 
to  create,  it  is  important  to  be  able  to  establish  relationships  among  sets  without 
resorting  to  a  picture. 


PROBLEM  6.  Let  A,  B  and  C  be  sets.  Verify  equation  (3)  without  the  aid  of  a 
Venn  diagram. 

Solution.  As  observed  in  Proposition  2.1.4,  showing  that  two  sets  are  equal  is 
equivalent  to  showing  that  each  is  a  subset  of  the  other.  Here  this  just  amounts 
to  expressing  the  meaning  of  U  and  D  in  words. 

To  show  Afl(BUC)  c  (A  Pi  B)  U  (A  fl  C),  let  x  e  A  fl  (B  U  C).  Then  x  is 
in  A  and  also  in  BUC.  Since  x  e  B  U  C,  either  x  e  B  or  x  e  C.  This  suggests 
cases. 

Case  1:  x  e  B. 

In  this  case,  x  is  in  A  as  well  as  in  B.  so  it’s  in  A  fl  B. 
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Case  2:  x  e  C. 

Here  x  is  in  A  as  well  as  in  C,  so  it’s  in  ADC. 

We  have  shown  that  either  x  €  A  D  B  or  x  e  A  H  C.  By  definition  of  union, 
x  €  (A  n  B)  U  (A  n  C),  completing  this  half  of  our  proof. 

Conversely,  we  must  show  (A  D  B)  U  (A  D  C)  C  A  D  (B  U  C).  For  this,  let 
v  e  (A  fl  B)  U  (A  n  C).  Then  either  x  e  AD  B  or  x  e  A  Pi  C.  Thus,  x  is  in  both 
A  and  B  or  in  both  A  and  C.  In  either  case,  x  e  A.  Also,  x  is  in  either  B  or  C; 
thus,  x  G  B  U  C.  So  x  is  both  in  A  and  in  B  U  C;  that  is,  x  €  A  fl  (B  U  C).  This 
completes  the  proof.  I 


PROBLEM  7.  For  sets  A  and  B,  prove  that  A  fl  B  =  A  if  and  only  if  A  c  B. 

Solution.  Remember  that  there  are  two  implications  to  establish  and  that  we  use 
the  symbolism  (— ►)  and  (<— )  to  mark  the  start  of  the  proof  of  each  implication. 

(-*)  Here  we  assume  A  D  B  =  A  and  must  prove  A  C  B.  For  this,  suppose 
x  €  A.  Then,  x  €  An  B  (because  we  are  assuming  A  =  A  D  B).  Therefore,  x  is 
in  A  and  in  B,  in  particular,  x  is  in  B.  This  proves  A  C  B. 

(-<— )  Now  we  assume  A  c  B  and  prove  A  D  B  =  A.  To  prove  the  equality  of 
AH  B  and  A,  we  must  prove  that  each  set  is  a  subset  of  the  other.  By  definition 
of  intersection,  A  n  B  is  a  subset  of  A,  so  A  H  B  c  A.  On  the  other  hand,  suppose 
x  e  A.  Since  A  c  B,  x  is  in  B  too;  thus,  x  is  in  both  A  and  B.  Therefore, 
A  c  A  n  B.  Therefore,  A  =  A  n  B.  I 

For  sets  A  and  B,  prove  that  A  U  B  —  B  if  and  only  if  A  c  B. 


3.2.2  DEFINITIONS 


Set  Difference 

The  set  difference  of  sets  A  and  B,  written  A  \  B,  is  the  set  of  those  elements  of 
A  which  are  not  in  B.  The  complement  of  a  set  A  is  the  set  Ac  =  U  \  A,  where 
U  is  some  universal  set  made  clear  by  the  context. 


EXAMPLES  8 


.  { a ,  b,  c }  \  {a,  b }  =  (c) 

•  [a,  b,  c]  \  [a,  x}  =  [b,  c j 

•  {a,  b,  0}  \  0  =  {a,  b ,  0} 

.  {a,  b,  0}  \  {0}  =  { a,b } 

•  If  A  is  the  set  {Monday,  Tuesday,  Wednesday,  Thursday,  Friday),  the  context 

suggests  that  the  universal  set  is  the  days  of  the  week,  so  Ac  =  {Saturday, 
Sunday).  A 


Notice  that  A  \  B  =  Aflgf  and  also  that  (Ac')c  =  A.  For  example,  if 
A  =  {x  e  Z  |  x2  >  0),  then  Ac  =  {0)  (it  being  understood  that  U  =  Z) 
and  so 


(Ac)c  =  {0)c  =  {x  g  Z  |  x  >  0)  U  {x  €  Z  |  x  <  0)  =  A. 

You  may  have  previously  encountered  standard  notation  to  describe  various 
types  of  intervals  of  real  numbers. 
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2.2.3  INTERVAL 
NOTATION 


If  a  and  b  are  real  numbers  with  a  <  b,  then 
[a,  b]  =  {x  €  R  |  a  <  x  <  b } 


closed 


(a,  b)  =  {x  e  R  |  a  <  x  <  b]  open 

(a,  b]  =  {x  e  R  |  a  <  x  <  b)  half  open 

[a,  b)  =  {x  G  R  |  a  <  x  <  b}  half  open. 

As  indicated,  a  closed  interval  is  one  which  includes  both  end  points,  an  open 
interval  includes  neither,  and  a  half  open  interval  includes  just  one  end  point.  A 
square  bracket  indicates  that  the  adjacent  end  point  is  in  the  interval.  To  describe 
infinite  intervals,  we  use  the  symbol  co  (which  is  just  a  symbol)  and  make  obvious 
adjustments  to  our  notation.  For  example, 

(— oo,  b]  =  {x  €  R  |  x  <  b}, 


(a,  oo)  =  {x  6  R  |  x  >  a). 
The  first  interval  here  is  half  open;  the  second  is  open. 


2.2.4  THE  LAWS  OF  DE 
MORGAN 


If  A  =  [—4, 4]  and  B  =  [0,  5],  then  A\B  =  [—4,  0).  What  is  B  \  A?  What  is  Ac? 


The  following  two  laws,  of  wide  applicability,  are  attributed  to  Augustus  De 
Morgan  (1806-1871),  who,  together  with  George  Boole  (1815-1864),  helped  to 
make  England  a  leading  center  of  logic  in  the  nineteenth  century.3 

(A  U  B)c  =  Ac  Cl  Bc\  (A  n  BY  =  Ac  LJ  Bc . 


Readers  should  be  struck  by  the  obvious  connection  between  these  laws  and 
the  rules  for  negating  and  and  or  compound  sentences  described  in  Section  1.1. 
We  illustrate  by  showing  the  equivalence  of  the  first  law  of  De  Morgan  and  the 
rule  for  negating  “A  or  33.” 

PROBLEM  9.  Prove  that  (A  U  Bf  =  Ac  Pi  Bc  for  any  sets  A,  B,  and  C. 
Solution.  Let  A  be  the  statement  “x  e  A”  and  33  be  the  statement  “x  e  B.” 
Then 

te(AU  B)c  4*  -fix  e  A  U  B) 

4»  —‘{A  or  33)  definition  of  union 

4*  ->A  and  -'33  rule  for  negating  “or” 

44  x  e  Ac  and  x  e  Bc 

44  x  €  Ac  H  Bc  definition  of  intersection. 


3  As  pointed  out  by  Rudolf  and  Gerda  Fritsch  (Der  Vierfarbensatz,  B.  I.  Wissenschaftsverlag,  Mannheim. 
1994  and  English  translation.  The  Four-Color  Theorem,  by  J.  Peschke,  Springer- Verlag,  1998),  it  was  in  a 
letter  from  De  Morgan  to  Sir  William  Rowan  Hamilton  that  the  question  giving  birth  to  the  famous  “Four- 
Color  Theorem"  was  first  posed.  See  Section  14.2  for  a  detailed  account  of  this  theorem  whose  proof  was 
found  only  quite  recently,  after  over  100  years  of  effort! 
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2.2.5  DEFINITION 


EXAMPLES  10 


The  sets  (A  U  B)r  and  Ac  (T  Br  contain  the  same  elements,  so  they  are  the 
same.  I 

Symmetric  Difference 

The  symmetric  difference  of  two  sets  A  and  B  is  the  set  A  ©  B  of  elements  which 
are  in  A  or  in  6,  but  not  in  both. 

The  notation  A  A  B  is  used  by  some  authors  to  denote  symmetric  difference 
although,  as  suggested,  this  is  not  our  preference. 

Notice  that  the  symmetric  difference  of  sets  can  be  expressed  in  terms  of 
previously  defined  operations.  For  example, 

A®fi  =  (AUB)\(Aflfi) 


and 


A  ®  B  =  {A  \  B)  U  (B  \  A). 

•  { a ,  b,  c }  ©  {x,  y,  a)  =  { b ,  c,  x,  y) 

•  {a,  b,  c}  ©  0  =  {a,  b,  c) 

•  {a,  b,  c}  ©  {0}  =  {a,  b,  c,  0)  ▲ 

PROBLEM  11.  Use  a  Venn  diagram  to  illustrate  the  plausibility  of  the  fact  that 
©  is  an  associative  operation;  that  is,  use  a  Venn  diagram  to  illustrate  that  for  any 
three  sets  A,  B,  and  C, 

(4)  (A©  B)®C  =  A  0(1?  ©C). 

Solution.  With  reference  to  Fig  2. 1  again,  A  ©  B  consists  of  the  points  in  the 
regions  labeled  1,  2,  5,  and  6  while  C  consists  of  the  points  in  the  regions  2, 
3,  5,  and  7.  Thus,  (A  ©  B)  ©  C  is  the  set  of  points  in  the  regions  1,  3,  6,  and 
7.  On  the  other  hand,  B  ©  C  consists  of  the  regions  2,  7,  4,  and  6  and  A,  of 
regions  1,  2,  3,  4.  Thus,  A  ©  (B  ©  C)  also  consists  of  the  points  in  regions  1,  3, 
6,  and  7.  I 

As  a  consequence  of  (4),  the  expression  A  ©  B  ©  C,  which  conceivably  could 
be  interpreted  in  two  ways,  is  in  fact  unambiguous.  Notice  that  A  ©  B  ©  C  is 
the  set  of  points  in  an  odd  number  of  the  sets  A,  B,  C :  Regions  1,  6,  and  7 
contain  the  points  of  just  one  of  the  sets  while  region  3  consists  of  points  in  all 
three.  More  generally,  the  symmetric  difference  A\  ©  Ao  ©  A3  ©  •  •  •  ©  A„  of  n 
sets  A 1,  A2,  A3,  . . .  ,  A„  is  well  defined  and,  as  it  turns  out,  is  the  set  of  those 
elements  which  are  members  of  an  odd  number  of  the  sets  A,  .  (See  Exercise  21 
of  Section  5.1.) 

The  Cartesian  Product  of  Sets 

There  is  yet  another  way  in  which  two  sets  can  be  combined  to  obtain  another. 
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2.2.6  DEFINITIONS 


EXAMPLE  12 


EXAMPLE  13 


If  A  and  B  are  sets,  the  Cartesian  product  (or  the  direct  product)  of  A  and  B  is 
the  set 

Ax  B  =  [(a,b)  |  ae  A,be  B). 

(We  say  “A  cross  B"  for  “ A  x  B.")  The  Cartesian  product  A  x  A  is  denoted  A1. 
More  generally, 

An  =  AxAx---xA  =  {(tfi,  a2, .  ■  ■  ,  an)  \  a,  €  A  for  i  =  1,  2, . . .  ,  n }. 

n  times 


The  elements  of  A  x  B  are  called  ordered  pairs  because  their  order  is  impor¬ 
tant:  (a,  b)  ^  ( b ,  a)  (unless  a  =  b).  The  elements  a  and  b  are  the  coordinates  of 
the  ordered  pair  (a,  b)\  the  first  coordinate  is  a  and  the  second  is  h.  The  elements 
of  An  are  called  n-tuples. 

Elements  of  Ax  B  are  equal  if  and  only  if  they  have  the  same  first  coordinates 
and  the  same  second  coordinates: 


(a\,  b\)  =  (a2,  b2)  if  and  only  if  a\  —  b\  and  a2  =  62- 


Let  A  =  {a,  b)  and  B  —  {x,  y,  z).  Then 

A  x  B  =  {(a,  x),  (a,  y),  (a,  z),  (b,  x),  ( b ,  y),  (b,  z )} 

and 


B  x  A  —  {(x,  a),  (x ,  b),  (y,  a),  (y,  b ),  (z,  a),  (z,  b)). 

This  example  illustrates  that,  in  general,  the  sets  A  x  B  and  B  x  A  are 
different.  ▲ 

The  Cartesian  plane,  in  which  calculus  students  sketch  curves,  is  a  picture  of 
R  x  R  =  R2  =  { (jc ,  y )  {  x,y  €  Rj.  The  adjective  “Cartesian"  is  derived  from 
Descartes,4  as  Cartesius  was  Descartes’s  name  in  Latin.  ▲ 


PROBLEM  14.  Let  A,  B,  and  C  be  sets.  Prove  that  A  x  (B  U  C)  C  (A  x  B)  U 
(A  x  C). 

Solution.  We  must  prove  that  any  element  in  A  x  (BUC)  is  in  (A  x  B)U(A  xC). 
Since  the  elements  in  A  x  (BUC)  are  ordered  pairs,  we  begin  by  letting  ( x ,  y)  e 
A  x  (BUC)  (this  is  more  helpful  than  starting  with  “reAx B ”)  and  ask  ourselves 
what  this  means.  It  means  that  x,  the  first  coordinate,  is  in  A  and  y,  the  second 
coordinate,  is  in  B  U  C.  Therefore,  y  is  in  either  B  or  C.  If  y  is  in  B,  then,  since 
x  is  in  A,  (jc,  y)  e  A  x  B.  If  y  is  in  C,  then,  since  jf  is  in  A,  (x,  y)  e  A  x  C . 
Thus,  (x,  v)  is  either  in  A  x  B  or  in  A  x  C;  thus,  ( x ,  y)  is  in  (AxB)U(Ax  C), 
which  is  what  we  wanted  to  show.  I 


4Rene  Descartes  (1596-1650),  together  with  Pierre  de  Fermat,  the  inventor  of  analytic  geometry,  intro- 
duced  the  method  of  plotting  points  and  graphing  functions  in  R2  with  which  we  are  so  familiar  today. 
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Let  A,  B.  and  C  be  three  sets.  Prove  that  (A  x  B)  U  (A  x  C)  c  A  x  (B  U  C). 
What  can  you  conclude  about  the  sets  Ax(BUC)  and  (A  x  B)U(A  x  C)?  Why? 

Let  A  and  B  be  nonempty  sets.  Prove  that  A  x  B  =  B  x  A  if  and  only  if  A  =  B. 
Is  this  true  if  A  =  0? 

4.  (— >)  Suppose  the  first  statement,  A  U  B  =  B,  is  true.  We  show  A  c  B.  So  let 
x  €  A.  Then  x  is  certainly  in  A  U  B,  by  the  definition  of  U.  But  A  U  B  =  B, 
so  x  e  B.  Thus,  A  c  B. 

(■*—)  Conversely,  suppose  the  second  statement,  A  c  B,  is  true.  We  have  to 
show  A  U  B  =  B.  To  prove  the  sets  A  U  B  and  B  are  equal,  we  have  to  show 
each  is  a  subset  of  the  other.  First,  let  x  e  A  U  B.  Then  x  is  either  in  A  or  in 
B.  If  the  latter,  x  e  B,  and  if  the  former,  x  e  B  because  A  is  a  subset  of  B. 
In  either  case,  x  e  B.  Thus,  AUB  C  B.  Second,  assume  x  e  B.  Then  x  is  in 
A  U  B  by  definition  of  U.  So  B  C  A  U  B  and  we  have  equality,  as  required. 

5.  B  \  A  =  (4,  5];  Af  =  (-oo,  -4)  U  (4,  oo). 

6.  An  element  of  (AxB)U(Ax  C)  is  either  in  A  x  B  or  in  A  x  C;  in  either 
case,  it’s  an  ordered  pair.  So  we  begin  by  letting  (jc,  y)  e  (A  x  B)  U  (A  x  C) 
and  noting  that  either  (x,  y)  e  A  x  B  or  (jc,  y)  e  A  x  C.  In  the  first  case,  x  is 
in  A  and  y  is  in  B;  in  the  second  case,  x  is  in  A  and  y  is  in  C.  In  either  case, 
x  is  in  A  and  y  is  either  in  B  or  in  C;  so  x  e  A  and  y  e  B  U  C.  Therefore, 
(x,  y)  €  A  x  (BUC),  establishing  the  required  subset  relation.  The  reverse 
subset  relation  was  established  in  Problem  14.  We  conclude  that  the  two  sets 
in  question  are  equal;  that  is,  A  x  (B  U  C)  =  (A  x  B)  U  (A  x  C). 

7.  (-»)  Suppose  that  the  statement  AxB  =  BxA  is  true.  We  prove  A  =  B.  So 
suppose  x  €  A.  Since  B  7^  0.  we  can  find  some  y  e  B.  Thus,  (jc,  y)  e  A  x  B. 
Since  A  x  B  =  B  x  A,  (x.  y)  e  B  x  A.  So  ,v  e  B,  giving  us  A  c  B.  Similarly, 
we  show  that  B  c  A  and  conclude  A  =  B. 

( -e- )  On  the  other  hand,  if  A  =  B  is  a  true  statement,  then  AxB  =  AxA  = 
B  x  A. 

Finally,  if  A  =  0  and  B  is  any  nonempty  set,  then  A  x  B  —  0  —  B  x  A,  but 
A  7^  B.  So  A  x  B  =  B  x  A  does  not  mean  A  =  B  in  the  case  A  =  0. 


EXERCISES 


The  symbol  [BB ]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Let  A  =  {x  e  N  |  x  <  7},  B  =  [x  e  Z  |  |.r  —  2|  <4} 
and  C  =  {jc  e  R  |  jc3  —  4x  =  0). 

(a)  [BB]  List  the  elements  in  each  of  these  sets. 

(b)  Find  A  U  C,  B  n  C,  B  \C.  A  ®  B.  C  x  (B  n  C), 
(A  \  B)  \  C,  A  \  (B  \  C),  and  (B  U  0)  D  {0}. 

(c)  List  the  elements  in  S  =  {(a.  b)  6  AxB  |  a  =  h+ 2} 
and  in  T  =  {(a,  c)  e  A  x  C  \  a  <  c}. 

2.  Let  5  =  (2,  5,  V2,  25,  jr,  §}  and  T  =  [4,  25,  sfl,  6,  §}. 

(a)  [BB]  Find  5  n  T,  S  U  T,  and  T  x  (S  ft  T). 

(b)  [BB]  Find  Z  U  5,  Z  n  S,  Z  U  T,  and  Z  n  T. 


(c)  List  the  elements  in  each  of  the  sets  Z  n  (S  U  T)  and 
(ZnS)U(Zfir).  What  do  you  notice? 

(d)  List  the  elements  of  ZU(5f17')  and  list  the  elements 
of  (Z  U  S)  D  (Z  U  T).  What  do  you  observe? 

3.  Let  A  =  {(-1,2),  (4,5),  (0,0),  (6,  -5),  (5,  1),  (4,  3)). 
List  the  elements  in  each  of  the  following  sets. 

(a)  [BB]  (a  +  b  |  (a,  b)  g  A] 

(b)  [a  |  a  >  0  and  («,  b)  e  A  for  some  b) 

(c)  {b  |  b  =  k1 2  for  some  ieZ  and  ( a ,  b)  e  A  for 
some  «} 
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4.  List  the  elements  in  the  sets  A  =  {(a,  b)  e  N  x  N  |  a  < 
b,b  <  3}  and  B  =  (|  |  a,  B  e  {— 1,1,2}}. 

5.  For  A  =  [a,  b ,  c,  [a,  B}},  find 

(a)  [BB]  A  \  [a,b] 

(b)  {0}  \  V(A) 

(c)  A  \  0 

(d)  0  \  A 

(e)  [BB]  [a,  B,  c]  \  A 

(f)  ({a,  b,  c}  U  {A})  \  A 

6.  Find  Ac  (with  respect  to  U  =  R)  in  each  of  the  following 
cases. 

(a)  [BB]  A  =  (1,  oo)  U  (-oo,  -2] 

(b)  A  =  (—3,  oo)  fl  (—oo,  4] 

(c)  A  =  [jc  e  R  \x2  <  -1] 

7.  Let  n  >  3  and  A  =  [1, 2,  3, . . .  ,  «}. 

(a)  [BB]  How  many  subsets  of  A  contain  [1, 2}? 

(b)  How  many  subsets  fl  of  A  have  the  property  that 

B  n  [1,2]  =  0? 

(c)  How  many  subsets  B  of  A  have  the  property  that 
fl  U  { 1, 2}  =  A? 

Explain  your  answers. 

8.  [BB]  Let  a  and  b  be  real  numbers  with  a  <  b.  Find 
(a.  b)c,  [a,  b)c,  (a,  oo)c,  and  (— oo,  b]c . 

9.  The  universal  set  for  this  problem  is  the  set  of  students 
attending  Miskatonic  University.  Let 

•  M  denote  the  set  of  math  majors 

•  CS  denote  the  set  of  computer  science  majors 

•  T  denote  the  set  of  students  who  had  a  test  on  Friday 

•  P  denote  those  students  who  ate  pizza  last  Thursday. 

Using  only  the  set  theoretical  notation  we  have  intro¬ 
duced  in  this  chapter,  rewrite  each  of  the  following 
assertions. 

(a)  [BB]  Computer  science  majors  had  a  test  on  Friday. 

(b)  [BB]  No  math  major  ate  pizza  last  Thursday. 

(c)  One  or  more  math  majors  did  not  eat  pizza  last 
Thursday. 

(d)  Those  computer  science  majors  who  did  not  have  a 
test  on  Friday  ate  pizza  on  Thursday. 

(e)  Math  or  computer  science  majors  who  ate  pizza  on 
Thursday  did  not  have  a  test  on  Friday. 

10.  Using  only  the  set  theoretical  notation  introduced  in  this 
chapter,  express  the  negation  and  converse  of  statements 
(a),  (c),  and  (d)  in  the  previous  question. 


11.  Let  P  denote  the  set  of  primes  and  E  the  set  of  even  inte¬ 
gers.  As  always,  Z  and  N  denote  the  integers  and  natural 
numbers,  respectively.  Find  equivalent  formulations  of 
each  of  the  following  statements  using  the  notation  of 
set  theory  that  has  been  introduced  in  this  section. 

(a)  [BB]  There  exists  an  even  prime. 

(b)  0  is  an  integer  but  not  a  natural  number. 

(c)  Every  prime  is  both  a  natural  number  and  an  integer. 

(d)  Every  prime  except  2  is  odd. 

12.  For  n  e  Z,  let  An  =  [a  e  Z  |  a  <  n }.  Find  each  of  the 
following  sets. 

(a)  [BB]  A3  U  A_3 

(b)  A 3  n  a„3 

(e)  a3  n  (A_3)c 

(d>  nto 

13.  [BB]  In  Fig  2.1,  the  region  labeled  7  represents  the  set 
C\(AUB).  What  set  is  represented  by  the  region  labeled 
2?  By  that  labeled  3?  By  that  labeled  4? 

14.  (a)  [BB]  Suppose  A  and  B  are  sets  such  that  AOB  =  A. 

What  can  you  conclude?  Why? 

(b)  Repeat  (a)  assuming  A  U  B  =  A. 

15.  [BB]  Let  n  >  1  be  a  natural  number.  How  many  ele¬ 
ments  are  in  the  set  {(a,b)  €NxN|a<B<  «)? 
Explain. 

16.  Suppose  A  is  a  subset  of  N  x  N  with  the  properties 
.  (1 .  1 )  €  A  and 

.  (a,  b)  €  A  — >■  (a  +  1 .  b)  and  (a  +  1 .  b  +  1)  are  both 
in  A. 

Do  you  think  that  {( m ,  n)  €  N  x  N  |  tn  >  n )  is  a  subset 
of  A?  Explain.  [Hint:  A  picture  of  A  in  the  xy-plane 
might  help.] 

17.  Let  A.  B,  and  C  be  subsets  of  some  universal  set  U . 

(a)  Prove  that  AP\  B  C  C  and  AcnBCC^-BCC. 

(b)  [BB]  Given  that  AflB  =  AflC  and  AeHB  =  AeflC, 
does  it  follow  that  B  =  C?  Justify  your  answer. 

18.  Let  A,  B,  and  C  be  sets. 

(a)  Find  a  counterexample  to  the  statement  A  U  (B  n 
C)  =  (AUB)nC. 

(b)  Without  using  Venn  diagrams,  prove  that  A  U  (B  fl 
C)  =  (AUB)n(AUC). 

19.  Use  the  first  law  of  De  Morgan  to  prove  the  second: 
(A  n  B)c  =  Ac  U  Bc . 

20.  [BB]  Use  the  laws  of  De  Morgan  and  any  other  set 
theoretic  identities  discussed  in  the  text  to  prove  that 
(A\B)\C  =  A\(BUC)  for  any  sets  A,  B,  and  C. 
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21.  Let  A,  B ,  C,  and  D  be  subsets  of  a  universal  set  U.  Use 
set  theoretic  identities  discussed  in  the  text  to  simplify 
the  expression  [(A  U  B)c  fl  ( Ac  U  C)c]c  \  Dc . 

22.  Let  A,  B,  and  C  be  subsets  of  some  universal  set  U . 

Use  set  theoretic  identities  discussed  in  the  text  to  prove 
that  A  \  (B  \  C)  =  (A  \  B)  U  (A  \  Cc ). 

23.  Suppose  A,  B.  and  C  are  subsets  of  some  universal 
set  U . 

(a)  [BB]  Generalize  the  laws  of  De  Morgan  by  finding 
equivalent  ways  to  describe  the  sets  (A  U  B  U  C  f 
and  (AHSn  C)c. 

(b)  Find  a  way  to  describe  the  set  (A  fl  (fl  \  C))c  n  A 
without  using  the  symbol  c  for  set  complement. 

24.  Let  A  and  B  be  sets. 

(a)  [BB]  Find  a  necessary  and  sufficient  condition  for 
A®  B  =  A. 

(b)  Find  a  necessary  and  sufficient  condition  for  A  n 
fl=AUfl. 

Explain  your  answers  (with  Venn  diagrams  if  you  wish). 

25.  Which  of  the  following  conditions  imply  that  B  =  Cl 
In  each  case,  either  prove  or  give  a  counterexample.  ife)  29. 

(a)  [BB]  AUfl  =  AUC 


(b)  Aflfl  =  AnC 

(c)  A  ©  B  =  A  ®  C 

(d )  A  x  B  =  A  x  C 

26.  True  or  false?  In  each  case,  provide  a  proof  or  a  counter¬ 
example. 

(a)  AcC,flcfl-»Ax8CCxD. 

(b)  AxBCCxD—^ACC  and  BCD. 

(c)  ACC  and  BCD  if  and  only  if  A  x  B  C  C  x  D. 

(d)  [BB]  A  U  5  CAD5  A  =  B. 

27.  Show  that  (A  fl  B)  x  C  =  (A  x  C)  fl  (B  x  C)  for  any 
sets  A,  B,  and  C. 

28.  Let  A,  B,  and  C  be  arbitrary  sets.  For  each  of  the  follow¬ 
ing,  either  prove  the  given  statement  is  true  or  exhibit  a 
counterexample  to  prove  it  is  false. 

(a)  A  \  (B  U  C)  =  (A  \  B)  U  (A  \  C) 

(b)  (A  \  B)  x  C  =  (A  x  C)  \  (B  x  C) 

(c)  [BB]  (A®fl)xC  =  (AxC)0(flxC) 

(d)  (AUfl)x(CUD)  =  (AxC)U(flx  D) 

(e)  (A  \  B)  x  (C  \  D)  =  (A  x  C)  \  (B  x  D) 

Find  out  what  you  can  about  George  Boole  and  write  a 
paragraph  or  two  about  him  (in  good  English,  of  course). 


2.3  Binary  Relations 

If  A  and  B  are  sets,  remember  that  the  Cartesian  product  of  A  and  B  is  the  set 
A  x  B  ~  {(a,  b)  \  a  e  A,  b  €  B}.  There  are  occasions  when  we  are  interested  in 
a  certain  subset  of  A  x  fl.  For  example,  if  A  is  the  set  of  former  major  league 
baseball  players  and  B  =  NU{0}  is  the  set  of  nonnegative  integers,  then  we  might 
naturally  be  interested  in 

TZ  =  {(a,  b)  |  a  e  A,  b  €  B,  player  a  had  b  career  home  runs}. 

For  example,  (Hank  Aaron,  755)  and  (Mickey  Mantle,  536)  are  elements  of  71. 


2.3.1  DEFINITIONS 


Let  A  and  B  denote  sets.  A  binary  relation  from  A  to  B  is  a  subset  of  A  x  B.  A 
binary  relation  on  A  is  a  subset  of  A  x  A. 


The  empty  set  and  the  entire  Cartesian  product  A  x  B  are  always  binary 
relations  from  A  to  B.  although  these  are  generally  not  as  interesting  as  certain 
nonempty  proper  subsets  of  A  x  fl. 

•  If  A  is  the  set  of  students  who  were  registered  at  the  University  of  Toronto 
during  the  Fall  2001  semester  and  B  is  the  set  (History,  Mathematics,  English, 
Biology},  then  TZ  =  {( a ,  b)  \  a  e  A  is  enrolled  in  a  course  in  subject  b }  is  a 
binary  relation  from  A  to  B. 


EXAMPLES  15 
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2.3.2  DEFINITION 


EXAMPLES  16 


2.3.3  DEFINITION 


EXAMPLES  18 


.  Let  A  be  the  set  of  surnames  of  people  listed  in  the  Seattle  telephone  directory. 
Then  R  —  {{a,  n)  |  a  appears  on  page  n]  is  a  binary  relation  from  A  to  the 
set  N  of  natural  numbers. 

•  {(a,  b)  |  a,  b  e  N,  |  is  an  integer}  and  {(a,  b)  \  a,  b  e  N,  a  —  b  =  2}  are 
binary  relations  on  N. 

•  {  (jc,  y)  I  y  =  a2}  is  a  binary  relation  on  R  whose  graph  the  reader  may 

recognize.  ▲ 

Our  primary  intent  in  this  section  is  to  identify  special  properties  of  binary 
relations  on  a  set,  so,  henceforth,  all  binary  relations  will  be  subsets  of  A  x  A  for 
some  set  A. 

A  binary  relation  R  on  a  set  A  is  reflexive  if  and  only  if  (a,  a)  e  R  for  all  a  &  A. 

•  {  (jc  ,  y)  e  R2  |  x  <  y}  is  a  reflexive  relation  on  R  since  .v  <  x  for  any  .v  e  R. 

•  {(a,  b)  e  N2  |  |  e  N}  is  a  reflexive  relation  on  N  since  is  an  integer,  namely 
1,  for  any  a  e  N. 

.  R.  =  {(a,  y)  €  R2  |  ,v2  +  y2  >  0}  is  not  a  reflexive  relation  on  R  since 
(0,  0)  ^  R.  (This  example  reminds  us  that  a  reflexive  relation  must  contain 
all  pairs  of  the  form  (a,  a):  Most  is  not  enough.)  ▲ 


PROBLEM  17.  Suppose  R  =  {(a,  b)  e  Z  x  Z  |  a2  =  b2}.  Criticize  and  then 
correct  the  following  "proof'  that  R  is  reflexive: 

(a,  a)  e  R  if  a~  =  a2. 

Solution.  The  statement  “(a,  a)  €  R  if  a2  —  a2”  is  the  implication  “ a 2  —  a 2  -*■ 
(a,  a )  €  R,"  which  has  almost  nothing  to  do  with  what  is  required.  To  prove  that 
R  is  reflexive,  we  must  establish  an  implication  of  the  form  “something  — ^  72.  is 
reflexive.”  Here  is  a  good  argument,  in  this  case. 

For  any  integer  a,  we  have  a2  —  a 2  and,  hence,  (a,  a)  &R.  Therefore, 

R  is  reflexive.  | 


A  binary  relation  R  on  a  set  A  is  symmetric  if  and  only  if 

if  a,  b  e  A  and  (a,  b )  €  R,  then  (b,  a )  e  R. 

•  R  —  {(a,  y)  e  R2  |  a2  +  y2  =  1 }  is  a  symmetric  relation  on  R  since  if 
a2  +  y2  =  1,  then  y2  +  a2  =  1  too:  If  (a,  y)  e  R,  then  (y,  a)  e  R. 

•  {(a,  y)  e  Z  x  Z  |  a  —  y  is  even)  is  a  symmetric  relation  on  Z  since  if  a  —  y  is 
even,  so  is  y  —  a. 

•  R  =  {(a,  y)  €  R2  |  a2  >  y)  is  not  a  symmetric  relation  on  R.  For  example, 

(2,  1)  e  R  because  22  >  1,  but  (1,2)  0  R  because  l2  ^  2.  A 

Suppose  R  is  a  binary  relation  on  A  =  R2.  In  this  case,  the  elements  of  R, 
being  ordered  pairs  of  elements  of  A,  are  ordered  pairs  of  elements  each  of  which 
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is  an  ordered  pair  of  real  numbers.  Consider,  for  example, 

1Z  =  {(  (jc,  y),  ( u ,  v) )  e  R2  x  R2  |  x2  +  y2  =  w2  +  v2}. 


Pause  8 


This  is  a  symmetric  relation  on  R2  since  if  ( (x,  y ),  («,  v))  e  1Z,  then  .v2  +  y2  = 
u2  +  u2,  so  u2  +  v 2  —  .v2  +  y2,  so  ( ( u ,  v),  (x,  y) )  e  7Z. 

Is  this  relation  reflexive? 


A  binary  relation  on  the  real  numbers  (or  on 
any  subset  of  R)  is  symmetric  if,  when  its  points  are 
plotted  as  usual  in  the  Cartesian  plane,  the  figure 
is  symmetric  about  the  line  with  equation  y  =  x. 

The  set  {(x,  y)  6  R2  |  x2  +  y2  =  1}  is  a  symmetric 
relation  because  its  points  are  those  on  the  graph  of 
the  unit  circle  centered  at  the  origin  and  this  circle 
is  certainly  symmetric  about  the  line  with  equation 
y  =  x. 

If  a  set  A  has  n  elements  and  n  is  reasonably  small,  a  binary  relation  on  A 
can  be  conveniently  described  by  labeling  with  the  elements  of  A  the  rows  and 
the  columns  of  an  nxn  grid  and  then  inserting  some  symbol  in  row  a  and  column 
b  to  indicate  that  (a,  b )  is  in  the  relation. 


EXAMPLE  19  The  picture  in  Fig  2.2  describes  the  relation 

1Z  =  {{  1,  1),  (1,2),  (1,4),  (2,  1),  (2.  2),  (3.  2),  (3,  3),  (4,  4)} 

on  the  set  A  =  {1,2,  3,  4).  This  relation  is  reflexive  (all  points  on  the  main 
diagonal — top  left  corner  to  lower  right — are  present),  but  not  symmetric  (the  x’s 
are  not  symmetrically  located  with  respect  to  the  main  diagonal).  For  example, 
there  is  a  x  in  row  1,  column  4,  but  not  in  row  4,  column  1.  ▲ 


Figure  2.2 


2.3.4  DEFINITION 


A  binary  relation  1Z  on  a  set  A  is  antisymmetric  if  and  only  if 

if  a,  b  €  A  and  both  (a,  b)  e  7Z  and  ( b ,  a)  are  in  1Z ,  then  a  =  b. 


EXAMPLE  20  1Z  =  {(x,  y)  e  R2  |  x  <  y)  is  an  antisymmetric  relation  on  R  since  x  <  y  and 
y  <  x  implies  x  =  y;  thus,  (x,  y)  e  1Z  and  (y,  x)  el Z  implies  x  =  y.  ▲ 

EXAMPLE  21  If  S  is  a  set  and  A  =  V(S)  is  the  power  set  of  S,  then  {(X,  Y)  \  X,  Y  e  V(S),  X  C 
T)  is  antisymmetric  since  X  c  Y  and  Y  c  X  implies  X  =  Y.  A 
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EXAMPLE  22 


ft  =  {(1,  2),  (2,  3),  (3,  3),  (2,  1)}  is  not  antisymmetric  on  A  =  { 1, 2,  3}  because 
(1,  2)  eft  and  (2,1)  eft  but  1#  2.  A 


Note  that  “antisymmetric'’  is  not  the  same  as  “not  symmetric.”  The  relation 
in  Example  22  is  not  symmetric  but  neither  is  it  antisymmetric. 

Why  is  this  relation  not  symmetric? 

Is  the  relation  ft  =  {(  (x,  y),  (n,  v) )  e  R2xR2  |  x2+_y2  =  w2  +  tr)  antisymmetric? 


2.3.5  DEFINITION 


A  binary  relation  ft  on  a  set  A  is  transitive  if  and  only  if 

if  a,  b,c  e  A,  and  both  (a,  b )  and  ( b ,  c)  are  in  ft,  then  (a,  c)  e  ft. 


EXAMPLE  23 

EXAMPLE  24 

EXAMPLES  25 


Answers  to  Pauses 


ft  =  {(jc,  y)  e  R2  |  x  <  y}  is  a  transitive  relation  on  R  since  if  x  <  y  and  y  <  z, 

then  x  <  z'-  if  ( x ,  y)  and  (y,  z)  are  in  ft,  then  (x,  z)  e  ft,  A 

{(a,  b)  e  Z  x  Z  |  |  is  an  integer}  is  a  transitive  relation  on  Z  since  if  |  and  |  are 

integers,  then  so  is  “  because  ~  =  f,  ■  A 

•  ft  =  {(x,  y),  (x,  z),  (y,  «),  (x,  m)}  is  a  transitive  binary  relation  on  the  set 
{x,  y,  z,  u}  because  there  is  only  one  pair  of  the  form  (a,  b),  ( b ,  c)  belonging 
to  ft  (namely  (x,y)  and  (y,  u))  and,  for  this  pair,  it  is  true  that  ( a,c )  = 
(x,  u )  €  ft. 

•  ft  =  {(a,  b),  (b,  a).  (a,  a)}  is  not  transitive  on  [a,b]  since  it  contains  the 
pairs  (b,  a)  and  ( a ,  b),  but  not  the  pair  ( b ,  b). 

•  {(o,  b)  |  a  and  b  are  people  and  a  is  an  ancestor  of  b]  is  a  transitive  relation 
since  if  a  is  an  ancestor  of  b  and  b  is  an  ancestor  of  c,  then  a  is  an  ancestor 
of  c. 

.  ft  =  {(x,  y)  e  R2  |  x2  >  y}  is  not  transitive  on  R  because  (3,  4)  e  ft  (32  >  4) 
and  (4,  10)  e  ft  (42  >  10)  but  (3,  10)  g  ft  (32  t  10).  A 

8.  The  answer  is  yes.  For  any  (x,  y)  €  R2,  we  have  x2  +  y2  =  x2  +  y2;  in  other 
words,  ( (x,  y),  (x,  y) )  e  ft  for  any  (x,  y)  e  R2. 

9.  (2,  3)  e  ft  but  (3,  2)  £  ft. 

10.  No.  For  example,  ((1,2),  (2,  1))  €  ft  because  l2  +  22  =  22  +  l2  and, 
similarly,  ( (2,  1),  (1,  2) )  €  ft;  however,  (1,2)^  (2,  1). 


EXERCISES 


Back  of  the  Book. 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the 

1.  [BB]  Let  B  denote  the  set  of  books  in  a  college  library 
and  S  denote  the  set  of  students  attending  that  college. 
Interpret  the  Cartesian  product  S  x  B.  Give  a  sensible 
example  of  a  binary  relation  from  S  to  B . 

2.  Let  A  denote  the  set  of  names  of  streets  in  St.  John’s, 
Newfoundland  and  B  denote  the  names  of  all  the 


residents  of  St.  John’s.  Interpret  the  Cartesian  product 
A  x  B.  Give  a  sensible  example  of  a  binary  relation 
from  A  to  B. 

3.  Determine  which  of  the  properties  reflexive,  symmetric, 
transitive  apply  to  the  following  relations  on  the  set  of 
people. 
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(a)  [BB]  is  a  father  of 

(b)  is  a  friend  of 

(c)  [BB]  is  a  descendant  of 

(d)  have  the  same  parents 

(e)  is  an  uncle  of 

4.  With  a  table  like  that  in  Fig  2.2,  illustrate  a  relation  on 
the  set  (a,  b,  c,  d }  which  is 

(a)  [BB]  reflexive  and  symmetric 

(b)  not  symmetric  and  not  antisymmetric 

(c)  not  symmetric  but  antisymmetric 

(d)  transitive 

Include  at  least  six  elements  in  each  relation. 

5.  Let  A  =  [1 , 2,  3}.  List  the  ordered  pairs  in  a  relation  on 
A  which  is 

(a)  [BB]  not  reflexive,  not  symmetric,  and  not  transitive 

(b)  reflexive,  but  neither  symmetric  nor  transitive 

(c)  symmetric,  but  neither  reflexive  nor  transitive 

(d)  transitive,  but  neither  reflexive  nor  symmetric 

(e)  reflexive  and  symmetric,  but  not  transitive 

(f)  reflexive  and  transitive,  but  not  symmetric 

(g)  [BB]  symmetric  and  transitive,  but  not  reflexive 

(h)  reflexive,  symmetric,  and  transitive 

6.  Is  it  possible  for  a  binary  relation  to  be  both  symmetric 
and  antisymmetric?  If  the  answer  is  no,  why  not?  If  it 
is  yes,  find  all  such  binary  relations. 

7.  [BB]  What  is  wrong  with  the  following  argument  which 
purports  to  prove  that  a  binary  relation  which  is  sym¬ 
metric  and  transitive  must  necessarily  be  reflexive  as 
well? 

Suppose  TZ  is  a  symmetric  and  transitive 
relation  on  a  set  A  and  let  a  e  A.  Then 
for  any  b  with  (a,b)  e  TZ,  we  have  also 
( b ,  a)  e  TZ  by  symmetry.  Since  now  we 
have  both  (a,b)  and  (b,a)  in  TZ,  we  have 
(a,  a)  €  TZ  as  well,  by  transitivity.  Thus, 

(a,  a)  e  TZ,  so  7Z  is  reflexive. 

8.  Determine  whether  or  not  each  of  the  binary  relations 
TZ  defined  on  the  given  sets  A  are  reflexive,  symmetric, 
antisymmetric,  or  transitive.  If  a  relation  has  a  certain 
property,  prove  this  is  so;  otherwise,  provide  a  coun¬ 
terexample  to  show  that  it  does  not. 

(a)  [BB]  A  is  the  set  of  all  English  words;  (a,  b)  e  TZ 
if  and  only  if  a  and  b  have  at  least  one  letter  in 
common. 


(b)  A  is  the  set  of  all  people,  (a,  b)  e  TZ  if  and  only  if 
neither  a  nor  b  is  currently  enrolled  at  the  Miska- 
tonic  University  or  else  both  are  enrolled  at  MU  and 
are  taking  at  least  one  course  together. 

9.  Answer  Exercise  8  for  each  of  the  following  relations; 

(a)  A  =  {1,2};  K=  {(1,2)}. 

(b)  [BB]  A  =  [1,2,3, 4};  TZ  =  {(1,  1),  (1,2),  (2,  1), 
(3,4)}. 

(c)  [BB]  A  =  Z;  (a,  b)  €  TZ  if  and  only  if  ab  >  0. 

(d)  A  =  R;  (a,  b)  e  TZ  if  and  only  if  a 2  =  b2. 

(e)  A  =  R;  (a,  b)  e  TZ  if  and  only  if  a  —  b  <  3. 

(f)  A  =  ZxZ;  ((a,  b),  (c,  d))  €  TZ  if  and  only  if  a  —  c  = 
b-d. 

(g)  A  =  N;  TZ  is  /. 

(h)  A  =  Z;  TZ  =  [(x ,  y)  \  x  +  y  =  10}. 

(i)  [BB]  A  =  R2;  TZ  =  {((x,  y),  (a,  u))  |  x+y  <  u  +  v}. 

(j)  A  =  N;  (a,  b)  €  TZ  if  and  only  if  |  is  an  integer. 

(k)  A  =  Z;  (a,  b)  e  TZ  if  and  only  if  |  is  an  integer. 

10.  Let  5  be  a  set  which  contains  at  least  two  elements  a 
and  b.  Let  A  be  the  power  set  of  S.  Determine  which 
of  the  properties — reflexivity,  symmetry,  antisymmetry, 
transitivity — each  of  the  following  binary  relations  TZ 
on  A  possesses.  Give  a  proof  or  counterexample  as 
appropriate. 

(a)  [BB]  (X,  Y)  €  TZ  if  and  only  if  X  c  Y. 

(b)  (X,  Y)  6  TZ  if  and  only  if  X  C  y. 

(c)  (X,  Y)  6  TZ  if  and  only  if  X  D  F  =  0. 

11.  Let  A  be  the  set  of  books  for  sale  in  a  certain  university 
bookstore  and  assume  that  among  these  are  books  with 
the  following  properties. 


Book 

Price 

Length 

U 

$10 

100  pages 

W 

$25 

125  pages 

X 

$20 

150  pages 

Y 

$10 

200  pages 

Z 

$  5 

100  pages 

(a)  [BB]  Suppose  (a,  b)  €  1 Z  if  and  only  if  the  price  of 
book  a  is  greater  than  or  equal  to  the  price  of  book 
b  and  the  length  of  a  is  greater  than  or  equal  to  the 
length  of  b.  Is  TZ  reflexive?  Symmetric?  Antisym¬ 
metric?  Transitive? 

(b)  Suppose  (a,  b)  e  TZ  if  and  only  if  the  price  of  a  is 
greater  than  or  equal  to  the  price  of  b  or  the  length 
of  a  is  greater  than  or  equal  to  the  length  of  b.  Is  TZ 
reflexive?  Symmetric?  Antisymmetric?  Transitive? 
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2.4  Equivalence  Relations 

It  is  useful  to  think  of  a  binary  relation  on  a  set  A  as  establishing  relationships 
between  elements  of  A,  the  assertion  “(a,  b)  e  VS'  relating  the  elements  a  and  b. 
Such  relationships  occur  everywhere.  Two  people  may  be  of  the  same  sex,  have 
the  same  color  eyes,  live  on  the  same  street.  These  three  particular  relationships 
are  reflexive,  symmetric,  and  transitive  and  hence  equivalence  relations. 


2.4.1  DEFINITION 


An  equivalence  relation  on  a  set  A  is  a  binary  relation  V  on  A  which  is  reflexive, 
symmetric,  and  transitive. 


Suppose  A  is  the  set  of  all  people  in  the  world  and 

V  =  {(a,  b)  e  A  x  A  \  a  and  b  have  the  same  parents}. 

This  relation  is  reflexive  (every  person  has  the  same  set  of  parents  as  him¬ 
self/herself),  symmetric  (if  a  and  b  have  the  same  parents,  then  so  do  b  and 
a),  and  transitive  (if  a  and  b  have  the  same  parents,  and  b  and  c  have  the  same 
parents,  then  a  and  c  have  the  same  parents)  and  so  V  is  an  equivalence  relation. 
It  may  be  because  of  examples  like  this  that  it  is  common  to  say  “a  is  related  to 
b ,”  rather  than  “( a ,  b)  e  V,"  even  for  an  abstract  binary  relation  V. 

If  V  is  a  binary  relation  on  a  set  A  and  a.b  e  A,  some  authors  use  the 
notation  aVb  to  indicate  that  (a,b)  e  V.  In  this  section,  we  will  usually  write 
a  ~  b  and,  in  the  case  of  an  equivalence  relation,  say  “ct  is  equivalent  to  b .”  Thus, 
to  prove  that  V  is  an  equivalence  relation,  we  must  prove  that  V  is 

reflexive:  a  ~  a  for  all  a  e  A, 

symmetric:  if  a  e  A  and  b  e  A  and  a  ~  b,  then  b  ~  a.  and 
transitive:  if  a,  b,  c  €  A  and  both  a  ~  b  and  b  ~  c,  then  a  ~  c. 


EXAMPLE  26  Let  A  be  the  set  of  students  currently  registered  at  the  University  of  Southern 
California.  For  a,  b  e  A,  call  a  and  b  equivalent  if  their  student  numbers  have  the 
same  first  two  digits.  Certainly  a  ~  a  for  every  student  a  because  any  number 
has  the  same  first  two  digits  as  itself.  If  a  ~  b,  the  student  numbers  of  a  and  b 
have  the  same  first  two  digits,  so  the  student  numbers  of  b  and  a  have  the  same 
first  two  digits;  therefore,  b  ~  a.  Finally,  if  a  ~  b  and  b  ~  c,  then  the  student 
numbers  of  a  and  b  have  the  same  first  two  digits,  and  the  student  numbers  of 
b  and  c  have  the  same  first  two  digits,  so  the  student  numbers  of  a  and  c  have 
the  same  first  two  digits.  Since  V  is  reflexive,  symmetric,  and  transitive,  V  is  an 
equivalence  relation  on  A.  A 

EXAMPLE  27  Let  A  be  the  set  of  all  residents  of  the  continental  United  States.  Call  a  and  b 
equivalent  if  a  and  b  are  residents  of  the  same  state  (or  district).  The  student 
should  mentally  confirm  that  ~  defines  an  equivalence  relation.  A 

EXAMPLE  28  (Congruence  mod  3)5  Define  ~  on  the  set  Z  of  integers  by  a  ~  b  if  a  —  b  is 
divisible  (evenly)  by  3.6  For  any  a  eZ,  a—  a  =  0  is  divisible  by  3  and  so  a  ~  a. 


5This  is  an  example  of  an  important  equivalence  relation  called  congruence  to  which  we  later  devote 
an  entire  section,  Section  4.4. 

6Within  the  context  of  integers,  divisible  always  means  divisible  evenly ,  that  is,  with  remainder  0. 
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If  a,  b  e  Z  and  a  ~  b,  then  a  -  b  is  divisible  by  3,  so  —  a  (the  negative  of 
a  —  b)  is  also  divisible  by  3.  Hence,  b  ~  a.  Finally,  if  a.  b,  c  €  I  with  <2  ~  b  and 
b  ~  c,  then  both  a  —  b  and  &  —  c  are  divisible  by  3,  so  a  —  c,  being  the  sum  of 
a  —  b  and  b  —  c,  is  also  divisible  by  3.  Thus,  ~  is  an  equivalence  relation.  ▲ 


EXAMPLE  29  The  relation  <  on  the  real  numbers — a  ~  b  if  and  only  if  a  <  b — is  not  an 
equivalence  relation  on  R.  While  it  is  reflexive  and  transitive,  it  is  not  symmetric: 
4  <  5  but  5  2:  4.  ▲ 

■Bl  Let  A  be  the  set  of  all  people.  For  a,b  e  A,  define  a  ~  b  if  either  (i)  both  a  and 
b  are  residents  of  the  same  state  of  the  United  States  or  (ii)  neither  a  nor  b  is  a 
resident  of  any  state  of  the  United  States.  Does  ~  define  an  equivalence  relation? 

Surely  the  three  most  fundamental  properties  of  equality  are 

reflexivity:  a  =  a  for  all  a; 

symmetry:  if  a  —  b,  then  b  —  a;  and 

transitivity:  if  a  =  b  and  b  =  c,  then  a  —  c. 

Thus,  equality  is  an  equivalence  relation  on  any  set.  For  this  reason,  we  think 
of  equivalence  as  a  weakening  of  equality.  We  have  in  mind  a  certain  characteristic 
or  property  of  elements  and  wish  only  to  consider  as  different,  elements  which 
differ  with  respect  to  this  characteristic.  Little  children  may  think  of  their  brothers 
and  sisters  as  the  same  and  other  children  as  “different.”  A  statistician  trying 
to  estimate  the  percentages  of  people  in  the  world  with  different  eye  colors  is 
only  interested  in  eye  color;  for  her,  two  people  are  only  “different”  if  they 
have  different  colored  eyes.  All  drop-off  points  in  a  given  neighborhood  of  town 
may  be  the  “same”  to  the  driver  of  a  newspaper  truck.  An  equivalence  relation 
changes  our  view  of  the  universe  (the  underlying  set  A);  instead  of  viewing  it 
as  individual  elements,  attention  is  directed  to  certain  groups  or  subsets.  The 
equivalence  relation  “same  parents”  groups  people  into  families;  “same  color 
eyes”  groups  people  by  eye  color;  “same  neighborhood”  groups  newspaper  drop¬ 
off  points  by  neighborhood. 

The  groups  into  which  an  equivalence  relation  divides  the  underlying  set  are 
called  equivalence  classes.  The  equivalence  class  of  an  element  is  the  collection 
of  all  things  related  to  it. 


2.4.2  DEFINITION 


If  ~  denotes  an  equivalence  relation  on  a  set  A,  the  equivalence  class  of  an 
element  a  e  A  is  the  set  a  =  [x  e  A  |  x  ~  a}.  The  set  of  all  equivalence  classes 
is  called  the  quotient  set  of  A  mod  ~  and  denoted  A/~. 


Since  an  equivalence  relation  is  symmetric,  it  does  not  matter  whether  we 
write  x  ~  a  or  a  ~  x  in  the  definition  of  a.  The  set  of  things  related  to  a  is  the 
same  as  the  set  of  things  to  which  a  is  related. 

For  the  equivalence  relation  in  Example  26,  the  students  who  are  related  to  a 
particular  student  x  are  those  whose  student  numbers  have  the  same  first  two  digits 
as  x’s  student  number.  For  this  equivalence  relation,  a  typical  equivalence  class  (a 
typical  member  of  the  quotient  set)  is  the  set  of  all  students  whose  student  numbers 
begin  with  the  same  first  two  digits.  The  set  of  all  students  has  been  grouped  into 
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smaller  sets — the  class  of  87,  for  instance  (all  students  whose  numbers  begin  87), 
the  class  of  90  (all  students  whose  numbers  begin  90),  and  so  forth. 

In  Example  27,  if  x  is  a  resident  of  the  continental  United  States,  then  the 
people  to  whom  x  is  related  are  those  people  who  reside  in  the  same  state  or 
district  as  x.  The  residents  of  Colorado,  for  example,  form  one  equivalence  class, 
as  do  the  residents  of  Rhode  Island,  the  residents  of  Florida,  and  so  on.  The 
quotient  set  is  the  set  of  (forty-nine)  American  states  on  the  North  American 
continent,  together  with  the  District  of  Columbia. 

What  are  the  equivalence  classes  for  the  equivalence  relation  which  is  con¬ 
gruence  mod  3?  What  is  0,  the  equivalence  class  of  0,  for  instance?  If  a  ~  0, 
then  a  —  0  is  divisible  by  3;  in  other  words,  a  is  divisible  by  3.  Thus,  0  is  the  set 
of  all  integers  which  are  divisible  by  3.  We  shall  denote  this  set  3Z.  So  0  =  3Z. 
What  is  1?  If  a  ~  1,  then  a  —  1  =  3  k  for  some  integer  k,  so  a  =  3k  +  1.  Thus, 
1  =  {3&+1  |  k  e  Z},  a  set  we  denote  3Z+1.  Similarly,  2  =  {3k+2  |  k  e  Z}  =  3Z+2 
and  we  have  found  all  the  equivalence  classes  for  congruence  mod  3. 

Why? 


In  general,  for  given  natural  numbers  n  and  r,  nZ  +  r  is  the  set  of  integers  of 
the  form  na  +  r  for  some  a  e  Z: 


(5) 


nZ  +  r  =  {na  +  r  \  a  e  Z). 


Also,  we  write  nZ  instead  of  nZ  +  0: 


(6) 


nZ  =  {na  \  a  e  Z). 


The  even  integers,  for  instance,  can  be  denoted  2Z. 

What  are  the  equivalence  classes  for  the  equivalence  relation  described  in 
Pause  1 1  ?  How  many  elements  does  the  quotient  set  contain? 


PROBLEM  30.  For  (x.  v)  and  («,  v)  in  R2,  define  (x,  y)  ~  («,  v)  if  x2  +  y2  = 
u2  +  v2.  Prove  that  ~  defines  an  equivalence  relation  on  R2  and  interpret  the 
equivalence  classes  geometrically. 

Solution.  If  (,r,  y)  e  R2,  then  x2+y2  =  x2  +  y2,  so  (x,  y)  ~  (x,  y):  The  relation 
is  reflexive.  If  (x,  y)  ~  (n.  v),  then  x2  +  y2  =  u2  +  v2,  so  n2  +  v2  —  x2  +  y2 
and  (n,  v)  ~  (x.  y):  The  relation  is  symmetric.  Finally,  if  (x,y)  ~  (u,v)  and 
(u,  v )  ~  ( w ,  z),  then  x2  +  y2  =  u 2  +  v2  and  u 2  +  ir  =  ur  +  z2.  Thus,  x2  +  y2  = 
u2  +  v2  =  w2  +  z2-  Since  x2  +  y2  =  w2  +  z2,  (x,  y)  ~  (w,  z),  so  the  relation  is 
transitive. 

The  equivalence  class  of  {a,  b)  is 

(a,  b )  =  (U,  y)  |  (x,  y)  ~  ( a ,  b)}  =  {(x,  y)  |  x2  +  y2  =  a2  +  b2}. 

For  example,  (1,0)  =  {(x,  y)  |  x2  +  y2  =  l2  +  02  =  1 ),  which  we  recognize  as 
the  graph  of  a  circle  in  the  Cartesian  plane  with  center  (0,  0)  and  radius  1.  For 
general  (a,  b ),  letting  c  =  a2  +  h2,  the  equivalence  class  {a,  b)  is  the  set  of  points 
(x,  y)  satisfying  x2  +  v2  =  c.  So  this  equivalence  class  is  the  circle  with  center 
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(0,  0)  and  radius  *fc.  With  one  exception,  the  equivalence  classes  are  circles  with 
center  (0,  0).  I 


What  is  the  exception? 


2.4.3  PROPOSITION 


Let  ~  denote  an  equivalence  relation  on  a  set  A.  Let  a  e  A.  Then  for  any  x  e  A, 
x  ~  a  if  and  only  if  x  =  a. 


Proof  (•«— )  Suppose  x  =  a.  We  know  x  ex  because  x  ~  x,  so  x  e  a;  thus,  x  ~  a.  It  is 

the  implication  — >  which  is  the  substance  of  this  proposition. 

(—*■)  Suppose  that  x  ~  a.  We  must  prove  that  the  two  sets  x  and  a  are  equal. 
As  always,  we  do  this  by  proving  that  each  set  is  a  subset  of  the  other.  First 
suppose  y  ex.  Then  y  ~  x  and  x  ~  a,  so  y  ~  a  by  transitivity.  Therefore,  yea, 
so  x  c  a.  On  the  other  hand,  suppose  y  €  a.  Then  v  ~  a.  Since  we  also  have 
a  ~  x,  we  have  both  y  ~  a  and  a  ~  x;  therefore,  by  transitivity,  y  ~  x.  Thus, 
y  e  x  and  a  Qx.  Therefore,  a  =x.  I 

In  each  of  the  examples  of  equivalence  relations  which  we  have  discussed 
in  this  section,  different  equivalence  classes  never  overlapped.  If  a  person  is  a 
resident  of  one  state,  he  or  she  is  not  a  resident  of  another.  A  student  number 
cannot  begin  with  79  and  also  with  84.  An  integer  which  is  a  multiple  of  3  is  not  of 
the  form  3k+l.  These  examples  are  suggestive  of  a  result  which  is  true  in  general. 

Suppose  ~  denotes  an  equivalence  relation  on  a  set  A  and  a,b  e  A.  Then  a  ^  b 
if  and  only  if  a  Pi  b  =  0. 


2.4.4  PROPOSITION 


Proof  (-»•)  Suppose  a  ^  b.  We  must  prove  a  Cib  =  0  and  offer  a  proof  by  contradiction. 

Suppose  a  fl  b  ^  0.  Then  there  is  an  element  x  e  a  C\b.  Since  x  ea,x  =a 
by  Proposition  2.4.3.  Similarly,  since  x  e  b,  we  also  have  x  —  b.  Thus,  a  =  b, 
which  is  a  contradiction. 

(4-)  On  the  other  hand,  if  a  fl  b  =  0,  then  a  ea  but  a  £  b,  so  a  b.  I 

If  ~  denotes  an  equivalence  relation  on  A,  reflexivity  says  that  every  ele¬ 
ment  a  in  A  belongs  to  some  equivalence  class,  namely,  to  a.  In  conjunction  with 
Proposition  2.4.4,  this  observation  says  that  the  equivalence  classes  of  any  equiv¬ 
alence  relation  divide  A  into  disjoint  (that  is,  nonoverlapping)  subsets  which  cover 
the  entire  set,  just  like  the  pieces  of  a  jigsaw  puzzle.  We  say  that  the  equivalence 
classes  “partition”  A  or  “form  a  partition  of’  A.  (The  word  partition  is  used  as 
both  verb  and  noun.) 


2.4.5  DEFINITION 


A  partition  of  a  set  A  is  a  collection  of  disjoint  nonempty  subsets  of  A  whose 
union  is  A.  These  disjoint  sets  are  called  cells  (or  blocks).  The  cells  are  said  to 
partition  A. 


EXAMPLES  31 


•  Canada  is  partitioned  into  ten  provinces  and  three  territories.7 


7Nunavut,  created  from  the  eastern  half  of  the  former  Northwest  Territories,  joined  Canada  on  April  1. 
1999. 
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•  Students  are  partitioned  into  groups  according  to  the  first  two  digits  of  their 
student  numbers. 

•  The  human  race  is  partitioned  into  groups  by  eye  color. 

•  A  deck  of  playing  cards  is  partitioned  into  four  suits. 

•  If  A  —  {a,  b,  c,  d ,  e,  f,  x },  then  {{a,  b},  {c,  d ,  e },  {/},  }x}}  is  a  partition  of  A. 

So  is  {[a,  x},  {b,  d,  e,  /},  {c}).  ▲ 


We  have  seen  that  the  equivalence  classes  of  an  equivalence  relation  on  a  set 
A  are  disjoint  sets  whose  union  is  A;  each  element  a  e  A  is  in  precisely  one 
equivalence  class,  namely,  a.  Thus,  we  have  the  following  basic  theorem  about 
equivalence  relations. 


2.4.6  THEOREM 


The  equivalence  classes  associated  with  an  equivalence  relation  on  a  set  A  form 
a  partition  of  A. 


Not  only  does  an  equivalence  relation 
determine  a  partition,  but,  conversely,  any 


partition  of  a  set  A  determines  an  equivalence 
relation,  specifically,  that  equivalence  relation 
whose  equivalence  classes  are  the  cells  of  the 

a 

g 

b 

d 

e 

/  C 

a 

X 

X 

partition.  The  partition  of  the  integers  into 

g 

X 

X 

“evens”  and  “odds”  corresponds  to  the  equiv¬ 

b 

X 

X 

X 

X 

alence  relation  which  says  two  integers  are 

d 

X 

X 

X 

X 

equivalent  if  and  only  if  they  are  both  even 

e 

X 

X 

X 

X 

or  both  odd.  The  partition 

f 

X 

X 

X 

X 

{{a,  g},  {b,  d,  e,  /},  {c}} 

c 

X 

of  the  set  {a,  b,  c,  d,  e ,  f,  g }  corresponds  to  the  equivalence  relation  whose  equiv¬ 
alence  classes  are  (a,  g},  { b ,  d,  e,  /}  and  {c},  that  is,  to  the  equivalence  relation 
described  in  the  figure,  where  a  cross  in  row  x  and  column  y  is  used  to  indicate 

x  ~  y. 

The  suits  “heart,”  “diamond,”  “club,”  “spade"  partition  a  standard  deck  of  playing 
cards.  Describe  the  corresponding  equivalence  relation  on  a  deck  of  cards. 

The  correspondence  between  equivalence  relations  and  partitions  provides  a 
simple  way  to  exhibit  equivalence  relations  on  small  sets.  For  example,  the  equiv¬ 
alence  relation  defined  in  Fig  2.3  can  also  be  described  by  listing  its  equivalence 
classes:  {a,  b }  and  {c}. 


a  b  c 

a 

X  X 

b 

X  X 

c 

X 

Figure  2.3  An  equivalence  relation  with  two  equivalence  classes,  [a,  b J  and  {c}. 


Answers  to  Pauses 


11.  It  sure  does.  First,  every  person  is  either  a  resident  of  the  same  state  in  the 
United  States  as  himself/herself  or  not  a  resident  of  any  U.S.  state,  so  a  ~  a 
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for  all  a  €  A:  ~  is  reflexive.  Second,  if  a,  b  e  A  and  a  ~  6,  then  either  a 
and  b  are  residents  of  the  same  U.S.  state  (in  which  case,  so  are  b  and  a)  or 
else  neither  a  nor  b  is  a  resident  of  any  state  in  the  United  States  (in  which 
case,  neither  is  b  or  a).  Thus,  b  ~  a:  ~  is  symmetric.  Finally,  suppose  a  ~  b 
and  b  ~  c.  Then  either  a  and  b  are  residents  of  the  same  U.S.  state  or  neither 
is  a  resident  of  any  U.S.  state,  and  the  same  holds  true  for  b  and  c.  It  follows 
that  either  all  three  of  a ,  b,  and  c  live  in  the  same  U.S.  state,  or  none  is  a 
resident  of  a  U.S.  state.  Thus,  a  ~  c:  ~  is  transitive  as  well. 

12.  The  equivalence  class  of  3  is  [3k  +  3  |  k  €  Z},  but  this  is  just  the  set  3Z  of 
multiples  of  3.  Thus  3  =  0.  The  equivalence  class  of  4  is  [3k  -f  4  |  k  e  Z), 
but  3&  +  4  =  3(lfc  +  l)  +  l,  so  this  set  is  just  3Z  +  1,  the  equivalence  class 
of  1:  4  =  1.  In  general,  the  equivalence  class  of  an  integer  r  is  3Z  if  r  is  a 
multiple  of  3,  3Z+  1  if  r  is  of  the  form  3 a  +  1,  and  3Z  +  2  if  r  is  of  the  form 
3a  +  2.  Since  every  integer  r  is  either  a  multiple  of  3,  or  3a  +  1  or  3a  +  2 
for  some  a,  the  only  equivalence  classes  are  3Z,  3Z  +  1  and  3Z  +  2. 

13.  The  equivalence  class  of  a  consists  of  those  people  equivalent  to  a  in  the  sense 
of  ~.  If  a  does  not  live  in  any  state  of  the  United  States,  the  equivalence  class 
of  a  consists  of  all  those  people  who  also  live  outside  any  U.S.  state.  If  a 
does  live  in  a  U.S.  state,  the  equivalence  class  of  a  consists  of  those  people 
who  live  in  the  same  state.  The  quotient  set  has  51  elements,  consisting  of 
the  residents  of  the  50  American  states  and  the  set  of  people  who  do  not  live 
in  any  U.S.  state. 

14.  The  equivalence  class  of  (0,  0)  is  the  set  {(0.  0)}  whose  only  element  is  the 
single  point  (0,  0). 

15.  Two  cards  are  equivalent  if  and  only  if  they  have  the  same  suit. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Let  A  be  the  set  of  all  citizens  of  New  York  City.  For 
a,b  €  A,  define  a  ~~  b  if  and  only  if 

(a)  neither  a  nor  b  have  a  cell  phone,  or 

(b)  both  a  and  b  have  cell  phones  in  the  same  exchange 
(that  is,  the  first  three  digits  of  each  phone  number 
are  the  same). 

Show  that  ~  defines  an  equivalence  relation  on  A  and 
find  the  corresponding  equivalence  classes. 

2.  Explain  why  each  of  the  following  binary  relations  on 
S  —  {1,2,3}  is  not  an  equivalence  relation  on  S. 

(a)  [BB]  11  =  {(1 ,  1),  (1, 2),  (3,  2),  (3,  3),  (2,  3),  (2,  1 )) 

(b)  n  =  {{  1, 1),  (2,  2),  (3,  3),  (2,  1),  (1,2),  (2,  3),  (3,  1), 
(1,3)} 


1 

2 

3 

1 

X 

X 

X 

2 

X 

X 

3 

X 

3.  [BB]  The  sets  [1],  [2],  [3],  {4},  {5}  are  the  equivalence 
classes  for  a  well-known  equivalence  relation  on  the 
set  5  =  [1 , 2,  3,  4,  5}.  What  is  the  usual  name  for  this 
equivalence  relation? 

4.  [BB]  For  a.  b  e  R  \  {0},  define  a  ~  b  if  and  only  if 

f  eQ. 

(a)  Prove  that  ~  is  an  equivalence  relation. 

(b)  Find  the  equivalence  class  of  1. 

(c)  Show  that  \/3  =  VT2. 
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5.  For  natural  numbers  a  and  b,  define  a  ~  b  if  and  only 
if  a2  +  b  is  even.  Prove  that  ~  defines  an  equivalence 
relation  on  N  and  find  the  quotient  set  determined  by 

6.  [BB]  For  a,  b  e  R,  define  a  ~  b  if  and  only  if  a  —  b  €  Z. 

(a)  Prove  that  ~  defines  an  equivalence  relation  on  Z. 

(b)  What  is  the  equivalence  class  of  5?  What  is  the 
equivalence  class  of  5  j? 

(c)  What  is  the  quotient  set  determined  by  this  equiva¬ 
lence  relation? 

7.  [BB]  For  integers  a,b,  define  a  ~  b  if  and  only  if 
2a  +  3b  =  5 n  for  some  integer  n.  Show  that  ~  defines 
an  equivalence  relation  on  Z. 

8.  Define  ~  on  Z  by  a  ~  b  if  and  only  if  3a  +  b  is  a 
multiple  of  4. 

(a)  Prove  that  ~  defines  an  equivalence  relation. 

(b)  Find  the  equivalence  class  of  0. 

(c)  Find  the  equivalence  class  of  2. 

(d)  Make  a  guess  about  the  quotient  set. 

9.  For  integers  a  and  b ,  define  a  ~  b  if  3a  +  4b  =  In  for 
some  integer  n . 

(a)  Prove  that  ~  defines  an  equivalence  relation. 

(b)  Find  the  equivalence  class  of  0. 

10.  [BB]  For  a,  b  €  Z  \  [0],  define  a  ~  b  if  and  only  if 
ab  >  0. 

(a)  Prove  that  ~  defines  an  equivalence  relation  on  Z. 

(b)  What  is  the  equivalence  class  of  5?  What’s  the 
equivalence  class  of  —5? 

(c)  What  is  the  partition  of  Z  \  [0]  determined  by  this 
equivalence  relation? 

11.  For  a,  be  Z,  define  a  ~  b  if  and  only  if  a2  —  b 2  is 
divisible  by  3. 

(a)  [BB]  Prove  that  ~  defines  an  equivalence  relation 
on  Z. 

(b)  What  is  0?  What  is  T? 

(c)  What  is  the  partition  of  Z  determined  by  this  equiv¬ 
alence  relation? 

12.  Determine,  with  reasons,  whether  or  not  each  of  the 
following  defines  an  equivalence  relation  on  the  set  A. 

(a)  [BB]  A  is  the  set  of  all  triangles  in  the  plane;  a  ~  b 
if  and  only  if  a  and  b  are  congruent. 

(b)  A  is  the  set  of  all  circles  in  the  plane;  a  ~  b  if  and 
only  if  a  and  b  have  the  same  center. 

(c)  A  is  the  set  of  all  straight  lines  in  the  plane;  a  ~  b 
if  and  only  if  a  is  parallel  to  b. 

(d)  A  is  the  set  of  all  lines  in  the  plane;  a  ~  b  if  and 
only  if  a  is  perpendicular  to  b. 


13.  List  the  pairs  in  the  equivalence  relation  associated  with 
each  of  the  following  partitions  of  A  =  { 1 , 2,  3, 4,  5}. 

(a)  [BB]  {[1,2],  [3,  4,  5}} 

(b)  {{ 1 },  [2],  [3,  4],  [5]) 

(c)  {{1,2,  3,  4,  5}} 

14.  (a)  List  all  the  equivalence  relations  on  the  set  {a}.  How 

many  are  there  altogether? 

(b)  Repeat  (a)  for  the  set  {a,  b (. 

(c)  [BB]  Repeat  (a)  for  the  set  [a,  b ,  c], 

(d)  Repeat  (a)  for  the  set  {a,  b,  c ,  d}. 

(Remark:  The  number  of  partitions  of  a  set  of  n  elements 
grows  rather  rapidly.  There  are  52  partitions  of  a  set  of 
five  elements,  203  partitions  of  a  set  of  six  elements, 
and  877  partitions  of  a  set  of  seven  elements.) 

15.  Define  ~  on  R2  by  ( x .  y)  ~  («,  v)  if  and  only  if  x  —  y  = 
u  —  v. 

(a)  [BB]  Criticize  and  then  correct  the  following 
“proof’  that  ~  is  reflexive. 

“If  (.v,  y)  ~  (,v,  y),  then  x  —  y  =  x  —  y, 

which  is  true.” 

(b)  What  is  wrong  with  the  following  interpretation  of 
symmetry  in  this  situation? 

“If  (x,  y)  e  71,  then  (y,  x)  e  71." 

Write  a  correct  statement  of  the  symmetric  property 
(as  it  applies  to  the  relation  ~  in  this  exercise). 

(c)  Criticize  and  then  correct  the  following  “proof’  that 
~  is  symmetric. 

“(x,  y)  ~  («,  v)  if  x  —  y  —  u  —  v. 

Then  «  —  u  =  x  —  y.  So  (m,  i>)  ~  (x,  y).” 

(d)  Criticize  and  correct  the  following  “proof'  of  tran¬ 
sitivity. 

“(x,  y)  ~  (it,  v)  and  (u,  v)  ~  (to,  z). 

Then  u  —  v  =  w  —  z,  so  if  x  —  y  =  u  —  v, 
then  x  —  y  =  w  —  z.  So  (x,  y)  ~  (w,  z).” 

(e)  Why  does  ~  define  an  equivalence  relation  on  R2? 

(f)  Determine  the  equivalence  classes  of  (0, 0)  and 
(2,  3)  and  describe  these  geometrically. 

16.  [BB]  For  (x,  y)  and  (n,  u)  e  R2  define  (x,  y)  ~  («,  v) 
if  and  only  if  x2  —  y2  =  u2  —  v2.  Prove  that  ~  defines 
an  equivalence  relation  on  R2.  Describe  geometrically 
the  equivalence  class  of  (0.  0).  Describe  geometrically 
the  equivalence  class  of  (1, 0). 
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17.  Determine  which  of  the  following  define  equivalence 
relations  in  R2.  For  those  which  do,  give  a  geometrical 
interpretation  of  the  elements  of  the  quotient  set. 


(a) 

(a,  b)  - 

~  (c,  d) 

if 

and 

only 

if  a  +  2b  =  c  +  2d. 

(b) 

(a,  b )  - 

“  (c,  d) 

if 

and 

only 

if  ab  =  cd. 

(c) 

(a.,  b )  - 

-  (c,  d) 

if 

and 

only 

if  a2  +  b  —  c  +  d2. 

(d) 

(a,  b )  - 

-  (c,  d) 

if 

and 

only 

if  a  =  c. 

(e) 

(a,  b) 

-  (c,  d) 

if 

and 

only 

if  ab  =  c2. 

18.  Let  ~  denote  an  equivalence  relation  on  a  set  A.  Assume 
a,b,c,d  e  A  are  such  that  a  e  b,  c  e  d,  and  deb. 
Prove  that  a  =c. 


(c)  Explain  why  ~  defines  an  equivalence  relation 
on  A. 


20.  [BB]  In  Exercise  19,  let  A  be  the  set  of  all  natural  num¬ 
bers  and  ~  be  as  given.  Show  that  ~  defines  an  equiv¬ 
alence  relation  on  A. 


21. 


22. 


Repeat  Exercise  19  for  A  = 
relation  on  A  defined  by  a 


{1. 2,  3,  4,  5,  6,  7}  and  the 
a 

-  b  if  and  only  if  -  is  a 
b 


U  f 

power  of  2,  that  is,  —  =  2'  for  some  integer  t,  positive, 
b 

negative  or  zero. 

In  Exercise  21,  let  A  be  the  set  of  all  natural  numbers 
and  ~  be  as  given.  Show  that  ~  defines  an  equivalence 
relation  on  A. 


19.  [BB]  Let  A  =  [1 , 2,  3,  4,  5,  6,  7,  8,  9}.  For  a,b  e  A, 
define  a  ~  b  if  and  only  if  ab  is  a  perfect  square  (that 
is,  the  square  of  an  integer). 

(a)  What  are  the  ordered  pairs  in  this  relation? 

(b)  For  each  a  e  A,  find  a  =  {x  e  A  \  x  ~  a}. 


23.  Let  R  be  an  equivalence  relation  on  a  set  S  and  let 
[Si,  $2,  •  •  •  ■  St  1  be  a  collection  of  subsets  of  S  with  the 
property  that  (a.  b)  e  R  if  and  only  if  a  and  b  are 
elements  of  the  same  set  Sj,  for  some  i.  Suppose  that 
for  each  i,  Si  <2  (J y//  Sj-  Prove  that  {Si,  S2, . . .  ,  St}  is 
a  partition  of  S. 


2.5  Partial  Orders 

In  the  previous  section,  we  defined  an  equivalence  relation  as  a  binary  relation 
which  possesses  the  three  fundamental  properties  of  equality — reflexivity,  sym¬ 
metry,  transitivity.  We  mentioned  that  we  view  equivalence  as  a  weak  form  of 
equality  and  employed  a  symbol,  suggesting  “equals.”  In  an  analogous  manner, 
in  this  section,  we  focus  on  three  fundamental  properties  of  the  order  relation  < 
on  the  real  numbers — reflexivity,  antisymmetry,  transitivity — and  define  a  binary 
relation  called  partial  order,  which  can  be  viewed  as  a  weak  form  of  <.  We 
shall  use  the  symbol  <  for  a  partial  order  to  remind  us  of  its  connection  with 
<  and,  for  the  same  reason,  say  that  “a  is  less  than  or  equal  to  b ”  whenever 
a  <b. 


2.5.1  DEFINITIONS 


A  partial  order  on  a  set  A  is  a  reflexive,  antisymmetric,  transitive  relation  on 
A.  A  partially  ordered  set,  poset  for  short,  is  a  pair  (A,  <),  where  <  is  a  partial 
order  on  a  set  A. 


Writing  a  <  b  to  mean  that  (a,  b)  is  in  the  relation,  a  partial  order  on  A  is  a 
binary  relation  which  is 

reflexive:  a  <  a  for  all  a  e  A, 

antisymmetric:  If  a,b  e  A,  a  <  b  and  b  <  a,  then  a  —  b,  and 
transitive:  If  a,b,c  e  A,  a  <b  and  b  <c,  then  a  <c. 

It  is  convenient  to  use  the  notation  a  <  b  (and  to  say  “a  is  less  than  b”)  to 
signify  a  <  b,  a  ^  b,  just  as  we  use  a  <  b  to  mean  a  <  b,  a  ^  b.  Similarly,  the 
meanings  of  a  >  b  and  a  >  b  should  be  apparent. 
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EXAMPLES  32 


EXAMPLE  33 


2.5.2  DEFINITION 


2.5.3  DEFINITION 


There  is  little  purpose  in  making  a  definition  unless  there  is  at  hand  a  variety 
of  examples  which  fit  the  definition.  Here  then  are  a  few  examples  of  partial 
orders. 


•  The  binary  relation  <  on  the  real  numbers  (or  on  any  subset  of  the  real 
numbers)  is  a  partial  order  because  a  <  a  for  all  a  e  R  (reflexivity),  a  <  b 
and  b  <  a  implies  a  =  b  (antisymmetry)  and  a  <  b,  b  <  c  implies  a  <  c 
(transitivity). 

•  For  any  set  S ,  the  binary  relation  c  on  the  power  set  'P(S)  of  S  is  a  partial 

order  because  X  c  X  for  any  X  e  ’P(S)  (reflexivity).  X  c  Y,  Y  c  X 
for  K,F  e  V(S)  implies  X  =  Y  (antisymmetry)  and  X  C  Y,  Y  c  Z  for 
X,  Y,  Z  e  V{S)  implies  X  c  Z  (transitivity).  ▲ 

(Lexicographic  Ordering)  Suppose  we  have  some  alphabet  of  symbols  (perhaps 
the  English  alphabet)  which  is  partially  ordered  by  some  relation  <.  By  “word,” 
we  mean  any  string  of  letters  from  this  alphabet.  For  words  a  =  a\ai  -  •  ■  an  and 
b  =  b\b2-  •  •  bm,  define  a  <  b  if 

•  a  and  b  are  identical,  or 

•  cii  ^  bi  in  the  alphabet  at  the  first  position  /  where  the  words  differ,  or 

•  n  <  m  and  n,  =  b;  for  i  =  1 . n.  (This  is  the  situation  where  word  a, 

which  is  shorter  than  b,  forms  the  initial  sequence  of  letters  in  b.) 

This  ordering  of  words  is  called  lexicographic  because  when  the  basic  alphabet 
is  the  English  alphabet,  it  is  precisely  how  words  are  ordered  in  a  dictionary;  car 
:<  cat  <  catalog.  ▲ 

The  adjective  "partial”,  as  in  “partial  order”  draws  our  attention  to  the  fact 
that  the  definition  does  not  require  that  every  pair  of  elements  be  comparable,  in 
the  following  sense. 

If  {A,  <)  is  a  partially  ordered  set,  elements  a  and  b  of  A  are  said  to  be  comparable 
if  and  only  if  either  a  <  b  or  b  -<  a. 

If  X  and  Y  are  subsets  of  a  set  S,  it  need  not  be  the  case  that  X  c  Y  or 
Y  c  X\  for  example,  {a}  and  {b,  c]  are  not  comparable. 

If  <  is  a  partial  order  on  a  set  A  and  every  two  elements  of  A  are  comparable, 
then  <  is  called  a  total  order  and  the  pair  (A,  -<)  is  called  a  totally  ordered 
set. 


The  real  numbers  are  totally  ordered  by  <  because  for  every  pair  a,  b  of 
real  numbers  either  a  <  b  or  b  <  a.  On  the  other  hand,  the  set  of  sets, 
{ {a},  {b},  {c},  {a,  c}}  is  not  totally  ordered  by  c  since  neither  {a}  c  {£>}  nor 
{b}C{fl}. 
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Is  lexicographic  order  on  a  set  of  words  (in  the  usual  sense)  a  total  order? 

Partial  orders  are  often  pictured  by  means  of  a  diagram  named  after  Helmut 
Hasse  (1898-1979),  for  many  years  professor  of  mathematics  at  Gottingen.8  In 
the  Hasse  diagram  of  a  partially  ordered  set  (A,  <), 

•  there  is  a  dot  (or  vertex)  associated  with  each  element  of  A; 

•  if  a  <  b,  then  the  dot  for  b  is  positioned  higher  than  the  dot  for  a\  and 

•  if  a  <b  and  there  is  no  intermediate  c  with  a  <  c  <b,  then  a  line  is  drawn 
from  a  to  b.  (In  this  case,  we  say  that  the  element  b  covers  a.) 

The  effect  of  the  last  property  here  is  to  remove  redundant  lines.  Two  Hasse 
diagrams  are  shown  in  Fig  2.4.  The  reader  should  appreciate  that  these  would  be 


3  ? 

2  o 

1  <> 

0  © 


[b,  c} 


(c) 


Figure  2.4  The  Hasse  diagrams  for  ((0,  1,2,  3),  <)  and  (' P({a,b,c}),  c). 


unnecessarily  complicated  were  we  to  draw  all  lines  from  a  to  b  whenever  a  ±  b 
instead  of  just  those  lines  where  b  covers  a.  No  knowledge  of  the  partial  order  is 
lost  by  this  convention:  After  all,  if  a  <b  and  b  <  c,  then  (by  transitivity),  a  <c, 
so  if  there  is  a  line  from  a  to  b  and  a  line  from  b  to  c,  then  we  can  correctly  infer 
that  a  <  c,  from  the  diagram.  For  example,  in  the  diagram  on  the  left,  we  can 
infer  that  1  <  3  since  1  <  2  and  2  <  3.  In  the  diagram  on  the  right,  we  similarly 
infer  that  {&}  ;<  {a,b,c}. 

■B  Suppose  that  in  some  Hasse  diagram,  a  vertex  c  is  “above”  another  vertex  a ,  but 
there  is  no  line  from  a  to  c.  Is  it  the  case  that  a  <  cl  Explain. 


2.5.4  DEFINITIONS 


An  element  a  of  a  poset  (A,  <)  is  maximum  if  and  only  if  b  <  a  for  every  b  €  A 
and  minimum  if  and  only  if  a  <  b  for  every  b  e  A. 


In  the  poset  (' P({a,  b,  cj),  C),  0  is  a  minimum  element  and  the  set  { a ,  b,  c } 
a  maximum  element.  In  the  poset  {la},  {/?},  {c} ,  {a,  c} }  (with  respect  to  C),  there 
is  neither  a  maximum  nor  a  minimum  because,  for  each  of  the  elements  {a},  {/>}, 
{c},  and  {a,  c},  there  is  another  of  these  with  which  it  is  not  comparable. 


^There  is  a  fascinating  account  by  S.  L.  Segal  of  the  ambiguous  position  in  which  Hasse  found  himself 
during  the  Nazi  period.  The  article,  entitled  “Helmut  Hasse  in  1934,”  appears  in  Historia  Mathematicci  7 
(1980),  45-56. 
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If  a  poset  has  a  maximum  element,  then  this  element  is  unique;  similarly,  a 
poset  can  have  at  most  one  minimum.  (See  Exercise  1 1.) 

One  must  be  careful  to  distinguish  between  maximum  and  maximal  elements 
and  between  minimum  and  minimal  elements. 


2.5.5  DEFINITIONS 


An  element  a  of  a  poset  A  is  maximal  if  and  only  if, 


if  b  e  A  and  a  <  b,  then  b  =  a 


and  minimal  if  and  only  if, 

if  b  e  A  and  b  <  a,  then  b  =  a. 


Thus,  a  maximum  element  is  “bigger”  (in  the  sense  of  ;<)  than  every  other 
element  in  the  set  while  a  maximal  element  is  one  which  is  not  less  than  any 
other.  Considering  again  the  poset  { {a},  {b},  (c),  {a,  c } },  while  there  is  neither  a 
maximum  nor  a  minimum,  each  of  {a},  [b],  and  (c)  is  minimal  while  both  {b} 
and  {a,c}  are  maximal. 

What,  if  any,  are  the  maximum,  minimum,  maximal,  and  minimal  elements  in  the 
poset  whose  Hasse  diagram  appears  in  Fig  2.5? 


Figure  2.5 


Let  (A,  ;<)  be  a  poset.  An  element  g  is  a  greatest  lower  bound  (abbreviated  gib ) 
of  elements  a,  b  e  A  if  and  only  if 

1.  g  <  a,  g  <  b,  and 

2.  if  c  <  a  and  c  <b  for  some  c  e  A,  then  c  <  g. 

Elements  a  and  b  can  have  at  most  one  gib  (see  Exercise  14).  When  this  element 
exists,  it  is  denoted  a  A  b,  pronounced  “ a  meet  b.” 

An  element  l  is  a  least  upper  bound  (abbreviated  lub)  of  a  and  b  if 

1.  a  <  l,  b  <  t,  and 

2.  if  a  <  c,  b  <  c  for  some  c  e  A,  then  l  -<  c. 

As  with  greatest  lower  bounds,  a  least  upper  bound  is  unique  if  it  exists.  The  lub 
of  a  and  b  is  denoted  a  v  b,  “o  join  b ,”  if  there  is  such  an  element. 

EXAMPLES  34  «  In  the  poset  (R,  <),  the  gib  of  two  real  numbers  is  the  smaller  of  the  two  and 

the  lub  the  larger. 

•  In  the  poset  (' P(S),  C)  of  subsets  of  a  set  S,  AaB  =  AHB  and  AvB  =  AUB. 
(See  Exercise  12.)  Remembering  that  v  means  U  and  A  means  n  in  a  poset 
of  sets  provides  a  good  way  to  avoid  confusing  the  symbols  v  and  A  in  a 
general  poset.  A 


2.5.6  DEFINITIONS 
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^ |  With  reference  to  Fig  2.5,  find  a  w  b,  a  Ab,  b  w  d  and  b  A  d,  if  these  exist. 

A  poset  (A,  ;<)  in  which  every  two  elements  have  a  greatest  lower  bound  in  A 
and  a  least  upper  bound  in  A  is  called  a  lattice. 


2.5.7  DEFINITION 


EXAMPLE  35 


Answers  to  Pauses 


The  posets  described  in  Examples  34  are  both  lattices.  ▲ 

16.  Sure  it  is;  otherwise,  it  would  be  awfully  hard  to  use  a  dictionary. 

17.  We  can  conclude  a  <  c  only  if  there  is  a  sequence  of  intermediate  vertices 
between  a  and  c  with  lines  between  each  adjacent  pair.  Look  at  the  Hasse 
diagram  (V({a.  b,  c}),  C)  in  Fig  2.4.  Here  we  have  {b.  c]  above  {a},  but 
{a}  ^  [b,  c]  because  these  elements  are  incomparable.  On  the  other  hand, 
{a,  b ,  c}  is  above  {a}  and  we  can  infer  that  {a}  <  { a,b ,  c}  because,  for  the 
intermediate  vertex  {a,c},  we  have  upward  directed  lines  from  {a}  to  {a,c} 
and  from  { a ,  c]  to  { a ,  b.  c). 

18.  There  is  no  maximum  but  a  and  b  are  maximal;  d  is  both  minimal  and  a 
minimum. 

19.  aw  b  does  not  exist;  a  A  b  —  c;  b  v  d  =  b\  b  A  d  =  d . 


EXERCISES 


The  symbol  [ BB ]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Determine  whether  or  not  each  of  the  following  relations 
is  a  partial  order  and  state  whether  or  not  each  partial 
order  is  a  total  order. 

(a)  [BB]  For  a,  b  €  R,  a  <  b  means  a  >  b. 

(b)  [BB]  For  a,  b  €  R,  a  ■<  b  means  a  <  b. 

(c)  (R,  <),  where  a  <b  means  a 2  <  b 2. 

(d)  (N  x  N,  ;<),  where  ( a ,  b )  <  (c.  d)  if  and  only  if 
a  <  c. 

(e)  (N  x  N,  <),  where  {a.b)  <  ( c.d )  if  and  only  if 
a  <  c  and  b  >  d. 

(f)  (W,  2)  where  W  is  the  set  of  all  strings  of  letters 
from  the  alphabet  (“words”  real  or  imaginary)  and 
to i  ;<  u>2  if  and  only  if  w\  has  length  not  exceeding 
the  length  of  w 2.  (Length  means  number  of  letters.) 

2.  (a)  [BB]  List  the  elements  of  the  set  [1 1,  1010,  100,  I, 

101,111,110,1001,10,1000)  in  lexicographic 
order,  given  0^1. 

(b)  Repeat  part  (a)  assuming  1  <  0. 

3.  List  all  pairs  (x,  y)  with  x  y  in  the  partial  orders 
described  by  each  of  the  following  Hasse  diagrams. 


(a)  [BB]  d  <j> 

c  o 
b  o 
a  o 


(b)  [BB]  b  9 


d  o 


a  6  co 


68  Chapter  2  Sets  and  Relations 


4.  [BB;  (a),  (b)]  List  all  minimal,  maximal,  minimum, 
and  maximum  elements  for  each  of  the  partial  orders 
described  in  Exercise  3. 

5.  Draw  the  Hasse  diagrams  for  each  of  the  following  par¬ 
tial  orders. 

(a)  ({1,2, 3,4,5, 6},  <) 

(b)  ( { {a},  { a ,  h),  {a,  b ,  c},  { a ,  h,  c,  d},  (a,  c},  (c,  d } } , 
C) 

6.  List  all  minimal,  minimum,  maximal,  and  maximum  ele¬ 
ments  for  each  of  the  partial  orders  in  Exercise  5. 

7.  [BB]  In  the  poset  ( V(S ),  C)  of  subsets  of  a  set  S,  under 
what  conditions  does  one  set  B  cover  another  set  A ? 

A  8.  Learn  what  you  can  about  Helmut  Hasse  and  write  a 
short  biographical  note  about  this  person,  in  good  clear 
English  of  course! 

9.  (a)  [BB]  Prove  that  any  finite  (nonempty)  poset  must 
contain  maximal  and  minimal  elements. 

(b)  Is  the  result  of  (a)  true  in  general  for  posets  of  arbi¬ 
trary  size?  Explain. 

10.  (a)  Let  A  —  Z1 2  and,  for  a  =  (01,02)  and  b  =  (/q.M) 

in  A,  define  a  <  b  if  and  only  if  oj  <  b  1  and 
O]  +02  <  b\  +  bi.  Prove  that  <  is  a  partial  order 
on  A.  Is  this  partial  order  a  total  order?  Justify  your 
answer  with  a  proof  or  a  counterexample. 

(b)  Generalize  the  result  of  part  (a)  by  defining  a  partial 
order  on  the  set  Z”  of  n-tuples  of  integers.  (No  proof 
is  required.) 

11.  (a)  [BB]  Prove  that  a  poset  has  at  most  one  maximum 

element. 

(b)  Prove  that  a  poset  has  at  most  one  minimum  ele¬ 
ment. 

12.  Let  S  be  a  nonempty  set  and  let  A  and  B  be  elements  of 
the  power  set  of  S.  In  the  partially  ordered  set  {V(S).  C), 

(a)  [BB]  prove  that  A  A  B  =  A  n  B: 


(b)  prove  that  A  v  B  =  A  U  B. 

13.  Let  a  and  b  be  two  elements  of  a  poset  (A,<)  with 
a  <b. 


(a)  [BB]  Show  that  a  v  b  exists,  find  this  element,  and 
explain  your  answer. 

(b)  Show  that  aAb  exists,  find  this  element,  and  explain 
your  answer. 


14.  (a)  [BB]  Prove  that  a  gib  of  two  elements  in  a  poset 

(A,  <)  is  unique  whenever  it  exists. 

(b)  Prove  that  a  lub  of  two  elements  in  a  poset  (A,  <) 
is  unique  whenever  it  exists. 

15.  (a)  If  a  and  b  are  two  elements  of  a  partially  ordered 

set  (.4,  <),  the  concepts 


max(a,  b) 


a  if  b  <  a 
b  if  a  <  b 


and 


min(a,  b) 


a  if  a  <b 
b  if  b  <  a 


do  not  make  sense  unless  the  poset  is  totally  ordered. 
Explain. 

(b)  Show  that  any  totally  ordered  set  is  a  lattice. 

16.  (a)  [BB]  Give  an  example  of  a  partially  ordered  set 
which  has  a  maximum  and  a  minimum  element  but 
is  not  totally  ordered. 

(b)  Give  an  example  of  a  totally  ordered  set  which  has 
no  maximum  or  minimum  elements. 


17.  Prove  that  in  a  totally  ordered  set,  any  maximal  element 
is  a  maximum. 


18.  Suppose  (A.  <)  is  a  poset  containing  a  minimum  ele¬ 
ment  a. 


(a)  [BB]  Prove  that  a  is  minimal. 

(b)  Prove  that  a  is  the  only  minimal  element. 


REVIEW  EXERCISES  FOR  CHAPTER  2 


1.  If  A  =  [jc  €  N  |  .v  <  7],  B  =  [x  €  Z  |  \x  —  5|  <  3]  and 
C  =  [2,  3),  find  (A  ®  B)  \  C. 

2.  Let  A  =  [x  6  Z  |  —  1  <  x  <  2],  B  =  [2.v  —  3  |  x  e  A} 
and  C  =  [jc  e  R  |  x  =  | ,  a  e  A,  b  e  Bj. 

(a)  List  the  elements  of  A,  B,  and  C. 

(b)  List  the  elements  of  (A  D  B)  x  B. 

(c)  List  the  elements  of  B\C. 


(d)  List  the  elements  of  A  ©  C. 

3.  Let  A,  B ,  and  C  be  sets.  Are  the  following  statements 
true  or  false?  In  each  case,  provide  a  proof  or  exhibit  a 
counterexample. 

(a)  A  D  B  =  A  if  and  only  if  A  C  B. 

(b)  (Anfl)uc  =  A  n  (B  u  C). 

4.  Let  A  =  [1],  Find  V(V(A)). 
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5.  Let  A,  B,  C,  and  D  be  sets. 

(a)  Give  an  example  showing  that  the  statement  “A  © 
(Z?  \  C)  =  (A  ©  B)  \  C”  is  false  in  general. 

(b)  Prove  that  the  statement  “A  c  C,  B  c  D  — *  A  x 
B  c  C  x  D"  is  true. 

6.  Give  an  example  showing  that  the  statement  “(A  x  B)  c 
(C  x  D)  — >  A  C  C  and  B  c  D”  is  false  is  general. 

7.  Let  A  be  a  set. 

(a)  What  is  meant  by  the  term  binary  relation  on  A? 

(b)  Suppose  A  has  ten  elements.  How  many  binary  rela¬ 
tions  are  there  on  A? 

8.  For  a,  be  N,  define  a  ~  b  if  a  <  2b.  Determine  whether 
or  not  ~  is  reflexive,  symmetric,  antisymmetric,  transi¬ 
tive,  an  equivalence  relation  or  a  partial  order  on  N. 

9.  Define  a  relation  7?.  on  Z  by  alZb  if  4a  +b  is  a  multiple 
of  5.  Show  that  72  defines  an  equivalence  relation  on  Z. 

10.  Define  a  relation  72  on  Z  by  alZb  if  2a+5b  is  a  multiple 
of  7. 

(a)  Prove  that  7 2  defines  an  equivalence  relation. 

(b)  Is  72  a  partial  order?  Explain  your  answer  briefly. 


11.  Let  ~  denote  an  equivalence  relation  on  a  set  A.  Prove 
that  x  ~  a  -o-  x  =  a  for  any  x,  a  e  A. 

12.  Let  ~  denote  an  equivalence  relation  on  a  set  A.  Assume 
a,b,c,d  e  A  are  such  that  a  e  b,  d  <£  c  and  d  e  b. 
Prove  that  d  n  c  =  0. 

13.  Let  A  be  the  set  of  points  different  from  the  origin  in  the 
Euclidean  plane.  For  p,q  e  A,  define  p  ~  q  if  p  =  q 
or  the  line  through  the  distinct  points  p  and  q  passes 
through  the  origin. 

(a)  Prove  that  ~  defines  an  equivalence  relation  on  A. 

(b)  Find  the  equivalence  classes  of  ~. 

14.  Show  that  (P(Z),  c)  is  a  partially  ordered  set. 

15.  Let  A  =  { 1 , 2,  4,  6,  8)  and,  for  a,  b  e  A,  define  a  <  b 
if  and  only  if  |  is  an  integer. 

(a)  Prove  that  ■<  defines  a  partial  order  on  A. 

(b)  Draw  the  Hasse  diagram  for  <. 

(c)  List  all  minimum,  minimal,  maximum  and  maximal 
elements. 

(d)  Is  (A,  <)  totally  ordered?  Explain. 

16.  Let  (A,  :<)  be  a  poset  and  a,b  e  A.  Can  a  and  b  have 
two  least  upper  bounds?  Explain. 


CHAPTER 


3 

Functions 


3.1 


Domain,  Range,  One-to-One,  Onto 


Next  in  importance  to  the  primitive  notion  of  “set”  is  the  idea  of  “function.”  This 
is  a  term  which  nowadays  is  introduced  informally  quite  early  in  the  mathematics 
curriculum.  It’s  often  defined  as  a  rule  which  associates  to  each  element  of  a 
set  A  (usually  a  set  of  real  numbers)  an  element  of  another  set  B  (often  R  as 
well).  The  function  keys  on  a  calculator  are  so  named  because  when  a  number 
x  is  entered  into  a  calculator  and  a  “function  key”  /  is  pressed,  another  number 
f{x)  appears.1  The  essential  characteristic  of  a  function  is  that  the  value  which 
it  associates  with  a  given  element  is  uniquely  determined  by  that  element.  If  we 
enter  4  into  a  calculator  and  then  press  some  function  key,  we  are  presented  with 


exactly  one  number,  not  a  choice  of  several  numbers:  4 


=  2,  for  instance, 


not  ±2. 


Here  is  the  formal  definition  of  “function.” 


3.1.1  DEFINITION 


A  function  from  a  set  A  to  a  set  B  is  a  binary  relation  /  from  A  to  B  with  the 
property  that,  for  every  a  €  A,  there  is  exactly  one  b  e  B  such  that  (a.  b)  e  f .  (In 
some  areas  of  mathematics,  map  is  a  commonly  employed  synonym  fox  function.) 


In  this  definition,  the  idea  that  a  function  should  associate  with  each  element 
a  e  A  a  unique  element  b  e  B  is  captured  by  viewing  a  function  /  as  a  subset  of 
A  x  B  with  the  special  property  that  for  each  a  €  A,  there  is  just  one  pair  (a,  b ) 
in  /  having  first  coordinate  a.  If  A  =  {1,  2,  3},  B  =  {x,  y }  and  /  is 

/  =  {(1,  x),  (2,  y),  (3,  x)}, 

then  /  is  just  the  rule  which  associates  x  with  1,  y  with  2.  and  x  with  3. 


i 


Here  we  assume  that  we  haven't  entered  a  number  like  -1  and  then  pressed  the  square  root  key! 
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There  are  two  key  points  about  the  definition  of  “function.”  First,  every  a 
in  A  must  be  the  first  coordinate  of  an  ordered  pair  in  the  function.  Again  with 
A  —  {1,  2,  3}  and  B  =  { x ,  y},  the  set 

g=f(l,x),(3,y)} 

is  not  a  function  from  A  to  B  because  g  contains  no  ordered  pair  with  first 
coordinate  2.  Second,  each  element  of  A  must  be  the  first  coordinate  of  exactly 
one  ordered  pair.  With  the  same  A  and  B , 

h  =  {(l,x),(2,x),(3,y),(2,y)} 

is  not  a  function  because  2  is  the  first  coordinate  of  two  ordered  pairs. 


PROBLEM  1.  Suppose  A  is  the  set  of  surnames  of  people  listed  in  the  Salt 
Lake  City  telephone  directory.  Is  it  likely  that 

/  =  {(a,  n)  |  a  is  on  page  n ) 

is  a  function  from  A  to  the  set  of  natural  numbers?  Comment. 

Solution.  By  definition  of  /,  each  element  of  A  is  the  first  coordinate  of  a  pair 
in  /,  so  the  key  question  here  is  whether  or  not  each  a  e  A  determines  a  unique 
n  e  N.  Is  each  surname  on  a  unique  page  in  the  telephone  directory?  This  is  not 
very  likely  since  some  surnames  (Smith?)  are  undoubtedly  listed  on  a  number  of 
different  pages.  It  is  unlikely  that  /  is  a  function.  I 


3.1.2  FUNCTION 
NOTATION 


When  /  is  a  function  from  A  to  B,  the  element  b  e  B,  which  is  uniquely 
determined  by  the  element  a  e  A,  is  denoted  f(a)  and  called  the  image  of  a. 
Thus, 


(1) 


(a,  b)  e  f  if  and  only  if  b  —  f{a). 


It  is  customary  to  write  / :  A  —*■  B  to  mean  that  /  is  a  function  from  A  to  B 
and  to  write  / :  a  b  to  mean  that  f(a )  =  b.  (Note  the  differences  between  the 
symbols  — ►  between  sets,  as  in  A  ->  B,  and  i->-  between  elements,  as  in  a  b.) 


Thus,  the  function  /  =  {(Lx),  (2,  y),  (3,  x)}  could  also  be  described  by 

1  x 
/:  2  y 

3  i-»-  x 

although  the  description  as  ordered  pairs  seems  simpler. 

Sometimes  (often  in  calculus),  a  function  is  sufficiently  nice  that  it  is  possible 
to  write  down  a  precise  formula  showing  how  f(x)  is  determined  by  x;  for 
example,  f(x)  =  x3,  /(x)  =  3x  —  7,  /(x)  =  ln(x).  When  we  talk  about  “the 
function  /(x)  =  x2,”  we  are  really  talking  about  that  function  / :  R  — ►  R  which 
associates  with  any  x  e  R,  its  square  x2;  that  is  to  say,  x  m-  x2.  As  a  binary 
relation,  /  =  {(x,x2)  |  x  £  R).  When  this  function  is  graphed  as  usual  in  the 
xy-plane,  we  are,  in  actual  fact,  making  a  picture  of  the  ordered  pairs  which 
comprise  it. 
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3.1.3  DEFINITIONS 


EXAMPLE  2 


EXAMPLE  3 


EXAMPLE  4 


Let  f :  A  B  be  a  function  from  A  to  B. 

•  The  domain  of  /,  written  domf,  is  the  set  A. 

•  The  target  of  /  is  the  set  B. 

•  The  range  or  image  of  /,  written  mg  /,  is 

mg  /  =  [b  e  B  |  (a,  b)  e  /  for  some  a  €  A} 

=  [b  e  B  |  b  =  f(a)  for  some  a  e  A}. 

•  The  function  is  onto  or  surjective  if  its  range  is  the  target,  mg  /  =  B\  that 
is,  every  b  e  B  is  of  the  form  b  =  f  (a)  for  some  a  e  A;  equivalently. 


For  any  b  e  B,  the  equation  b  =  f(x)  has  a  solution  x  e  A. 


•  It  is  one-to-one  (1-1)  or  injective  if  and  only  if  different  elements  of  A  have 
different  images:  in  symbols,  a\  ^  a2  — >  f(a\)  /  f(a2)\  equivalently  (taking 
the  contrapositive), 


If  f(a\)  =  f(a2),  then  a\  =  a2. 


•  It  is  a  bijection  or  bijective  junction  if  it  is  both  one-to-one  and  onto. 

Some  Discrete  Examples 

Suppose  A  =  {1, 2,  3,  4},  B  =  [x,  y,  z]  and 

/  =  {(L  x),  (2,  y),  (3,  z),  (4,  y)}. 

Then  /  is  a  function  A  ->  B  with  domain  A  and  target  B.  Since  mg/  = 
[x,y,  z]  =  B,  f  is  onto.  Since  /( 2)  =  /( 4)  (=  y)  but  2  ^  4,  /  is  not  one- 
to-one.  [In  fact,  there  can  exist  no  one-to-one  function  A  ->  B.  Why  not?  See 
Exercise  25(a).]  A 

Suppose  A  =  [1,  2,  3],  B  =  jx,  y,  z,  w]  and 

/  =  {(1,  w),  (2,  y),  (3,  x)}. 

Then  f :  A  ^  B  is  a  function  with  domain  A  and  range  {w,  y,  x}.  Since  mg  /  ^ 
B,  f  is  not  onto.  [No  function  A  —>■  B  can  be  onto.  Why  not?  See  Exercise  25(b).] 
This  function  is  one-to-one  because  /( 1),  /( 2),  and  /( 3)  are  all  different:  If 
f(a\)  =  f(a2),  then  a \  =  a2.  A 

Suppose  A  =  [1,  2,  3],  B  =  [x,  y,  z), 

/  =  {(l,z),  (2,  y),  (3,  y)}  and  g  =  {(1,  z),  (2,  y),  (3,  x)}. 

Then  /  and  g  are  functions  from  A  to  B.  The  domain  of  /  is  A  and  doing  =  A 
too.  The  range  of  /  is  [z,  y],  which  is  a  proper  subset  of  B,  so  /  is  not  onto. 
On  the  other  hand,  g  is  onto  because  rngg  =  [z,  y,  x]  =  B.  This  function  is  also 
one-to-one  because  g(l),  g( 2),  and  g(3)  are  all  different:  If  g{a\)  =  g(a2).  then 
a\  =  a2.  Notice  that  /  is  not  one-to-one:  /( 2)  =  /( 3)  (=  y),  yet  2^3.  A 
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EXAMPLE  5 


EXAMPLE  6 


Let  /:  Z  -»  Z  be  defined  by  f(x)  =  2x  —  3.  Then  dom  /  =  Z.  To  find  rng/, 
note  that 

b  e  rng  /  b  =  2a  —  3  for  some  integer  a 

b  =  2 (a  —  2)  +  1  for  some  integer  a 

and  this  occurs  if  and  only  if  b  is  odd.  Thus,  the  range  of  /  is  the  set  of  odd 
integers.  Since  mg  /  ^  Z,  /  is  not  onto.  It  is  one-to-one,  however:  If  f(x\)  = 
/(X2),  then  2*1  —  3  =  2*2  —  3  and  x\  =  xi-  A 

Let  /:  N  — >  N  be  defined  by  f(x)  =  2x  —  3.  This  might  look  like  a  perfectly 
good  function,  as  in  the  last  example,  but  actually  there  is  a  difficulty.  If  we  try  to 
calculate  /( 1),  we  obtain  /(l)  =  2(1)  —  3  =  — 1  and  — 1  ^  N.  Hence,  no  function 
has  been  defined.  A 


PROBLEM  7.  Define  / :  Z  -*  Z  by  f(x)  =  x2  —  5x  +  5.  Determine  whether  or 
not  /  is  one-to-one  and/or  onto. 

Solution.  To  determine  whether  or  not  /  is  one-to-one,  we  consider  the  pos¬ 
sibility  that  f(x i)  =  f(x 2).  In  this  case,  x2  —  5xi  +  5  =  x\  —  5x2  +  5,  so 
x2  —  x|  =  5xi  —  5x2  and  (xi  —  X2)(xj  +  X2)  =  5(xi  —  X2).  This  equation  indeed 
has  solutions  with  xi  ^  X2\  Any  x\ ,  X2  satisfying  x\  +xi  =  5  will  do,  for  instance, 
xi  =  2,  X2  =  3.  Since  /( 2)  =  /( 3)  =  —1,  we  conclude  that  /  is  not  one-to-one. 

Is  /  onto?  Recalling  that  the  graph  of  /(x)  =  x2  —  5x  +  5,  x  e  R,  is  a 
parabola  with  vertex  (|,  — |),  clearly  any  integer  less  than  -1  is  not  in  the  range 
of  /.  Alternatively,  it  is  easy  to  see  that  0  is  not  in  the  range  of  /  because 
x2  -  5x  +  5  =  0  has  no  integer  solutions  (by  the  quadratic  formula).  Either 
argument  shows  that  /  is  not  onto.  I 


PROBLEM  8.  Define  /:  Z  — »  Z  by  f(x)  =  3x3  —  x.  Determine  whether  or  not 
/  is  one-to-one  and/or  onto. 

Solution.  Suppose  f(x  1)  =  f(x 2)  for  x\,  X2  e  Z.  Then  3x3  —  x\  =  3x|  —  X2,  so 
3(x3  —  x3)  =  xi  —  X2  and 

3(Xj  —  X2)(X|  +XiX2+X2)=X]  —  X2- 

If  X]  /  X2,  we  must  have  x2  +X1X2  +  x\  =  which  is  impossible  since  x\  and 
X2  are  integers.  Thus,  x\  =  X2  and  /  is  one-to-one. 

Is  /  onto?  If  yes,  then  the  equation  b  =  f(x)  =  3x3  —  x  has  a  solution 
in  Z  for  every  integer  b.  This  seems  unlikely  and,  after  a  moment’s  thought,  it 
occurs  to  us  that  the  integer  b  —  1,  for  example,  cannot  be  written  this  way: 
1  =  3x3  —  x  for  some  integer  x  implies  x(3x2  —  1)  =  1.  But  the  only  pairs  of 
integers  whose  product  is  1  are  the  pairs  1 ,  1  and  —  1 ,  —  1 .  So  here,  we  would 
require  x  =  3x2  —  1  =  1  or  x  =  3x2  —  1  =  —  1,  neither  of  which  is  possible. 
The  integer  b  =  1  is  a  counterexample  to  the  assertion  that  /  is  onto,  so  /  is 
not  onto.  i 


Define  g:  Z  — >  Z  by  g(x)  —  2x2  +  lx.  Is  g  onto?  Is  g  one-to-one? 
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EXAMPLE  9 


Figure  3.1 

EXAMPLE  10 

EXAMPLE  11 

EXAMPLE  12 


Functions  of  a  Real  variable 

Let  / :  R  ->  R  be  defined  by  f(x)  =  2x  —  3.  The  domain  of  /  is  R  and  mg  /  =  R 
since  any  real  number  y  can  be  expressed  y  =  2x  —  3  (for  x  =  ^(y  +  3)). 
Graphically,  this  means  that  any  horizontal  line  intersects  the  graph  of  y  =  2x  —  3. 
(See  Fig  3.1.)  Since  mg  /  =  R,  /  is  onto.  It  is  also  one-to-one  (the  argument  in 
Example  5  can  be  used  again),  so  being  onto  and  one-to-one,  it  is  a  bijection  from 
R  to  R.  A 


The  graph  of  y  =  2x  —  3. 


Let  g :  R  -»•  R  be  defined  by  g(.t)  =  x2.  The  domain  of  g  is  R;  the  range  of  g  is 
the  set  of  nonnegative  real  numbers.  Since  this  is  a  proper  subset  of  R,  g  is  not 
onto.  Neither  is  g  one-to-one  since  g(3)  =  g(— 3),  but  3  ^  —3.  A 

Define  h :  [0,  oo)  -»  R  by  h(x)  =  x2.  This  function  is  identical  to  the  function  g 

of  the  preceding  example  except  for  its  domain.  By  restricting  the  domain  of  g  to 
the  nonnegative  reals  we  have  produced  a  function  h  which  is  one-to-one  since 

h(x i)  =  h(x2)  implies  x2  =  x 2  and  hence  x\  =  ±X2-  Since  x\  >  0  and  xi  >  0, 

we  must  have  xi  =  xi.  A 

Let  /:  R  — >  R  be  defined  by  f(x)  =  3x2  —x.  Students  of  calculus  should  be  able 
to  plot  the  graph  of  /  and  see  immediately  that  /  is  onto,  but  not  one-to-one. 
(See  Fig  3.2.)  A 


Figure  3.2  The  graph  of  y  =  3x3  -  x. 
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3.1.4  REMARK 


3.1.5  THE  ABSOLUTE 
VALUE  FUNCTION 


Contrast  this  last  example  with  Problem  8  on  page  74.  There  we  saw  that  the 
function  with  the  same  rule  as  the  one  here  but  with  Z  as  domain  was  one-to-one 
but  not  onto.  A  function  is  more  than  a  rule;  domain  and  range  are  critical  too. 


The  absolute  value  of  a  number  x,  denoted  |x|,  is  defined  by 


x  if  x 
—x  if  x  < 


3.1.6  THE  FLOOR  AND 
CEILING  FUNCTIONS 


The  absolute  value  function  is  a  function  with  domain  R  and  range  [0,  oo)  = 
{y  e  R  |  y  >  0}.  It  is  not  one-to-one  because,  for  example,  |2|  =  |  —  2|. 

For  any  real  number  x,  the  floor  of  x,  written  |_x_l,  is  the  greatest  integer  less 
than  or  equal  to  x,  that  is,  the  unique  integer  |_*J  satisfying 


X  —  1  <  [xj  <  X. 


The  ceiling  of  x,  written  |"x~|,  is  the  least  integer  greater  than  or  equal  to  x,  that 
is,  the  unique  integer  fx]  satisfying 

X  <  fxl  <  X  +  1  . 

For  instance,  L2.01J  =2,  |_1.99J  =  1,  L15J  =  15.  L-2.0IJ  =  -3,  and  [2.01]  =3, 
T1.991  =  2,  f!51  =  15,  T— 2.011  =  -2. 


Figure  3.3 


The  graph  of  the  floor  function  y  =  |xj. 


By  the  floor  function,  we  mean  the  function  /:  R  — ►  R  defined  by  /(x)  = 
[xj.  (At  one  time,  the  floor  function  was  commonly  called  the  greatest  integer 
function.)  Similarly,  the  ceiling  function  is  the  function  g :  R  -*  R  defined  by 
g(x)  =  fx] .  The  domain  of  these  functions  is  R  and  both  have  range  Z.  The  graph 
of  the  floor  function  is  shown  in  Fig  3.3. 

Functions  whose  graphs  resemble  the  floor  function  are  encountered  fre¬ 
quently.  In  January  2001,  the  Canadian  post  office  set  rates  for  mail  within  Canada 
as  shown  in  Table  3.1.  The  graph  of  this  postal  function  would  certainly  resemble 
the  floor  function,  with  jumps  at  30,  50,  100,  and  200  gms. 


The  identity  Function 

The  special  function  we  now  define  looks  innocuous  and  it  is,  but  it  arises  in  so 
many  situations  that  it  is  helpful  to  give  it  a  name. 
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Table  3.1  2001  Canadian  postal  rates. 

Weight  w  (gms) 

Cost 

w  <  30 

$0.47 

30  <  w  <  50 

$0.75 

50  <  w  <  100 

$0.94 

100  <  w  <  200 

$1.55 

200  <  w  <  500 

$2.05 

3.1.7  DEFINITION 


For  any  set  A,  the  identity  function  on  A  is  the  function  la  -  A 
la(o)  =  a  for  all  a  e  A.  In  terms  of  ordered  pairs, 


A  defined  by 


i A  =  {(a,  a)  |  a  e  A}. 

When  there  is  no  possibility  of  confusion  about  A,  we  will  often  write  i,  rather 
than  la-  (The  Greek  symbol  l  is  pronounced  “yota”,  so  that  “la”  is  read  “yota 
sub  A.” 


The  graph  of  the  identity  function  on  R  is  the  familiar  line  with  equation 
y  =  x.  The  identity  function  on  a  set  A  is  indeed  a  function  A  — ►  A  since,  for 
any  a  €  A,  there  is  precisely  one  pair  of  the  form  (a,  y)  e  i,  namely,  the  pair 
(a,  a). 


Answers  to  Pauses 


Prove  that  the  identity  function  on  a  set  A  is  one-to-one  and  onto. 

1.  Since  the  graph  of  g(x)  =  2x 2  +  lx,  x  e  R,  is  a  parabola,  g  is  not  onto. 
To  determine  whether  or  not  g  is  one-to-one,  we  must  see  if  g(*i)  =  g(x2) 
implies  that  x\  must  equal  X2-  Suppose  g(xj)  =  g(x2).  Then  2xf  +  7x\  = 
2xn+lx2  and  so  2(x\  —  X2)(x\  +X2 )  =  7(X2  —  x\).  If  x\  ^  X2,  then  x\+X2  = 
—  which  is  impossible  for  x\,  X2  e  Z.  Thus,  x\  =  X2',  g  is  one-to-one. 

2.  If  i(a i)  =  t(«2).  then  a\  —  ai  (since  i(a\)  =  a \  and  i(a 2)  =  02),  so  1  is 
one-to-one.  To  show  it  is  onto,  we  have  to  show  that  the  equation  a  =  i( x) 
has  a  solution  for  any  a.  It  is  evident  that  x  =  a  works. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Determine  whether  or  not  each  of  the  following  relations 
is  a  function  with  domain  {1, 2,  3,  4).  For  any  relation 
that  is  not  a  function,  explain  why  it  isn’t. 

(a)  [BB]  /  =  {(1,  1),  (2.  1).  (3,  1),  (4,  1),  (3,  3)} 

(b)  /  =  {(1,2),  (2,  3),  (4,  2)) 

(c)  [BB]  /  =  {(1,  1),  (2.  1),  (3,  1),  (4,  1)} 


(d)  /  =  {(1,1),  (1,2),  (1,3),  (1,4)} 

(e)  /  =  {(1,4),  (2,  3),  (3,  2),  (4,  1)) 

2.  Suppose  A  is  the  set  of  students  currently  registered  at 
the  University  of  Calgary.  B  is  the  set  of  professors  at  the 
University  of  Calgary,  and  C  is  the  set  of  courses  currently 
being  offered  at  the  University  of  Calgary.  Under  what 
conditions  is  each  of  the  following  a  function? 
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(a)  [BB]  {(a,  b)  |  a  is  taking  a  course  from  b } 

(b)  {(a,  c)  |  a’s  first  class  each  week  is  in  c] 

(c)  {(a,  c)  |  a  has  a  class  in  c  on  Friday  afternoon} 
Explain  your  answers. 

3.  [BB]  Suppose  A  and  B  are  nonempty  sets.  Can  A  x  B 
ever  be  a  function  A  — >  B‘)  Explain. 

4.  Give  an  example  of  a  function  N  — >■  N  which  is 

(a)  [BB]  one-to-one  but  not  onto; 

(b)  onto  but  not  one-to-one; 

(c)  neither  one-to-one  nor  onto; 

(d)  both  one-to-one  and  onto. 

5.  Let  X  be  the  set  of  all  countries  in  the  British  Common¬ 
wealth  and  Y  be  the  set  of  all  people  who  live  in  these 
countries. 

(a)  [BB]  Show  that  Prime  Minister:  X  ->  Y  and 
Domicile;  Y  — »  X  are  functions. 

(b)  Show  that  Prime  Minister  is  one-to-one  but  not  onto. 

(c)  Show  that  Domicile  is  onto  but  not  one-to-one. 

6.  [BB]  Give  examples  of  some  common  functions  whose 
graphs  resemble  the  graph  of  the  floor  function. 

7.  Let  S  =  (1,2,  3, 4,  5)  and  define  /:  S  Z  by 

x2  +  I  if  x  is  even 
2*  —  5  if  jc  is  odd. 

Express  /  as  a  subset  of  S  x  Z.  Is  /  one-to-one? 

8.  The  addition  and  multiplication  of  real  numbers  are 
functions  add,  mult:  R  x  R  ->  R,  where 

add(*,  y)  =  x  +  y,  multi*,  y)  =  xy. 

(a)  [BB]  Is  add  one-to-one?  Is  it  onto? 

(b)  Is  mult  one-to-one?  Is  it  onto? 

Explain  your  answers. 

9.  [BB]  Define  g:  Z  — *  B  by  g(*)  =  [*[  +  1.  Deter¬ 
mine  (with  reasons)  whether  or  not  g  is  one-to-one  and 
whether  or  not  it  is  onto  in  each  of  the  following  cases. 

(a)  B  =  Z 

(b)  B  =  N 

10.  Define  / :  A  — >  A  by  f  (x)  =  3*  +  5.  Determine  (with 
reasons)  whether  or  not  /  is  one-to-one  and  whether  or 
not  /  is  onto  in  each  of  the  following  cases. 

(a)  [BB]  A  =  Q 

(b)  A  =  N 

11.  Define  h :  A  —*  A  by  h(x)  =  x2  +  2.  Determine  (with 
reasons)  whether  or  not  h  is  one-to-one  and  whether  or 
not  h  is  onto  in  each  of  the  following  cases. 


(a)  A  =  Z 

(b)  A  =  N 

12.  Define  g:  A  —>  A  by  g (x )  =  3.r2  +  14*  —  51.  Deter¬ 
mine  (with  reasons)  whether  or  not  g  is  one-to-one  and 
whether  or  not  g  is  onto  in  each  of  the  following  cases. 

(a)  A  =  Z 

(b)  A  =  R 

13.  Define  /:  A  — *■  B  by  /(*)  =  x2  +  14*  —  51.  Deter¬ 
mine  (with  reasons)  whether  or  not  /  is  one-to-one 
and  whether  or  not  it  is  onto  in  each  of  the  following 
cases. 

(a)  [BB]  A  =  N,  B  =  {b€Z\b>  -100) 

(b)  A  =  Z,  B  as  in  (a) 

(c)  A  =  R,  B  =  {b  e  R  |  b  >  -100) 

14.  Find  the  domain  and  range  of  each  of  the  given  functions 
of  a  real  variable.  In  each  case,  determine  whether  or 
not  the  function  is  one-to-one  and  whether  or  not  it  is 
onto. 

(a)  [BB]  /:  R  -*  R  defined  by  /(*)  =  *3. 

(b)  g:  R  ->  R  defined  by  g(*)  =  *|*|. 

(c)  p :  (|.  oo )  ->  R  defined  by  /)(*)  =  log2(3*  —  4). 

(d)  f:R—>  R  defined  by  /(*)  =  2X~'  +  3. 

15.  (a)  [BB]  Define  /:  R  -»  R  by  /(*)  =  3*3+*.  Graph  / 

in  order  to  determine  whether  or  not  /  is  one-to-one 
and/or  onto. 

(b)  [BB]  Define  / :  Z  ->  Z  by  /(*)  =  3*3  +  *.  Deter¬ 
mine  (with  reasons)  whether  or  not  /  is  one-to-one 
and/or  onto. 

16.  (a)  Define  g :  R  — »■  R  by  g(x)  —  *3  —  *  +  1 .  Graph  g  in 

order  to  determine  whether  or  not  g  is  one-to-one 
and/or  onto. 

(b)  Define  g :  Z  — *■  Z  by  g(*)  =  *3  —  *  +  1.  Determine 
(with  reasons)  whether  or  not  g  is  one-to-one  and/or 
onto. 

(c)  Repeat  (b)  for  the  function  g:  N  — >  N  defined  by 

g(*)  =  *3  —  *  +  1. 

17.  Determine  whether  or  not  each  of  the  following  defines 
a  one-to-one  and/or  an  onto  function.  Either  give  a  proof 
or  exhibit  a  counterexample  to  justify  every  answer. 

(a)  [BB]  f(n,  m)  =  2 n  +  3m;  / :  N  x  N  — >  N 

(b)  [BB]  f(n,  m)  =  2 n  +  3m;  / :  Z  x  Z  — ►  Z 

(c)  /(«,  m)  =  14  n  +  22  m;  /:NxN->N 

(d)  /(n,m)  =  89«  +  246/h ;  /:  Z  x  Z  — >  Z  [Hint: 
1  =  (— 17)(246)  +  47(89).] 

(e)  fin ,  tn)  =  n2  -F  in2  +  1 ;  / :  Z  x  Z  — >  N 

(f)  f(n,  m)  =  L^J  +l;/:NxN-rN 


3.2  Inverses  and  Composition  79 


18.  For  each  of  the  following,  find  the  largest  subset  A  of 
R  such  that  the  given  formula  for  / ( x )  defines  a  func¬ 
tion  /  with  domain  A.  Give  the  range  of  /  in  each 
case. 

(a)  [BB]  fix)  =  — !— 

x  —  3 

(b)  fix )  =  -=L= 

V 1  —  x 

19.  In  each  of  the  following  cases,  explain  why  the  given 
function  is  not  one-to-one.  Then  restrict  the  domain  of 
the  function  to  as  large  a  set  as  possible  so  as  to  make 
a  one-to-one  function. 

(a)  /  =  {(a,  a),  ( b ,  ft),  (c,  y),  id,  o')} 

(b)  [BB]  /:  R  — ►  R  defined  by  fix)  =  —4x2  + 
1  2jc  —  9 

(c)  / :  R  R  defined  by  fix)  =  sinjc 

20.  Let  S  be  a  set  containing  the  number  5.  Let  A  =  {/ : 

S  ->  S]  be  the  set  of  all  functions  S  — ►  S.  For  /,  g  e  A, 
define  /  ~  g  if  /(5)  =  g(5). 

(a)  Prove  that  ~  defines  an  equivalence  relation  on 
A. 

(b)  Find  the  equivalence  class  of  /  =  {(5,  a),  (a,  b), 
(b,  b )}  in  the  case  S  =  (5,  a,  b}. 

21.  Let  A  be  a  set  and  let  /:  A  -*■  A  be  a  function.  For 
-v,  y  €  A,  define  x  ~  y  if  f(x)  =  /(>’)• 

(a)  Prove  that  ~  is  an  equivalence  relation  on  A. 

(b)  For  A  =  R  and  fix)  =  [xj,  find  the  equivalence 
classes  of  0,  and  —  |. 

(c)  Suppose  A  =  { 1 , 2,  3,  4,  5,  6}  and 

/  =  {(1,2),  (2,1),  (3,1),  (4,  5),  (5, 6),  (6,1)}. 
Find  all  equivalence  classes. 

22.  [BB]  Let  X  =  {a,b\  and  Y  =  {1,2,  3). 


(a)  List  all  the  functions  from  X  to  Y  and  Y  to  X. 

(b)  List  all  the  one-to-one  functions  from  X  to  Y  and 

Y  to  X. 

(c)  List  all  the  onto  functions  from  X  to  Y  and  Y 
to  X. 

23.  Let  X  —  {a,  b,  c)  and  Y  =  { 1 , 2,  3,  4}. 

(a)  How  many  one-to-one  functions  are  there  from  X  — > 

Y  and  Y  -*■  X?  In  each  case,  list  all  the  func¬ 
tions. 

(b)  How  many  onto  functions  are  there  X  — >  Y  and 

Y  —>  X?  In  each  case,  list  all  the  functions. 

24.  [BB]  Given  sets  X  and  Y  with  |X|  =  m.  and  |T|  =  n, 
guess  a  general  formula  for  the  number  of  functions 
X  — >  Y .  (A  table  showing  the  possible  values  for  each 
possible  pair  m,  n  e  {1, 2,  3,  4)  may  be  helpful.) 

25.  Suppose  A  and  B  are  sets,  A  containing  n  elements  and 
B  containing  m  elements. 

(a)  If  n  >  in,  prove  that  no  function  A  — »■  B  can  be 
one-to-one. 

(b)  If  m  >  n,  prove  that  no  function  A  — >  B  can  be 
onto. 

26.  (a)  [BB  (-»)]  Suppose  that  A  and  B  are  sets  each  con¬ 

taining  the  same  (finite)  number  of  elements  and  that 
/ :  A  — >  B  is  a  function.  Prove  that  /  is  one-to-one 
if  and  only  if  /  is  onto. 

(b)  Give  an  example  of  a  one-to-one  function  N  ->  N 
which  is  not  onto.  Does  this  contradict  (a)?  Explain. 

(c)  Give  an  example  of  an  onto  function  N  — >  N  which 
is  not  one-to-one.  Does  this  contradict  (a)?  Explain. 

27.  [BB]  Let  / :  R  -*•  R  be  defined  by  fix)  =  x  +  LjcJ  . 

(a)  Graph  this  function. 

(b)  Find  the  domain  and  range  of  /. 

28.  Define  s:  R  — ►  R  by  six)  =  x  —  [xj.  Is  s  one-to-one? 
Is  it  onto?  Explain. 


3.2  Inverses  and  Composition 

The  Inverse  of  a  Function 

Suppose  that  /  is  a  one-to-one  onto  function  from  A  to  B.  Given  any  b  e  B, 
there  exists  a  €  A  such  that  fia)  =  b  (because  /  is  onto)  and  only  one  such 
a  (because  /  is  one-to-one).  Thus,  for  each  b  e  B,  there  is  precisely  one  pair 
of  the  form  (a,  b)  e  /.  It  follows  that  the  set  {( b ,  a)  \  (a,  b )  e  /},  obtained  by 
reversing  the  ordered  pairs  of  /,  is  a  function  from  B  to  A  (since  each  element 
of  B  occurs  precisely  once  as  the  first  coordinate  of  an  ordered  pair). 
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EXAMPLE  13  If  A  =  { 1,  2,  3,  4}  and  B  =  {je,  y,  z,  f },  then 

/  =  {(!.*),  (2.  y),  (3,  z).  (4.01 


is  a  one-to-one  onto  function  from  A  to  B  and,  reversing  its  pairs,  we  obtain  a 
function  B  A:  {(tv,  I ),  (y,  2),  (z,  3),  (/.  4)).  A 


3.2.1  DEFINITION 


A  function  / :  A  — >  B  has  an  inverse  if  and  only  if  the  set  obtained  by  reversing 
the  ordered  pairs  of  /  is  a  function  B  — A.  If  f:  A  — »  B  has  an  inverse,  the 
function 


/"'  ={(h,a)\(a,b)  6/} 


is  called  the  inverse  of  /. 

We  pronounce  /”',  “/  inverse,”  terminology  which  should  not  be  confused 
with  4:  /~1  is  simply  the  name  of  a  certain  function,  the  inverse  of  / ? 

If  f :  A  —>■  B  has  an  inverse  /-I  :  B  A,  then  /_1  also  has  an  inverse 
because  reversing  the  pairs  of  /-l  gives  a  function,  namely  /:  thus,  (/_l)~'  =  /. 


EXAMPLE  14  If  A  =  {1,  2,  3,4}  and  B  =  {.v,  y,  z,  ?},  and 

/  =  {(l.x),(2,y),(3,z),(4,  0} 


then 

/“'  -  {(*,  1),  (y,  2),  (z,  3),  (/,  4)} 


and  (/  1 )  1 


{ ( I ,  -E ) .  (2,  y),  (3,  z),  (4,  t)}  =  f . 


At  the  beginning  of  this  discussion,  we  saw  that  any  one-to-one  onto  function 
has  an  inverse.  On  the  other  hand,  suppose  that  a  function  /:  A  -*  B  has  an 
inverse  Then  /  must  be  onto  for,  given  b  e  B,  there  is  some  pair 

(b,a)  e  (since  dom  f~]  =  B),  and  so  the  pair  ( a,b )  is  in  /.  Moreover,  / 
must  be  one-to-one. 

Why? 


So  we  have  the  following  proposition. 


3.2.2  PROPOSITION 


A  function  / :  A  — >  B  has  an  inverse  B  — *  A  if  and  only  if  /  is  one-to-one  and 
onto. 


For  any  function  g,  remember  that  (x,  y)  €  g  if  and  only  if  g(x)  =  y;  in 
particular,  (b,  a)  e  /-1  if  and  only  if  a  —  f~](b).  Thus, 

a  =  /-l  (b)  -o-  ( b ,  a)  e  /“'  ++  (a,  b)  e  f  f(a )  =  b. 


^Readers  may  wonder  whether  or  not  it  is  ever  possible  for  /  1  and  ;  to  coincide.  Indeed  it  is,  and  this 

observation  led  to  an  article  by  R.  Cheng,  A.  Dasgupta,  B.  R.  Ebanks,  L.  F.  Larson,  and  R.  B.  McFadden, 
“When  Does  /“ *  =  j  American  Mathematical  Monthly  105  (1998),  no.  8,  704-717. 


3.2  inverses  and  Composition  81 


EXAMPLE  15 


EXAMPLE  16 


EXAMPLE  17 


The  equivalence  of  the  first  and  last  equations  here  is  very  important: 


(2) 


a  =  f  1  if  and  only  if  f (a)  —  b 


For  example,  if  for  some  function  f,n=f  1  (—7),  then  we  can  conclude 
that  f(n)  =  -7.  If  /( 4)  =  2,  then  4  =  /~‘( 2). 

The  solution  to  the  equation  2x  =  5  is  x  =  f  =  2_1  •  5.  Generally,  to  solve 
the  equation  ax  =  h,  we  ask  if  a  0,  and  if  this  is  the  case,  we  multiply  each 
side  of  the  equation  by  a-1,  obtaining  x  =  a~]b  =  Since  all  real  numbers 
except  0  have  a  multiplicative  inverse,  checking  that  a  ^  0  is  just  checking  that 
a  has  an  inverse. 

Look  again  at  statement  (2).  We  solve  the  equation  f(x)  =  y  for  x  in  the 
same  way  we  solve  ax  =  b  for  x.  We  first  ask  if  /  has  an  inverse,  and  if  it  does, 
apply  /-'  to  each  side  of  the  equation,  obtaining  *  =  /_1(y). 

The  “application”  of  /-1  to  each  side  of  the  equation  y  =  /(x)  is  very  much 
like  multiplying  each  side  by  f~l.  “Multiplying  by  Z'1”  may  sound  foolish,  but 
there  is  a  context  (called  group  theory )  in  which  it  makes  good  sense.  Our  intent 
here  is  just  to  provide  a  good  way  to  remember  the  fundamental  relationship 
expressed  in  (2). 


If  /:  R  —>  R  is  defined  by  f(x)  =  2x  —  3,  then  /  is  one-to-one  and  onto,  so  an 
inverse  function  exists.  According  to  (2),  if  y  =  /_1(x),  then  x  =  f(y)  =  2y  —  3. 
Thus,  y  =  \(x  +  3)  =  /~'(x).  A 

Let  A  ■#  {x  e  R  |  x  <  0},  B  =  {x  €  R  |  x  >  0}  and  define  /:  A  ->  B  by 
/( jc)  =  x2.  This  is  just  the  squaring  function  with  domain  restricted  so  that  it 
is  one-to-one  as  well  as  onto.  Since  /  is  one-to-one  and  onto,  it  has  an  inverse. 
To  obtain  f~\x),  let  y  —  f~](x),  deduce  [by  the  relationship  expressed  in  (2)] 
that  f(y)  =  x  and  so  y2  =  x.  Solving  for  y,  we  get  y  =  ±sfx.  Since  x  =  f(y), 
y  e  A,  so  y  <  0.  Thus,  y  =  —^/x\  f~](x)  =  —^Jx.  A 

Denoting  the  positive  real  numbers  by  R+,  let  /:  R  — >  R+  be  defined  by  f(x )  = 
3X .  Since  /  is  one-to-one  and  onto,  it  has  an  inverse.  To  find  f~^{x),  let  y  = 
f~[(x)  and  write  down  its  equivalent  form,  x  =  f(y)  =  3V.  Solving  for  y,  we 
get  y  =  log3  x.  So  f~\x)  =  log3  x.  A 


PROBLEM  18. 


Let  A  =  [x  |  x  7^  1)  and  define  /: 


A  ->  R  by  f{x)  = 


Ax 

2x  —  1 


Is  /  one-to-one?  Find  rng /.  Explain  why  /:  A  — »  rng  /  has  an  inverse.  Find 
dom/-1,  rng/'1,  and  a  formula  for  /~’(jc). 


Solution.  Suppose  f(a\)  =  fiai).  Then  - 

2a\  —  1 

8a  i02  —  4a2,  hence  ai  =  ai.  Thus  /  is  one-to-one. 


402 

2a2  -  1  ’ 


so  8aia2  —  4ai  = 


82  Chapter  3  Functions 


Next, 


y  e  rng  /  y  =  fix)  for  some  x  e  A 

there  is  an  x  e  A  such  that  y  = 


4.x 


2x  —  1 

there  is  an  x  e  A  such  that  2 xy  —  y  —  4x 
•o-  there  is  an  x  e  A  such  that  x(2y  —  4)  =  y. 


If  V  =  2,  the  equation  x(2y  —  4)  =  y  becomes  0  =  2  and  no  x  exists.  On  the  other 

v 

hand,  if  y  ^  2,  then  2y  —  4  ^  0  and  so,  dividing  by  2y  —  4,  we  obtain  x  = - . 

2y  —  4 

(It  is  easy  to  see  that  such  x  is  never  i;  that  is,  x  e  A.)  Thus  y  e  mg  /  if  and 
only  if  y  ±  2.  So  rng  /  =  B  =  {y  e  R  |  y  ^  2}. 

Since  f :  A  ^  B  is  one-to-one  and  onto,  it  has  an  inverse  /_l :  Z?  -*  A. 
Also,  dom /_l  =  rng /  =  I?  and  rng/~*  =  dom  f  =  A.  To  find  /-1(x),  set 
v  =  ./  1  (x ).  Then 


x  =  f(y)  = 


4  y 

2y  —  l 


and,  solving  for  y,  we  get  y  = 


x 

2x  -4 


r'w. 


i 


The  following  important  property  of  /  1  will  be  used  in  Section  3.3. 


3.2.3  PROPOSITION 


If/:  A 
onto. 


B  is  one-to-one  and  onto,  then  /  1  :  Z?  — A  is  also  one-to-one  and 


Prove  Proposition  3.2.3. 


Composition  of  Functions 


3.2.4  DEFINITION 


If  /:  A  — >  B  and  g:  B  — >•  C  are  functions,  then  the  composition  of  g  and  f  is 
the  function  g  o  f :  A  — ►  C  defined  by  (g  o  f)(a)  =  g(f(a))  for  all  a  6  A. 


EXAMPLE  19  If  A  =  {a,  b,  c },  B  =  {x,  y},  and  C  =  { u ,  v,  w],  and  if  / :  A  — >  B  and  g:  B  —*■  C 
are  the  functions 

/  =  {( a ,  x),  (b,  y),  (c,  x)},  g  =  {(x,  u ),  (y,  in)}, 

then 

(g  o  /)(«)  =  gif  {a))  =  g(x)  =  u, 

(g  °  f)(b)  =  gif(b))  =  g(y)  =  w, 

(g  o  f)(c)  =  gific))  =  g(x)  -  II 

and  so  g  o  /  =  {(a,  n),  (&,  w),  ( c ,  «)}.  A 


It  is  seldom  the  case  that  g  o  f  —  f  o  g.  In  this  last  example,  for  instance. 
/  o  g  is  not  even  defined.  We  must  be  careful  then  to  distinguish  between  “the 
composition  of  g  and  /”  and  “the  composition  of  /  and  g.” 
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EXAMPLE  20 


If  /  and  g  are  the  functions  R  — >■  R  defined  by 

f(x)  =  2x-3,  g(x)  =  x2  +  1, 
then  both  g  o  f  and  /  o  g  are  defined  and  we  have 

o  /)( X)  =  g(f(x))  =  g(2x  -  3)  =  (2x  -  3)2  +  1 


and 

(/  O  g)(x)  =  f(g(x))  =  f(x2  +  1)  =  2(x2  +  1)  -  3.  A 


EXAMPLE  21  In  the  definition  of  g  o  f,  it  is  required  that  mg/  c  B  =  dom  g.  If  /:  R  — >•  R 
and  g :  R  \  { 1 }  — >  R  are  the  functions  defined  by 

f(x)  =  2x  -3  and  g(x)  =  — 

x  —  I 

then  g  o  /  is  not  defined  because  mg  /  =  R  £  dom  g.  On  the  other  hand,  fog 
is  defined  and 

(/o*)0c)  =  2(j^-y)-3.  A 

EXAMPLE  22  If  / :  A  ->  A  is  any  function  and  i  denotes  the  identity  function  on  A,  then  /  ot  is 
a  function  A  -*  A.  Also,  for  any  a  e  A,  if  o  t)(a)  =  /(t(a))  =  fia).  Similarly, 
(t  o  f)ia)  =  fia)  for  any  a  e  A.  Thus,  /  o  i  =  f  =  i  o  /.  It  is  for  just  this  reason 
that  l  is  called  the  identity  function  on  A;  it  behaves  with  respect  to  o  the  way 
the  number  1  behaves  with  respect  to  multiplication.  A 


Under  what  conditions  are  two  functions  /  and  g  equal?  Since  a  function  is 
a  set  and  two  sets  are  equal  if  and  only  if  they  contain  the  same  elements,  it  must 
be  that  f  =  g  if  and  only  if  it  is  the  case  that 

(a,  b)  G  /  if  and  only  if  (a,  b)  e  g. 


3.2.5  EQUALITY  OF 
FUNCTIONS 


Suppose  f  —  g  and  a  e  dom  /.  Then  (a,  b)  e  f  for  some  b  —  fia),  so 
ia,b )  e  g.  Thus,  a  g  domg  and  b  =  gia).  This  shows  that  dom/  c  domg 
and,  for  every  a  €  dom/,  fia)  =  gia).  Reversing  the  roles  of  /  and  g,  we 
see  similarly  that  domg  c  dom  /  and  gia)  =  fia)  for  every  a  e  do  mg.  We 
are  led  to  the  following  necessary  and  sufficient  condition  for  the  equality  of 
functions. 

Functions  /  and  g  are  equal  if  and  only  if  they  have  the  same  domain,  same 
target,  and  fia)  =  gia)  for  every  a  in  the  common  domain. 


Notice  how  we  proved  that  /ot  =  /  in  Example  22.  We  noted  that  each  of  the 
functions  /ot  and  /  has  domain  A  and  target  A  and  proved  that  (/ot)(o)  =  f(a) 
for  all  a  e  A. 


Suppose  /:  A  —>  B  is  a  function.  Explain  why  (go  /  =  /. 

One  of  the  fundamental  properties  of  composition  of  functions  is  the  content 
of  the  next  proposition. 
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3.2.6  PROPOSITION 


Proof 


3.2.7  PROPOSITION 


Composition  of  functions  is  an  associative  operation. 

We  must  prove  that  (f  og)oh  =  f  o(goh)  whenever  each  of  the  two  functions — 
(fog)oh  and  /  o  (g  o  h) — is  defined.  Thus,  we  assume  that  for  certain  sets  A, 
B,  C,  and  D,  h  is  a  function  A  — >•  B,  g  is  a  function  Bi  — ►  C,  and  /  is  a  function 
C  -*■  D.  A  direct  proof  is  suggested. 

Since  the  domain  of  (/  o  g)  o  h  is  the  domain  of  /  o  (g  o  h)  (namely,  the  set 
A ),  we  have  only  to  prove  that  ((/  o  g)  o  h)(a)  =  (/  o  (g  o  h)){a)  for  any  a  e  A. 
For  this,  we  have 

((/  °g)  oh)(a)  =  {f  og){h{a))  =  f{g{h{a ))) 

and 

(/  o  (g  o  h))(a)  =  /((g  o  h)(a))  =  f(g(h(a))) 
as  desired.  I 

If  / :  A  —*■  B  has  an  inverse  :  B  — >  A,  then,  recalling  (2), 
f~\b)  =  a  if  and  only  if  b  —  f(a). 

So  for  any  a  e  A, 

a  =  rl(b)  =  f~l(f(a))  =  rl  of  (a). 

In  other  words,  the  composition  f~]  o  f  =  the  identity  function  on  A.  Simi¬ 
larly,  for  any  element  b  €  B, 

b  =  f(a)  =  /(/-'  (b))  =  f  o  f~\b). 

Thus,  the  composition  /o/_l  =  is  is  the  identity  function  on  B.  We  summarize. 

Functions  /:  A  — >■  B  and  g  :  B  ->  A  are  inverses  if  and  only  if  g  o  /  =  ia  and 
/  o  g  =  t  b  ;  that  is,  if  and  only  if 

g(/(a))  =  a  and  f(g(b))  =  b  for  all  a  €  A  and  all  b  e  B. 


PROBLEM  23.  Show  that  the  functions  /:  R  — »■  (1,  oo)  and  g:  (1,  oo)  ->  R 
dehned  by 

fix)  =  32t  +  1,  g(x)  =  \  log3  (x  -  1) 

are  inverses. 

Solution.  For  any  x  e  R. 

ig  o  f)(x)  =  g(/(x))  =  g02x  +  1) 

=  i(log3K32-r  + 1)- l]) 

=  5(log332A)  =  \_2x  =  x 
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and  for  any  x  e  (1,  oo), 

(/  o  g)(x)  =  f(g(x))  =  f(\  log3 (x  -  1)) 

_  iog3u-i))  _|_  j 
=  3l0®3^-')  +  1  =  (x  -  1)  +  1  =  x. 

By  Proposition  3.2.7,  /  and  g  are  inverses.  I 


Answers  to  Pauses 


We  have  noted  the  similarity  in  the  solutions  to  the  equations  ax  =  b  and 
f  (x)  =  y.  If  a  (or  /)  has  an  inverse,  we  multiply  by  this  inverse  and  obtain  x  = 
a~xb{ or  x  =  /”'  (y)).  It  is  interesting  also  to  observe  that  the  connection  between 
a  and  its  inverse  a  1 2  (their  product  is  1)  is  strikingly  similar  to  the  connection 
between  /  and  its  inverse  /  1  (their  composition  is  the  identity  function). 

3.  Suppose  f(a\)  =  /(tfi)  for  some  a\,  a2  €  A.  Let  b  =  f(a i)  (=  f(a2)). 
Then  (a j,  b)  and  (#2,  b)  are  both  in  /,  so  ( b ,  a\)  and  (b,  af)  are  both  in  /-1. 
But  /-1  is  a  function  and  so  each  element  of  B  occurs  just  once  as  the  first 
coordinate  of  an  ordered  pair  in  f~l .  It  follows  that  a\  =  at;  /  is  one-to-one. 

4.  To  show  that  is  one-to-one,  assume  f~l(b j)  —  1  (^2)-  Setting  a\  = 

f~\b  1)  and  02  =  /_l(&2),  we  have  a  1  =  ai-  Therefore,  b\  —  f(a\)  = 
f(ao)  =  7>2-  To  show  that  / _1  is  onto,  let  a  €  A.  Then  /(a)  =  b  e  B,  so 
a  =  f~l(b)  as  desired. 

5.  Since  f :  A  ^  B  and  t#:  B  -*  B,  the  composition  (5  o  /  is  a  function 

A  fi.  Thus,  dorm#  o  f  =  A  =  dom  /  and  target  /  =  target  lb  o  f  =  B. 

Also,  for  any  a  e  A,  tg  o  /(a)  =  t ^ (/"(«))  =  /(a),  because  /(a)  6  #  and 
iB{b)  =  b  for  all  b  e  5.  Thus,  the  functions  igo  /  and  /  are  equal. 


EXERCISES 


77!£  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Let  A  =  {1, 2,  3,  4,  5).  Find  the  inverse  of  each  of 
the  following  functions  /:  A  —*■  A. 

(a)  /  =  {(1,2),  (2,  3),  (3, 4),  (4,  5),  (5,1)} 

(b)  /  =  {(!,  2),  (2,4),  (3,  3),  (4.  1),  (5,  5)} 

2.  Graph  each  of  the  following  functions  and  find  each 
inverse.  Specify  the  domain  and  range  of  each  inverse. 

(a)  [BB]  / :  R  ->  R  given  by  f(x)  =  3x  +  5 

(b)  / :  R  -*•  R  given  by  f(x)  =  x3  -  2 

(c)  /8:  (3,  00)  R  given  by  /3(jc)  =  log2(3.r  -  4) 

(d)  g:  R  ->  R  given  by  g(x)  =  x\x\ 

3.  Show  that  each  of  the  following  functions  f:  A  -*■  R 
is  one-to-one.  Find  the  range  of  each  function  and  a 
suitable  inverse. 


(a)  [BB]  A  =  [x  €  R  |  *  /  4],  f(x)  =  1  + - - 

jr  —  4 

(b)  A  =  [xeR|x#-l),  /(jc)  =  5--4- 

1  +  x 

(c)  A  =  [x  e  R  |  x  /  -  j],  f{x )  =  3X  - 

z  2x  4-  1 

(d)  A  =  {x  €  R  \x  #  -3},  fix)  =  4 

x  +  3 

4.  Define  /:  N  by  /(«)  -  j 

(a)  Show  that  /  has  an  inverse. 

(b)  Find  /“'  (2586). 
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5.  Suppose  A  is  the  set  of  all  married  people,  mother:  A  — »• 
A  is  the  function  which  assigns  to  each  married  per¬ 
son  his/her  mother,  and  father  and  spouse  have  similar 
meanings.  Give  sensible  interpretations  of  each  of  the 
following: 

(a)  [BBJ  mother  o  mother 

(b)  mother  o  father 

(c)  father  o  mother 

(d)  mother  o  spouse 

(e)  spouse  o  mother 

(f)  father  o  spouse 

(g)  spouse  o  spouse 

(h)  (spouse  o  father)  o  mother 

(i)  spouse  o  (father  o  mother) 

6.  Let  S  =  {1, 2,  3, 4,  5)  and  T  =  { 1 , 2,  3,  8,  9)  and  define 
functions  f :  S  —>■  T  and  g :  S  — »  S  by 

/  =  {(!,  8),  (3,  9),  (4,  3),  (2,  1),  (5,2)1  and 

g  =  {(l,2),  (3,  1),(2,2),  (4,3),  (5,2)}. 

(a)  [BB]  Find  fog  or  explain  why  fog  is  not  defined. 
Repeat  for  g  o  /,  /  o  /,  and  gog. 

(b)  Which  of  /,  g  are  one-to-one?  Which  are  onto? 
Explain. 

(c)  Find  /“'  if  it  exists,  ff  it  doesn't,  explain  why  not. 

(d)  Find  g^1  if  it  exists.  If  it  doesn't,  explain  why  not, 

7.  Let  S  =  {1,2,3,4}  and  define  functions  /,#:£—»•  S 
by 

/  =  {(1,3),  (2,  2),  (3,  4),  (4,  1)1  and 
£  =  {(1,4),  (2,  3),  (3,  1).  (4,  2)}. 

Find 

(a)  [BB]  g~'  o  f  og 

(b)  /og-1  og 

(c)  go  f  o  g~~ 1 

(d)  g  o  g~ 1  o  / 

(e)  f~l  og-'  o  f  og 

8.  [BB]  Let  /,  g  and  li :  R  — >  R  be  defined  by 

f(x)=x  +  2,  g(x)  =  '  h(x)  =  3. 

xz  +  1 

Compute  gof(x),  f  og(x),  h  og  o  f(x),  goho  f(x), 
go  f-'  o  /(x),  and  f'oso  f(x). 


9.  Let  A  be  the  set  A  =  {x  e  R  |  x  >  0)  and  define 
/,  g,  h  :  A  ->  R  by 

x  1 

f(x)  =  — —,  g(x)  =  -,  h(x)  =  .V  +  1. 

-V  +  1  .v 

Find  g  o  f{x),  f  o  g(x),  /logo  f(x),  and  fog  o  h(x). 

10.  [BB]  Let  / :  R  ->  R  be  a  function,  let  c  be  a  real  number, 
and  define  g :  R  — *•  R  by  g(jc)  =  x  —  c.  Explain  how  the 
graphs  of  /  and  go/  are  related. 

11.  Let  /:  R  — >•  R  be  a  function,  let  c  be  a  real  number, 
and  define  g :  R  -»  R  by  g(x)  =  x  —  c.  Explain  how  the 
graphs  of  /  and  fog  are  related. 

12.  Let  /:  R  — >  R  be  a  function  and  let  g:  R  ->  R  be 
defined  by  g(x)  =  —  |jc|. 

(a)  [BB]  How  are  the  graphs  of  /  and  /  o  g  related? 

(b)  How  are  the  graphs  of  /  and  go/  related? 

13.  Let  A  denote  the  set  R  \  [0,  1).  Let  i  denote  the 
identity  function  on  A  and  define  the  functions 
/,  g,  h,s,r  :  A  ->  A  by 

f  (x)  =  1  —  — ,  g(x)  =  — 1 — ,  h(x)=-, 
x  I  —  x  x 

x 

r(x)  =  - f,  s(x)  =  1  —  x. 

x  —  1 

(a)  Show  that  f  o  g  =  i  and  g  o  r  —  s.  Complete  the 
table,  thereby  showing  that  the  composition  of  any 
two  of  the  given  functions  is  one  of  the  given  five, 
or  the  identity.3 


o 

i  f  g  h  r  s 

l 

f 

L 

g 

S 

h 

r 

s 

(b)  Which  of  the  given  six  functions  have  inverses? 
Find  (and  identify)  any  inverses  which  exist. 

14.  Let  A  =  [1, 2,  3)  and  define  f\,  //  /;,  fs :  A  — »  A 
as  follows: 

/,  =  {(1,  1),  (2,  3),  (3,  2)} 
fl  =  [(1,3),  (2,  2),  (3,  1)) 

/3  =  {(L2),  (2,1),  (3,  3)} 

/4  =  {(1,2),  (2,  3),  (3,  1)} 
fs  =  {(L3J,  (2,1),  (3,  2)}. 


3The  table  you  will  construct  in  this  exercise  is  the  multiplication  table  for  an  important  mathematical  object  known  as  a  group.  This  particular 
group  is  the  smallest  one  which  is  not  commutative. 
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(a)  Show  that  each  composite  function  /,■  o  fj  is  one  of 
the  given  functions,  or  the  identity,  by  completing  a 
table  like  the  following  one.4  For  example,  if  fi  o 
fo  =  /4,  the  entry  in  row  /2,  column  fo,  should 
be  /4. 


0 

i  fi  h  h  U  fs 

( 

f\ 

h 

h 

h 

fs 

(b)  Find  the  inverses  of  those  of  the  six  functions  in  (a) 
which  have  inverses. 

15.  Let  S  =  (1,2,  3,  4,  5}  and  let  /,  g,  h :  S  — >  S  be  the 
functions  defined  by 

/  =  {(!.  2),  (2,  1),  (3,  4),  (4,  5),  (5,  3)} 


*  =  {(1,3),  (2,  5),  (3,  1),  (4,  2),  (5,4)) 


h  =  {(1,2),  (2, 2),  (3, 4),  (4,  3),  (5,1)). 


(a)  [BB]  Find  fog  and  go/.  Are  these  functions  equal? 

(b)  Explain  why  /  and  g  have  inverses  but  h  does  not. 
Find  /~l  and  g”1. 

(c)  Show  that  (/  o  g)_1  =  g-1  o  f~l  /  /“ 1  o*-1. 

16.  [BB]  Let  A  =  {x  e  R  |  *  ^  2}  and  B  =  [x  e  R  |  .v  ^  l). 
Define  / :  A  B  and  g :  B  ->•  A  by 


/(*)  = 


x 

x  —  2' 


(a)  Find  (fog)(x). 

(b)  Are  /  and  g  inverses?  Explain. 

17.  Let  A  be  a  subset  of  R  and  suppose  /:  A  — >■  A  is  a 

1 

function  with  the  property  that  /  (jt)  =  — -  for  all 

jc  e  A. 


(a)  Show  that  0  /  A. 

(b)  Show  that  f4  =  iA,  the  identity  function  on  A.  (By 
/4,  we  mean  fofofof.) 


18.  Suppose  /:  A  —*■  B  and  g:  B  -*■  C  are  functions. 

(a)  [BB]  If  go  f  is  one-to-one  and  /  is  onto,  show  that 
g  is  one-to-one. 

(b)  If  g  o  /  is  onto  and  g  is  one-to-one,  show  that  /  is 
onto. 

19.  (a)  [BB]  Prove  that  the  composition  of  one-to-one  func¬ 

tions  is  a  one-to-one  function. 

(b)  Show,  by  an  example,  that  the  converse  of  (a)  is  not 
true. 

(c)  Show  that  if  g  o  /  is  one-to-one,  then  f  must  be 
one-to-one. 

20.  (a)  Prove  that  the  composition  of  onto  functions  is  onto. 

(b)  [BB]  Show,  by  an  example,  that  the  converse  of  (a) 
is  not  true. 

(c)  Show  that  if  g  o  /  is  onto,  then  g  must  be  onto. 

21.  [BB]  Is  the  composition  of  two  bijective  functions  bijec- 
tive?  Explain. 

22.  Define  / :  Z  — ►  Z  by 


/(«)  = 


In  —  2 

(/(/(«  +  4» 


n  >  1000 
n  <  1000. 


(a)  Find  the  values  of  /(1000),  /( 999),  /(998), 
/( 997),  and  /( 996). 

(b)  Guess  a  formula  for  f(n). 

(c)  Guess  the  range  of  /. 

23.  Show  that  the  function  / :  R  — >■  R  defined  by  f(x)  = 

x 

■  is  one-to-one. 

six1  +  2 

Find  mg  /  and  a  suitable  inverse. 

24.  Let  / :  R  ->  R  be  the  function  defined  by  t  (x)  = 
2Lxj  -x. 

(a)  Graph  t  and  use  this  to  decide  whether  or  not  t  has 
an  inverse. 

(b)  [BB]  Prove  that  t  is  one-to-one  without  using  the 
graph  of  t. 

(c)  Prove  that  t  is  onto  without  using  the  graph  of  t. 

(d)  Find  a  formula  for  t~x(x). 


3.3  One-to-One  Correspondence  and  the  Cardinality  of  a  Set 

In  this  section  we  think  about  the  size  of  a  set,  or,  more  correctly,  its  "cardinality.” 
We  show  that  our  natural  instincts  about  the  relative  sizes  of  finite  sets  can  be 


4 As  with  Exercise  13,  this  table  also  describes  the  multiplication  for  the  smallest  group  which  is  not  commutative. 
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3.3.1  DEFINITIONS 


EXAMPLES  24 


extended  to  allow  the  comparison  of  infinite  sets  too,  though  sometimes  with 
surprising  results.  It  turns  out,  for  example,  that  the  natural  numbers  and  the  set 
of  all  rational  numbers  have  the  same  “size,”  whereas  both  these  sets  are  “smaller” 
than  the  set  of  real  numbers. 

In  discussions  about  cardinality,  it  is  common  to  employ  the  term  one-to-one 
correspondence  rather  than  its  synonym,  one-to-one  onto  function.  Either  of  the 
statements 

“A  and  B  are  in  one-to-one  correspondence”  or 

“There  is  a  one-to-one  correspondence  between  A  and  B” 

means  that  there  is  a  one-to-one  onto  function  /  from  A  to  B.  Recall  from 
Section  3.2  that  this  implies  there  is  also  a  one-to-one  onto  function  from  B  to 
A ,  namely,  /-1, 


A  finite  set  is  a  set  which  is  either  empty  or  in  one-to-one  correspondence  with 
the  set  { 1,  2,  3, . . .  ,  n]  of  the  first  n  natural  numbers,  for  some  n  e  N.  A  set  which 
is  not  finite  is  called  infinite. 

If  A  is  a  finite  set  and  A  ^  0,  then,  for  some  natural  number  n,  there  exists  a 
one-to-one  onto  function  /:  {1, 2,  3,  . . .  ,  n]  — >  A.  Letting  f(i)  —  a,  ,  this  means 
that  A  =  {<a i ,  ci2 . an}. 

If  A  is  a  finite  set,  the  cardinality  of  A  is  the  number  of  elements  in  A;  this 
is  denoted  |A|.  Thus,  |0|  =  0  and,  if  A  =  [a\ ,  «2.  •  ■  •  ,  an },  then  | A |  —  n. 

•  \{a,  b,  x}\  =  3, 

•  the  letters  of  the  English  alphabet  comprise  a  set  of  cardinality  26, 

.  \{x  e  R  |  jr2+  1  =0}|  =0.  ▲ 

How  might  we  determine  if  two  finite  sets  contain  the  same  number  of  ele¬ 
ments?  We  could  count  the  elements  in  each  set,  but  if  the  sets  were  large,  this 
method  would  be  slow  and  highly  unreliable.  Confronted  with  two  enormous  pails 
of  jelly  beans  and  asked  to  determine  if  the  number  of  jelly  beans  in  each  pail  is 
the  same,  the  best  strategy  would  be  to  pair  the  beans  in  the  two  pails.  Remove 
one  jelly  bean  from  each  pail  and  lay  the  two  aside;  remove  another  jelly  bean 
from  each  pail,  lay  these  aside,  and  so  on.  If  the  last  jelly  beans  in  each  pail  are 
removed  together,  then  certainly  we  would  know  that  the  numbers  in  each  pail 
were  the  same. 

If  the  numbers  are  the  same,  the  set  of  pairs  of  beans  removed  from  the  pails 
defines  a  one-to-one  onto  function  from  the  jelly  beans  in  one  pail  to  those  in 
the  other.  Different  beans  in  the  first  pail  are  paired  with  different  beans  in  the 
second  (one-to-one)  and  every  bean  in  the  second  pail  is  paired  with  some  bean 
in  the  first  (onto). 

If  two  finite  sets  A  and  B  have  the  same  cardinality — we  write  | A |  =  \B\ — 
then  there  is  a  one-to-one  onto  function  from  A  to  B.  Conversely,  if  there  is  a 
one-to-one  onto  function  from  A  to  B,  then  |A|  =  |f?|.  This  idea  allows  us  to 
extend  the  notion  of  “same  size”. 


3.3  One-to-One  Correspondence;  Cardinality  89 


3.3.2  DEFINITION 


EXAMPLES  25 


Sets  A  and  B  have  the  same  cardinality  and  we  write  |A|  =  \B\,  if  and  only  if 
there  is  a  one-to-one  correspondence  between  them;  that  is,  if  and  only  if  there 
exists  a  one-to-one  onto  function  from  A  to  B  (or  from  B  to  A). 

•  a  i->-  x,  b  i->  y  is  a  one-to-one  correspondence  between  { a ,  b }  and  {x,  _y } ; 
hence,  |{n,  b}|  =  \{x,  y}|  (=  2). 

•  The  function  /:  N  — »  N  U  {0}  defined  by  f(n)  =  n  —  1  is  a  one-to-one 
correspondence  between  N  and  N  U  (0);  so  |N|  =  | N  U  {0} | . 

•  The  function  /:  Z  — >  2Z  defined  by  f(n)  =  2n  is  a  one-to-one  correspon¬ 

dence  between  the  set  Z  of  integers  and  the  set  2Z  of  even  integers;  thus,  Z 
and  2Z  have  the  same  cardinality.  ▲ 

Many  people  find  the  second  and  third  examples  here  surprising  and  perhaps 
mildly  disconcerting;  nevertheless,  they  must  be  accepted.  Remember  that  our 
definition  of  “same  cardinality”  coincides  with  what  we  know  to  be  the  case  with 
finite  sets. 

We  can  prove  that  {a,  b)  and  {x,  y)  have  the  same  cardinality  (without  using 
the  word  “two”)  by  pairing  the  elements  in  the  two  sets 

flKr,  b  h->  y 

and  this  is  exactly  how  we  argue,  for  instance,  that  Z  and  2Z  have  the  same 
cardinality.  We  pair  0  with  0,  1  with  2,  -1  with  —2,  2  with  4,  —2  with  -4,  n 
with  2 n. 

Many  readers  of  this  book  will  be  familiar  with  the  graph  of  the  function 
R  -»  R  defined  by  /( x)  =  2*.  This  is  a  one-to-one  function  and  its  range  is 
the  set  R+  of  positive  reals.  We  conclude  that  the  real  numbers  and  the  positive 
real  numbers  have  the  same  cardinality.  In  fact,  the  real  numbers  have  the  same 
cardinality  as  any  interval,  for  example,  the  interval  (0,  1).  (Exercise  12(c)  shows 
that  any  open  interval  has  the  same  cardinality  as  (0,  1).) 

PROBLEM  26.  Show  that  the  set  R+  of  positive  real  numbers  has  the  same 
cardinality  as  the  open  interval  (0,  1)  =  {x  e  R  |  0  <  x  <  1). 

Solution.  Let  /:  (0,  1)  ->  R+  be  defined  by 

/(■*)  =  -  -  L 
x 

We  claim  that  /  establishes  a  one-to-one  correspondence  between  (0,  1)  and  R+. 

To  show  that  /  is  onto,  we  have  to  show  that  any  y  e  R+  is  fix )  for  some 
x  e  (0,  1).  But 

1  1 

y  = - 1  implies  x  = - 

x  1  +  y 

which  is  in  (0,  1)  since  y  >  0.  Therefore, 

y  e  R+  implies  y  = 


90  Chapter  3  Functions 


so  /  is  indeed  onto.  Also,  /  is  one-to-one  because 


fix l)  =  f(X2)  -+  ~  ~  1  =  -  -  1 

X\  X2 

1  _  1 

X]  X2 

— >•  X\  =  Xi.  I 


Just  as  the  notion  of  cardinality  partitions  finite  sets  into  classes  (each  class 
consisting  of  sets  with  the  same  number  of  elements),  so  cardinality  also  partitions 
infinite  sets.  In  this  book,  we  consider  just  two  classes  of  infinite  sets. 


3.3.3  DEFINITIONS 


A  set  A  is  countably  infinite  if  and  only  if  |A|  =  |N|  and  countable  if  and  only  if 
it  is  either  finite  or  countably  infinite.  A  set  which  is  not  countable  is  uncountable. 


The  symbol  Ko  (pronounced  “aleph  naught”)  has  traditionally  been  used  to 
denote  the  cardinality  of  the  natural  numbers.  Thus,  a  countably  infinite  set  has 
cardinality  Kq.  As  the  name  suggests,  countably  infinite  sets  are  those  whose 
elements  can  be  listed  in  a  systematic  and  definite  way,  because  to  list  them  is 
to  rank  them  as  first,  second,  third,  and  so  on,  and  this  ranking  establishes  the 
required  one-to-one  correspondence  with  N. 


PROBLEM  27.  Show  that  |Z|  =  H0. 

Solution.  The  set  of  integers  is  infinite.  To  show  they  are  countably  infinite,  we 
list  them:  0,  1,  —1,  2,  —2,  3,  —3, ....  This  list  is  just  /( 1),  /( 2),  /( 3),  . . .  where 
/ :  N  — >  Z  is  defined  by 


fin)  = 


\n  if  n  is  even 
—  ^(n  —  1)  if  n  is  odd. 


which  is  certainly  both  one-to-one  and  onto.  I 

Can  you  express  this  function  /  with  a  single  equation? 

When  we  say  list  “in  a  systematic  and  definite  way”  we  mean  that  when 
one  person  terminates  the  list  (with  three  dots),  another  should  be  able  to  con¬ 
tinue.  In  a  good  list,  it  should  be  possible  to  determine  the  position  of  any  ele¬ 
ment.  It  is  not  acceptable  to  “list”  the  integers  as  0,  1, 2,  3,  . . .  ,  —  1,  —2,  —3, . . . 
because  —1  does  not  have  a  position:  0  is  the  first  number,  1  is  the  second,  but 
in  what  position  is  —1?  Similarly,  it  is  not  acceptable  to  “list”  the  integers  as 
...  ,  — 3,  —2,  —1,0,  1 , 2,  3, . . .  because  no  integer  has  a  position.  What’s  the  first 
element  in  the  list?  In  fact,  this  is  not  a  list  at  all.  On  the  other  hand,  listing  the 
integers  this  way — 0,  1,  —1, 2,  —2,  3,  —3,  . . . — assigns  every  integer  a  definite 
position. 
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In  the  list  0,  1,  —1,  2,  —2,  3,  —3, . . . ,  —1  is  the  third  number  and  3  is  the  sixth. 
In  what  position  is  1003?  What  number  is  in  position  1003? 

The  following  argument  establishes  the  remarkable  fact  that  the  set  N  x  N  = 
{(m,  n)  |  m ,  n  e  N}  is  countable;  it  has  the  same  cardinality  as  N. 


PROBLEM  28.  Show  that  |N  x  N|  =  |N|. 

Solution.  The  elements  of  N  x  N  can  be  listed  by  the  scheme  illustrated  in 
Fig  3.4.  The  arrows  indicate  the  order  in  which  the  elements  of  N  x  N  should 
be  listed — (1,  1),  (2,  1),  (1,  2),  (1,  3),  (2,  2), . . .  .  Wherever  the  arrows  terminate, 
there  is  no  difficulty  in  continuing,  so  each  ordered  pair  acquires  a  definite 
position.  I 


t 

(1,4)  (2,4)  (3,4)  (4,4)  ... 

\ 

(1,3)  (2,3)  (3,3)  (4,3)  ... 

t  \  \ 

(1,2)  (2,2)  (3,2)  (4,2)  ... 

\  \  \ 

(1,1) — >■  (2,1)  (3,1)-K4,1)  (5,1)  ... 

Figure  3.4  The  elements  of  N  x  N  can  be  systematically  listed  by  starting  at  (1,  1)  and  following  the 
arrows. 


In  Fig  3.4  suppose  we  write  the  positive  rational  f  at  the  point  (m,  n).  Then 
every  rational  number  appears  in  the  picture,  actually  many  times  over,  since,  for 
instance,  |  appears  beside  the  points  (2,  3),  (4,  6),  (6,  9),  (40.  60),  and  so  on.  By 
ignoring  the  repetitions  of  a  given  fraction,  the  set  of  positive  rationals  can  be 
enumerated.  We  conclude  that  the  set  of  positive  rationals  is  a  countable  set. 

The  scheme  which  we  have  just  described  for  listing  the  positive  rationals  begins 
1,2,  j,  5  (ignore  the  number  |,  which  has  already  been  listed),  3,4,  |,  j, 
Continue  the  list  in  order  to  determine  the  position  of  |. 

Not  so  long  ago,  there  came  to  light  another  way  to  establish  a  one-to-one 
correspondence  between  the  positive  rationals  and  N.  For  a  very  neat  and  direct 
approach  to  an  interesting  problem,  see  Exercise  35  of  Section  4.3. 

This  result  is  astonishing  when  you  consider  how  much  “bigger”  than  the 
natural  numbers  the  set  of  positive  rationals  “seems.”  It  is  nonetheless  true  that 
these  infinite  sets  have  exactly  the  same  cardinality.  Actually,  more  is  true.  Since 
we  are  able  to  list  the  positive  rational  numbers — a\  ,02,03, ...  — it  becomes  easy 
to  list  all  the  rationals,  simply  by  intertwining  the  positive  and  negative  rationals, 
like  this — 0,  a \,  —01,02,  —02,  <33,  ...  .  What  do  we  conclude?  The  entire  set  Q  of 
rational  numbers  is  countable! 


-5 

In  the  enumeration  of  the  rationals  just  described,  in  what  position  is  |? 


92  chapter  3  Functions 


At  this  point,  we  might  be  tempted  to  conjecture  that  every  set  is  countable, 
but  such  is  indeed  not  the  case!  We  present  another  old  and  well-known  argument 
which  shows  that  the  set  of  real  numbers  is  not  countable.  Actually  it  shows  that 
the  set  (0,  1)  =  {x  e  R  |  0  <  x  <  1}  of  real  numbers  between  0  and  1  is  not 
countable.  It  then  follows  that  R  cannot  be  countable  because  if  it  were,  so  also 
would  the  subset  (0,  1 )  because  a  subset  of  a  countable  set  is  countable.  (See  the 
remarks  at  the  end  of  this  section.) 


PROBLEM  29.  Show  that  (0,  1)  is  uncountable. 

Solution.  The  argument  we  present  is  a  splendid  example  of  a  proof  by  contra¬ 
diction. 

Suppose  that  there  does  exist  a  list  a],  no,  A3,  •••  of  all  the  real  numbers 
between  0  and  1.  Write  each  of  these  numbers  in  decimal  form,  agreeing  to  write 
0.200 , . .  rather  than  0. 199 ... ,  for  example,  so  that  the  same  number  does  not 
appear  twice.  Our  list  would  look  like  this: 

a i  =  0.a\\a\2O,\'iCi\4  •  ■  ■ 

02  =  0-G2\a22a23a24  '  '  ' 

<23  =  0.a3i  A32A33A34  •  •  • 


and  remember,  every  real  number  between  0  and  1  is  supposed  to  be  here.  We 
can,  however,  write  down  a  number  which  is  guaranteed  not  to  be  in  the  list. 
Look  at  an-  If  a\\  =  1,  let  b\  =  2;  otherwise,  let  b[  =  1.  Next  look  at  022-  If 
022  =  1,  let  62  =  2;  otherwise,  let  b2  =  1.  Then  look  at  033,  in  general  continuing 
down  the  diagonal  of  the  following  square  array 

«i  1  a j2  o 1 3  •  •  • 

02]  022  «23  ••  • 

031  032  033  ••• 


and  defining 


2  if  an  =  1 
1  if  an  ^  1 . 


Thus,  bi  is  always  different  from  an.  Consider  the  number  b  =  0.b\b2bj  •  •  ■  • 
Since  each  bi  is  1  or  2 ,  b  ^  0.000 . . .  =  0  and  b  ^  0.999 Thus,  b  is  in 
the  interval  (0,  1),  so  it  must  be  a,-,  for  some  i.  But  b  ^  a\  since  b  differs  from 
a  1  in  the  first  decimal  place,  b  ^  02  since  b  differs  from  02  in  the  second  decimal 
place  and,  generally,  b  ^  a,  since  b  differs  from  a;  in  the  /  th  decimal  place.  The 
hypothesis  that  the  reals  in  (0,  1)  are  countable  has  led  us  to  a  contradiction,  so 
the  hypothesis  is  false.  The  real  numbers  are  not  countable.  I 


The  “diagonal”  argument  just  sketched,  due  to  the  great  German  mathemati¬ 
cian  Georg  Cantor  (1845-1918),  should  be  read  several  times.  It  is  ingenious  and 
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3.3.4  CONTINUUM 
HYPOTHESIS 

3.3.5  REMARK 


Answers  to  Pauses 


certainly  not  the  sort  of  thing  most  of  us  would  think  up  by  ourselves.  Here’s  the 
important  thing  to  remember,  though.  Many  infinite  sets  are  countable — the  natu¬ 
ral  numbers,  the  integers,  the  rationals — but  there  are  some  sets  which  are  much 
bigger  (the  real  numbers  for  instance).  Infinite  sets  come  in  different  “sizes,”  just 
as  do  finite  sets. 

At  this  point,  some  readers  may  wonder  if  there  are  any  subsets  of  the  real 
numbers  whose  cardinality  lies  between  that  of  N  and  R.  This  is  a  very  good 
question!  In  work  for  which  he  was  awarded  a  Fields  Medal5  in  1966,  Paul  Cohen 
showed  that  the  usual  axioms  of  set  theory  do  not  give  enough  information  to  settle 
it.  As  a  consequence,  mathematicians  sometimes  add  the  Continuum  Hypothesis, 
first  formulated  by  Cantor,  to  these  basic  axioms. 

There  is  no  set  A  with  Ko  <  |A|  <  |R|. 

This  is  the  first  time  we  have  used  the  inequality  symbol  <  with  infinite  sets. 
While  we  have  defined  the  notion  of  “same  cardinality”,  we  have  not  discussed 
the  possibility  of  one  infinite  set  being  “smaller”  than  another.  For  any  two  sets  A 
and  B ,  we  define  |A|  <  |B|  if  and  only  if  there  is  a  one-to-one  function  A  —*■  B, 
and  |A|  <  |fi|  if  |A|  <  |Z?|  but  |A|  7^  \B\.  It  is  not  hard  to  see  that  this  agrees  with 
intuition  on  finite  sets,  so  it  is  a  reasonable  definition.  The  problem,  however,  is 
to  prove  that  if  |A|  <  |f?|  and  |B|  <  |A|,  then  | A |  =  |B|.  This  is  easy  if  A  and  B 
are  finite,  but  the  deep  “Schroder-Bemstein”  theorem  for  infinite  sets. 

We  conclude  with  two  remarks  which  you  are  encouraged  to  remember  and 
use  freely  in  the  exercises  which  follow. 

1.  A  subset  of  a  countable  set  is  countable. 

Why?  In  this  section,  we  showed  that  the  set  of  positive  rationals  is  countable 
by  taking  the  listing  of  all  fractions  suggested  by  Fig  3.4  and  omitting  repe¬ 
titions.  In  a  similar  way,  if  B  is  a  subset  of  a  countable  set  A,  the  elements 
of  B  can  be  enumerated  by  taking  a  listing  of  A  and  omitting  those  elements 
which  are  not  in  B. 

2.  The  concept  of  “same  cardinality”  is  an  equivalence  relation  on  sets,  in 
particular,  it  is  transitive.  (See  Exercise  10.) 

It  follows  that  if  we  want  to  prove  that  a  certain  set  A  is  countable,  for 
instance,  it  is  sufficient  to  show  that  A  has  the  same  cardinality  as  some  other 
set  which  is  known  to  be  countable.  Similarly,  if  we  want  to  prove  that  a  set 
is  uncountable,  it  is  enough  to  show  that  it  has  the  same  cardinality  as  a  set 
which  we  know  is  uncountable. 

6.  f(n)  =  \[\  +  (— 1)"(2ai  -  1)] 

7.  In  the  listing  of  the  integers,  natural  number  k  comes  in  position  2k  and 
integer  —  k  in  position  2k  +  1.  Thus,  1003  comes  in  position  2006  and  the 
number  in  position  1003  is  —501. 


-A  Fields  Medal  is  the  highest  honor  which  can  be  bestowed  upon  a  mathematician.  There  is  no  Nobel 
Prize  in  mathematics. 
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8.  The  listing  1,  2,  3,  4,  §,  §,  5,  6,  |,  |,  | . 

Thus,  |  is  in  position  15. 

9.  The  rationals  are  listed  0,  1,  —1, 2,  —2,  4,  —  i,  ....  The  number  |  is  30th  in 
the  list. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  An  enormous  crowd  covers  the  field  of  a  base¬ 
ball  stadium  for  a  rock  concert.  Suggest  an  easy  way  to 
determine  whether  or  not  the  size  of  the  crowd  exceeds 
the  number  of  available  seats  in  the  stadium. 

2.  [BB]  At  first  glance,  the  perfect  squares — 1,4,9,16 
25, . . . — seem  to  be  a  very  sparse  subset  of  the  natu¬ 
ral  numbers.  Galileo,  however,  argued  that  there  were 
as  many  perfect  squares  as  there  are  natural  numbers. 
What  was  his  reasoning  likely  to  have  been? 

3.  Find  a  one-to-one  correspondence  between  each  of  the 
following  pairs  of  sets: 

(a)  [j,  y,  { a ,  b,  c}}  and  {14,  —3,  t } 

(b)  2  Z  and  17Z 

(c)  [BB]  N  x  N  and  {a  +  bi  €  C  |  a,  b  e  N] 

(d)  N  and  { j  |  m  e  N,  n  =  1 , 2} 

4.  Find  the  inverse  of  the  function  / :  N  — >■  Z  defined  in 
Problem  27. 

5.  (a)  Suppose  / :  N  -*■  Z  is  a  one-to-one  onto  function. 

Prove  that  the  function  j:NxN-tNxZ  defined 
by  g(m,  n)  =  {m.  /(«))  is  one-to-one  and  onto. 

(b)  Find  a  one-to-one  correspondence  between  N  x  N 
and  N  x  Z. 

6.  [BB]  True  or  false  and  explain:  If  A  C  5,  then  |A|  and 
|fi|  do  not  have  the  same  cardinality. 

7.  Suppose  S  is  a  set  and  for  A,  B  e  V{S),  we  define 
A  <  B  to  mean  |A|  <  \B\.  Is  this  relation  a  partial 
order  on  "P(S)?  Explain. 

8.  Show  that  for  any  sets  A  and  B,  |A  x  B\  =  \B  x  A|. 

9.  [BB]  Let  X,  Y  and  Z  be  sets. 

(a)  True  or  false:  (X  x  Y)  x  Z  =  X  x  (F  x  Z).  Provide 
a  proof  or  give  a  counterexample. 

(b)  Find  a  one-to-one  correspondence  (Xxf)xZ-> 
X  x  (Y  x  Z). 


10.  Prove  that  the  notion  of  same  cardinality  is  an  equiv¬ 
alence  relation  on  the  family  of  all  sets.  Explain,  with 
reference  to  Definition  3.3.2. 

11.  (a)  [BB]  Find  a  one-to-one  correspondence  between  the 

intervals  ( 1 , 00)  and  (3,  00).  What  do  you  con¬ 
clude  about  the  cardinalities  of  (1, 00)  and  (3,  00)? 
[Recall  that  (a,  00)  =  {x  \  x  >  «}.  See  Defini¬ 
tion  2.2.3.] 

(b)  Let  a  and  b  be  real  numbers.  Find  a  one-to-one 
correspondence  between  (a,  00)  and  ( b .  00). 

12.  (a)  [BB]  Prove  that  the  intervals  (0,  1)  and  (1,2)  have 

the  same  cardinality. 

(b)  Prove  that  (0,  1)  and  (4,  6)  have  the  same  cardinal¬ 
ity. 

(c)  Prove  that  any  two  open  intervals  (a.  b)  and  (c,  d) 
have  the  same  cardinality. 

13.  (a)  [BB]  Prove  that  the  intervals  (0,  1)  and  (10,  00) 

have  the  same  cardinality. 

(b)  Prove  that  the  intervals  (2,  5)  and  (10,  00)  have  the 
same  cardinality. 

(c)  Prove  that  the  intervals  (a,  b)  and  (c,  00)  have  the 
same  cardinality.  Assume  a  <  b. 

14.  Let  a  and  b  be  real  numbers  with  a  <  b.  Show  that  the 
set  R+  of  positive  real  numbers  has  the  same  cardinality 
as  the  open  interval  (a,  b)  =  {x  e  R  |  a  <  x  <  b). 

15.  [BB]  Prove  that  the  function  defined  by  f{x)  =  Y 
establishes  a  one-to-one  correspondence  between  the 
real  numbers  R  and  the  positive  real  numbers  R+.  What 
can  you  conclude  about  cardinality? 

16.  [BB]  Prove  that  R  and  the  interval  (a.  b)  have  the  same 
cardinality.  Assume  a  <  b. 

17.  Prove  that  each  of  the  following  sets  is  countable  by 
listing  its  elements  in  a  systematic  and  definite  way. 
(Show  at  least  the  first  dozen  terms  of  your  lists.) 
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(a)  all  positive  and  negative  integer  powers  of  2 

(b)  those  natural  numbers  which  leave  a  remainder  of 
1  when  divided  by  3 

(c)  N  x  (1,2,3} 

(d)  [BB]  those  positive  rational  numbers  ~  with  n  odd 

(e)  N  x  Z 

(D  ZxZ 

18.  Determine,  with  justification,  whether  each  of  the  fol¬ 
lowing  sets  is  finite,  countably  infinite,  or  uncountable: 

(a)  [BB]  [x  €  R  |  1  <  x  <  2} 

(b)  {x  e  Q  |  1  <  x  <  2} 

(c)  [^  |  m,  n  eN.ffl  <  100,  5  <  n  <  105} 

(d)  { j  |  m,  n  €l,m  <  100,  5  <  n  <  105} 

(e)  [BB]  [a  +  bi  €  C  |  a,b  €  N} 

(f)  {(a,  b)eQxQ\a  +  b=l) 

(g)  {(a,  b)  6  R  x  R  |  b  =  V\  -a2} 


21.  [BB]  Let  .Sj  and  S2  denote  spheres  of  radii  1  and  100, 
respectively.  Prove  that  the  points  on  the  surface  of  ,Sj 
and  those  on  the  surface  of  S2  are  sets  with  the  same 
cardinality. 

22.  [BB]  Let  S  be  an  infinite  set  and  let  x  be  an  element  not 
in  S.  Prove  that  S  and  S  U  {x}  are  sets  of  the  same  car¬ 
dinality.  (You  may  assume  that  S  contains  a  countably 
infinite  subset.) 

Prove  that  the  points  of  a  plane  and  the  points  of  a 
sphere  are  sets  of  the  same  cardinality. 

[BB]  Suppose  A  is  a  finite  set  and  B  is  a  countably 
infinite  set  and  that  A  n  B  =  0.  Show  that  A  U  B  is 
countably  infinite. 

(a)  Show  that  if  A  and  B  are  countable  sets,  then  so  is 
A  x  B. 

(b)  Show  that  the  set  of  all  polynomials  of  degree  at 
most  one  with  integer  coefficients  is  countable. 


19.  Which  of  the  following  sets  is  finite,  which  is  countably 

infinite,  which  is  uncountable?  Explain  your  answers. 

(a)  the  set  of  grains  of  sand  on  a  beach 

(b)  [3”  |  n  6  Z} 

(c)  the  set  of  words  in  the  English  language 

(d)  the  set  of  sentences  in  the  English  language 

20.  Give  an  example  of  each  of  the  following  or  explain 

why  no  example  exists. 

(a)  [BB]  an  infinite  collection  of  finite  sets  (no  two  the 
same)  whose  union  is  finite 

(b)  a  finite  collection  of  finite  sets  whose  union  is  infi¬ 
nite 

(c)  [BB]  an  infinite  collection  of  infinite  sets  whose 
union  is  finite 


26.  Prove  that  the  function  defined  by  f(x)  = 


x(x  —  1) 

defines  a  one-to-one  correspondence  from  (0,  1)  to  R. 


27.  Prove  that  for  any  set  X,  the  cardinalities  of  X  and  V(X) 
are  different. 


28.  Let  S  be  the  set  of  all  real  numbers  in  the  interval  (0,  1) 
whose  decimal  expansions  are  infinite  and  contain  only 
3  and  4 — for  example,  0.343434 . . .  and  0.333  . . .  ,  but 
not  0.34  =  0.34000 ....  Prove  that  S  is  uncountable. 


29.  Let  S  be  the  set  of  all  real  numbers  in  the  interval  (0,  1 ) 
whose  decimal  expansions  involve  only  0  and  1 .  Prove 
that  S  is  uncountable. 

■Vs  30.  What  is  the  Fields  Medal  and  after  whom  was  it  named? 
Write  a  short  note  about  this  prize  and  its  founder. 
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1.  Let  /  =  {(1,2),  (2,  3),  (3,  4),  (4,1)}  and  g  =  {(1,3), 
(2,  1),  (3,  4),  (4,  2),  (5,  1)}.  Find  f~l  and  g  o  /.  Is  g 
one-to-one?  Explain. 

2.  Let  /:  Z  — »  Z  be  defined  by  f(m)  =  3| m\  +  1.  Is  / 
one-to-one?  Is  /  onto?  Give  reasons  for  your 
answers. 

3.  Define  /:  Z  — »•  Z  by  f(x)  =  2x3  +  x.  Show  that  /  is 
one-to-one  but  not  onto. 

4.  Can  a  function  be  a  reflexive  relation?  Explain. 


5.  Is  it  possible  for  a  function  /:  R  — ►  R  to  be  symmet¬ 
ric  as  a  relation?  Give  an  example  or  explain  why  the 
answer  is  no. 

6.  Is  it  possible  for  a  function  / :  R  — *•  R  to  be  transitive  as 
a  relation?  Give  an  example  or  explain  why  the  answer 
is  no. 

7.  Answer  true  or  false  and  justify  your  answers. 

(a)  For  real  numbers  x  and  y,  \x\  =  y1  ->  x  =  y\y\. 

(b)  For  real  numbers  x  and  y,  x  =  y|y|  — >•  |x|  =  y2. 

(c)  For  real  numbers  x  and  y,  |x|  =  y2  x  =  y\y\. 
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8.  Find  subsets  A  and  B  of  R.  with  A  as  large  as  possible, 
such  that  / :  A  — >  B  defined  by  f  (x)  =  x2  +  Ax  —  7  is 
one-to-one  and  onto. 

9.  Suppose  f:  A  ->  B  and  g :  B  — »  C  are  functions. 
If  g  o  f  is  one-to-one  and  /  is  onto,  show  that  g  is 
one-to-one. 

10.  Let  | /4 1  =  n  and  \B\  =  m  for  n,m  e  N.  Use  the 
definition  of  cardinality  of  a  finite  set  to  show  that  if 
A  fl  B  =  0,  then  |/t  U  B\  =  n  +  m. 


11.  Show  that  the  open  intervals  (1,3)  and  (0,  oo)  have  the 
same  cardinality. 

12.  (a)  Let  S  be  the  set  of  all  real  numbers  in  the  interval 

(0,  1)  whose  decimal  expansions  contain  only  0's, 
2’s  and  7’s.  Prove  that  S  is  uncountable. 

(b)  Let  S'  be  the  elements  of  S  (defined  in  (a))  whose 
decimal  expansions  contain  only  finitely  many  2’s 
and  7’s.  What  is  the  cardinality  of  S'l 


INDUS 


CHAPTER 


4 

The  Integers 


4.1  The  Division  Algorithm 

What  are  some  of  the  important  properties  of  the  real  numbers?  Most  of  us  learned 
early  in  life  that  they  have  a  definite  order.  Later,  we  met  the  symbol  <,  whose 
properties  are  best  summarized  by  saying  that  it  is  a  partial  order;  that  is,  the 
binary  relation  <  is 


reflexive:  a  <  a  for  all  a  e  R, 

antisymmetric:  if  a  <  b  and  b  <  a  for  a,  b  e  R,  then  a  =  b,  and 

transitive:  if  a  <  b  and  b  <  c,  for  a,  b,  c  e  R,  then  a  <  c. 

(See  Section  2.5.) 

Readers  should  ensure  that  they  agree  with  the  following  inequalities: 

1  1  22 

-15  <  -8  <  —  <  -0.3  <  —  <  -0.25  <  0  <  1  <  3.14  <  n  <  —  <4. 
_3_  -  4  -  -  -  -  ~  ~  1  ~ 

Which  of  the  <  signs  in  the  preceding  list  could  be  changed  to  the  strict 

inequality  <?  The  answer  is  all  of  them,  except  for  —  |  <  —0.25.  Since  —  |  = 

—0.25,  the  strict  inequality  —  3  <  —0.25  is  not  true.  Is  it  clear  why  each  of  the 

remaining  <  signs  could  be  replaced  by  <  signs? 

Why  is  —5  <  —0.3?  Why  is  3.14  <  n  <  y ? 


4.1.1  PROPERTIES  OF  + 
AND 


We  can  add  two  real  numbers  a  and  b  and  obtain  their  sum  a  +  b,  or  we  can 
multiply  two  real  numbers  a  and  b  and  obtain  their  product  a  ■  b  (usually  written 
ab,  without  the  centered  dot).  These  operations  of  addition  and  multiplication 
satisfy  a  number  of  important  properties,  the  last  three  of  which  relate  to  order. 


Let  a,  b,  and  c  be  real  numbers.  Then 

1.  (closure)  a  +  b  and  ab  are  both  real  numbers. 

2.  (commutativity)  a  +  b  =  b  +  a  and  ab  =  ba. 
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3.  (associativity)  (a  +  b)  +  c  =  a  +  (b  +  c)  and  (ab)c  =  a(bc). 

4.  (identities)  a  +  0  =  a  and  a  ■  1  =  a. 

5.  (distributivity)  a(b  +  c)  =  ab  +  ac  and  ( a  +  b)c  =  ac  +  be. 

6.  (additive  inverse)  a  +  (—a)  =  0. 

7.  (multiplicative  inverse)  a{^)  =  \  ii  a  ^  0. 

8.  a  <  b  implies  a  +  c  <  b  +  c. 

9.  a  <  b  and  c  >  0  implies  ac  <  be. 

10.  a  <  b  and  c  <  0  implies  ac  >  be. 


A  third  well-known  operation,  subtraction,  is  defined  in  terms  of  addition  by 
the  rule 


a  —  b  =  a  +  (—b). 


4.1.2  WELL-ORDERING 
PRINCIPLE 


It  is  not  unusual  for  a  set  of  real  numbers  to  have  no  smallest  element;  for 
example,  there  is  no  smallest  element  in  the  set  {j,  i,  4, . . . }.  Similarly,  there 
is  no  smallest  positive  number.  On  the  other  hand,  this  sort  of  thing  does  not 
occur  with  sets  of  natural  numbers,  according  to  the  Well-Ordering  Principle. 

Any  nonempty  set  of  natural  numbers  has  a  smallest  element. 


In  the  rest  of  this  section,  we  concentrate  on  the  integers.  All  the  properties 
of  paragraph  4.1.1  hold  for  the  integers,  in  most  instances  because  they  hold  for 
all  real  numbers.  One  exception  is  the  closure  property,  which  simply  says  that 
the  sum  and  product  of  integers  are  integers.  We  also  say  that  the  integers  are 
closed  under  addition  and  under  multiplication.  Not  all  sets  of  real  numbers  are 
closed  under  these  operations;  for  instance,  the  set  of  odd  integers  is  not  closed 
under  addition  since,  for  example,  1+3  =  4,  which  is  not  odd. 

Ls  the  set  of  negative  integers  closed  under  multiplication?  Is  the  set  of  natural 
numbers  closed  under  multiplication?  Is  the  set  of  natural  numbers  closed  under 
subtraction? 


When  some  of  us  were  school  children,  division  of  58  by  17  was  performed 
with  a  configuration  like  this, 


which  served  to  illustrate  that  the  answer  is  “3  remainder  7.”  When  58  is  divided 
by  17,  the  quotient  is  3  and  the  remainder  is  7;  equivalently, 

58  =  3(17) +  7. 

It  is  also  true  that  58  =  2(17)  +  24,  but  we  were  taught  that  the  remainder  must 
be  less  than  the  divisor.  We  now  prove  that  this  sort  of  division  is  always  possible 
and,  moreover,  the  quotient  and  the  remainder  are  unique. 
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Given  natural  numbers  a  and  b,  there  are  unique  nonnegative  integers  q  and  r, 
with  0  <  r  <  b,  such  that  a  =  qb  +  r. 


Proof  Consider  the  sequence  of  nonnegative  multiples  of  b\  that  is,  0  b  =  0,  1  •  b  =  b, 
2-b  =  2b,  3  b  =  3b, . . .  .  The  first  term  in  this  increasing  sequence  of  numbers  is 
0,  which  is  less  than  a  since  a  is  a  natural  number.  On  the  other  hand,  some  term  is 
bigger  than  a  [for  example,  (2 a)b  >  a  because  2b  >  1],  so,  by  the  Well-Ordering 
Principle,  the  set  of  multiples  of  b  which  exceed  a  has  a  smallest  element,  say 
(q  +  1  )b.  So  we  have  qb  <  a  <  (q  +  l)b.  (See  Fig  4.1.)  Set  r  =  a  —  qb.  Since 
qb  <  a,  we  have  r  >  0.  Since  ( q  +  1  )b  >  a.  we  have  r  <  b.  Hence,  0  <  r  <  b, 
and  we  have  found  q  and  r  as  required. 


0  b  2b 


{ - f 


qb 


(q  +  1  )b 


Figure  4.1 


a 


To  see  that  q  and  r  are  unique,  assume  that  a  can  be  expressed  in  the  given 
form  in  two  ways;  that  is,  suppose  that  a  =  q\b  +  r \  and  a  =  q2b  +  rj  with 
0  <  r\  <  b  and  0  <  r?  <  b.  Then  (q \  —  qi)b  =  ri  —  r\.  Now  ( q\  —  q2)b  is  an 
integral  multiple  of  b,  while  —  b  <  ri  —  r\  <  b.  The  only  possibility  is  that  this 
multiple  is  0,  and  so  q\  =  qi,  r\  =  r2  as  desired.  I 


4.1.4  DEFINITIONS 


If  a  and  b  are  natural  numbers  and  a  =  qb  +  r  for  nonnegative  integers  q  and  r 
with  0  <  r  <  b,  the  integer  q  is  called  the  quotient  and  the  integer  r  is  called  the 
remainder  when  a  is  divided  by  b. 


When  a  and  b  are  both  positive  and  a  is  divided  by  b,  it  is  useful  to  note 
that  the  quotient  is  the  integer  part  of  the  number  displayed  by  a  calculator  used 
to  divide  a  by  b.  (The  remainder  is  a  —  qb.) 

PROBLEM  1.  Find  the  quotient  q  and  the  remainder  r  and  write  a  =  qb  +  r 
when  a  =  19  is  divided  by  b  =  7.  Find  the  quotient  and  remainder  when  589,621 
is  divided  by  7893  and  when  11,109,999,999  is  divided  by  1111. 

Solution.  When  19  is  divided  by  7,  the  quotient  is  2  and  the  remainder  is  5.  We 
have  19  =  2(7)  +  5. 

Dividing  589,621  by  7893,  one  author’s  calculator  displayed  74.701761.  Thus, 
q  =  74  and  r  =  589,621  -  74(7893)  =  5539. 

For  the  last  part  of  the  question,  we  have  to  be  a  little  resourceful  since  one  of 
the  numbers  here  is  too  large  for  the  author’s  unsophisticated  calculator!  Instead, 
we  note  that 


11,109,999,999=  11,110.000.000-  1 


=  1111(10,000,000)  -  1 
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=  1111(9,999,999+1)  -  1 
=  1111(9,999,999)  +  1110. 

So  the  quotient  is  9,999,999  and  the  remainder  is  1 1 10.  I 


With  only  slight  modifications,  Theorem  4. 1 .3  can  be  generalized  to  situations 
where  a  and  b  are  integers,  not  just  natural  numbers.  Note,  however,  that  in  order 
to  preserve  the  uniqueness  of  q,  the  possibility  b  =  0  must  be  excluded.1  (Also, 
in  this  case,  the  condition  0  <  r  <  b  would  be  impossible  to  meet.) 

Let  a,  b  e  Z,  b  ±  0.  Then  there  exist  unique  integers  q  and  r,  with  0  <  r  <  \b\, 
such  that  a  =  qb  +  r . 

In  Theorem  4.1.3,  we  proved  this  in  the  case  where  a  and  b  are  both  positive. 
If  a  =  0,  then  q  =  0,  r  =  0  gives  a  solution.  We  will  consider  the  other  cases 
individually. 

Case  1:  b  >  0  and  a  <  0. 

Since  —a  >  0,  we  can  apply  Theorem  4.1.3  to  — a  and  b  obtaining  q  and  r, 
with  0  <  r  <  b,  such  that  — a  =  qb  +  r.  Therefore,  a  =  ( —q)b  —  r.  If  r  =  0, 
a  =  ( —q)b ,  while  if  r  >  0,  a  =  (— q  —  1  )b  +  (b  —  r)  with  0  <  b  —  r  <  b  =  \b\. 
In  either  case,  we  have  expressed  a  in  the  desired  form. 

Case  2:  b  <  0  and  either  a  >  0  or  a  <  0. 

Here  —  b  >  0,  so  Theorem  4. 1 .3  and  Case  1  tell  us  that  there  exist  integers  q 
and  r,  0  <  r  <  —b  —  \b\,  such  that  a  =  q(—b)  +  r  =  ( —q)b  +  r.  Again  we 
have  expressed  a  in  the  desired  form. 

The  proof  of  uniqueness  follows  as  in  Theorem  4.1.3.  I 

Chapter  8  of  this  book  is  devoted  to  the  subject  of  algorithms.  Here,  we 
simply  remark  that  algorithm  means  “definite  procedure”  and  that  the  theorem 
known  as  the  Division  Algorithm  takes  its  name  from  the  fact  that  there  is  a 
definite  procedure  for  determining  q  and  r,  given  a  and  b.  For  a,  b  >  0,  q  —  [|J 
is  the  integer  part  of  |  and  r  —  a  —  qb,  as  illustrated  in  Problem  1.  For  possibly 
negative  a  and/or  b ,  the  procedure  is  given  in  Proposition  4.1.6. 

Verify  that  a  =  qb  +  r  in  each  of  the  following  cases  and,  in  the  process,  notice 
that  q  is  not  always  what  we  might  expect. 


a 

b 

<7 

r 

58 

17 

3 

i 

58 

-17 

-3 

i 

-58 

17 

-4 

10 

-58 

-17 

4 

10 

When  a  is  written  in  the  form  a  =  qb  +  r,  with  0  <  r  <  \b\,  we  shall  show 
that  q  is  the  floor  or  ceiling  of  |  according  as  b  is  positive  or  negative.  (The  floor 
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Proof 
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and  ceiling  functions  were  defined  in  paragraph  3.1.6.)  In  the  preceding  table,  for 
instance,  when  a  =  —58,  b  -  17,  |_— 58/17J  =  [—3.41  ...J  =  —4,  the  value 
recorded  for  q.  When  a  =  —58,  b  =  -17,  f — 58/  —  17]  =  [3.41]  =  4,  again  the 
recorded  value  of  q. 


If  a  =  qb  +  r,  with  0  <  r  <  \b\,  then 


|  Lf  J  if  *  >  0 
(rfl  if  6  <  0. 


We  consider  the  case  b  >  0  and  leave  the  other  possibility  to  the  exercises.  By 
definition,  x  —  1  <  [xj  <  x  for  any  x.  Let  x  =  |  and  k  =  [xj.  Then 


Multiplying  by  the  positive  number  b  (and  using  Property  9  of  paragraph  4.1.1), 
we  obtain  a  —  b  <  kb  <  a  and  then,  multiplying  by  —1, 


—a  <  —kb  <  —a  +  b 


(by  Property  10).  Adding  a  gives  0  <  a  -  kb  <  b  so,  letting  r  =  a— kb,  we  have 
a  =  kb  +  r  with  0  <  r  <  \b\.  By  uniqueness  of  q,  q  =  k  =  [|J,  as  asserted.  I 

With  a  =  — 1027  and  b  —  38,  we  have 
■  a  |  ,  1027  , 

L^J  =  L — =  L-27.026. . .  J  =  -28  = 

It  is  straightforward  now  to  determine  r:  r  =  a  —  qb  =  —1027  +  1064  =  37. 
Thus,  -1027  =  -28(38)  +  37.  With  a  =  1,234,567  and  b  =  -103,  we  have 
|"|1  =  [—1 1,986.087  . . .  ]  =  —1 1,986  =  q.  As  before,  we  note  that  r  =  a—  qb  = 
1,234,567  -  (-11,986)(-103)  =  9.  Thus,  1,234,567  =  — 103(—  1 1,986)  +  9.  A 

Representing  Natural  Numbers  in  Various  Bases 

As  an  application  of  some  of  the  ideas  we  have  presented  so  far,  we  discuss  briefly 
the  representation  of  natural  numbers  in  bases  other  than  the  familiar  one. 

Unless  they  have  reason  to  suspect  otherwise,  most  people  expect  that  the 
numbers  they  encounter  in  their  day-to-day  lives  are  presented  in  base  10.  The 
integer  2159,  for  example,  is  assumed  to  mean 

2  •  103  +  1  •  102  +  5  ■  10  +  9. 

The  digits  9,  5,  1,  and  2  are  called,  respectively,  the  units,  tens,  hundreds,  and 
thousands  digits  of  this  integer.  If  we  lived  in  a  base  12  world,  however,  we 
would  interpret  2159  as 


2  •  123  +  1  •  122  +  5  •  12  +  9. 
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When  we  represent  a  number  in  a  base  other  than  10,  we  shall  employ  notation 
such  as  (2159) i2-  Thus, 

(2159)i2  =  2  •  123  +  1  ■  122  +  5  •  12  +  9. 


4.1.7  DEFINITION 


Given  a  fixed  natural  number  b  >  1 ,  the  base  b  representation  of  a  natural  number 
N  is  the  expression  (an-\a,,-2  ■  ■  .  ao)b,  where  no-  «i , . . .  ,  a„  i  are  those  integers 
0  <  a,  <  b  which  satisfy  N  =  an-\bn~l  +  an-2b"~2  +  ■  ■  ■  +  a\b  +  oq.  Thus, 


(a„-ia„-2.-.ao)b=a„-]bn  ' +  a„-2bn  2  4 - \-a\b  +  ao. 


The  base  5  representation  of  1 17  is  432,  written  117  =  (432)s,  because  117  = 
4(52)  +  3(5)  +  2.  In  base  3,  117  =  (11. 100)3  because  117  =  1(34)  + 1(33)  +  1(32). 
While  10  is  the  most  familiar  base,  nowadays  bases  2,  8,  and  16  are  also  common. 
Base  2  is  also  known  as  binary ,  base  8  as  octal,  and  base  16  as  hexadecimal.  The 
basic  hexadecimal  digits  are  0-9,  A,  B,  C,  D,  E,  and  F.  For  example,  10  =  Aj6, 
11  =  B,6  and  (4C)i6  =  4- 16+ 12  =  76. 

To  convert  from  base  b  to  base  10  is  easy:  Use  the  definition  of  (. . .)/,  in 
Definition  4.1.7.  What  about  converting  from  base  10  to  base  b? 

Suppose  that  N  =  {an-\an~2  ■  ■  •  <+) )b  is  a  number  given  in  base  b.  Then 

N  =  an- 1 b"  1  +  a„_2^"  ^  +  •  ■  •  +  a\ b  +  no 

=  (an-\h"~2  +  an-2bn~i  H - h  a\)b  +  <+>  =  qob  +  a0 

where  qo  =  a„  \bn  2  +  an  ob"  3  +  •  •  •  +  at.  Since  0  <  ao  <  b,  when  N  is 
divided  by  b,  the  remainder  is  ao  and  the  quotient  is  qo-  Similarly, 

qo  =  ( an-\bn~}  +  an-2bn~4  H - f  «2 )b  +  a\  =  q\b  +  a\ 

with  q\  =  an-\bn~ 3  +  an-2b"~4  +  ■  ■  •  +  ai,  so  that  when  qo  is  divided  by  b,  the 
remainder  is  a\  and  the  quotient  is  q\.  In  a  similar  way,  we  see  that  ai  is  the 
remainder  when  q\  is  divided  by  b.  Thus,  the  digits  in  the  representation  of  N  in 
base  b  are,  from  right  to  left,  the  remainders  when  first  N ,  and  then  successive 
quotients,  are  divided  by  b. 


PROBLEM  4.  Find  the  binary,  octal,  and  hexadecimal  representations  of  the 
number  2159. 

Solution,  We  have  2159  =  1079(2)  +  1,  1079  =  539(2)  +  1,  539  =  269(2)  +  1, 
269  =  134(2)  + 1,  134  =  67(2) +0,  67  =  33(2)  + 1, 33  =  16(2)+ 1,  16  =  8(2)+0, 
8  =  4(2)  +  0,  4  =  2(2)  +  0,  2  =  2(1)  +  0  and  1  =  0(2)  +  1.  The  sequence  of 
remainders  is  1,  1,  1,  1,0,  1,  1,  0,  0,  0,  0,  1,  so,  writing  these  in  reverse  order, 
2159  =  (100,001,101,111)2. 

The  base  8  and  base  16  representations  can  be  obtained  by  similar  means,  or 
from  the  base  2  representation.  Having  determined  that  2159  =  2n  +  26  +  25  + 
23  +  22  +  2  +  1 ,  we  have 

2159  =  22(23)3  +  (23)2  +  (22  )23  +  23  +  7 

=  4(83)  +  1(82)  +  5(8)  +  7  =  (4157)8 


4.1  The  Division  Algorithm  103 


2159  =  2"  +  26  +  25  +  21 2 3  +  22  +  2  +  1 
=  23  (24)2  +  22(24)  +  2(24)  +  15 
=  8(16)2  +  6(16)  +  15  =  (86F)i6.  ■ 


PROBLEM  5.  Convert  21,469  to  octal  and  to  hexadecimal  notation. 

Solution.  We  have  21,469  =  2683(8) +  5,  2683  =  335(8) +  3,  335  =  41  (8) +  7, 
41  =  5(8)+  1  and  5  =0(8)  +  5.  Thus,  21,469  =  (51,735)g.  Similarly,  21,469  = 
1341(16)  +  13,  1341  =  83(16)  +  13,  83  =  5(16)  +  3  and  5  =  0(16)  +  5,  so 
21,469  =  (53DD)i6.  I 


Answers  to  Pauses 


1.  |  is  the  same  as  the  decimal  .333  ....  The  three  dots  . .  ”  mean  that  the  3’s 
continue  indefinitely.  We  often  write  j  =  0.3  and  say  that  A  equals  “point  3 
repeated.”  Since  0.3  is  larger  than  0.3,  its  negative,  —0.3,  is  less  than  —0.3. 
The  number  n  is  perhaps  the  most  fascinating  number  of  all.  Its  decimal 
expansion  begins  3.14159,  so  it  is  definitely  bigger  than  3.14.  As  a  deci¬ 
mal,  y  =  3.142857,  where  the  line  indicates  that  the  sequence  of  numbers 
underneath  is  repeated  indefinitely;  thus, 

y  =  3.  M2857  M2857|42857  142857  .... 

Thus,  we  see  that  n  <  y.  Unlike  |  and  y,  the  decimal  expansion  of  n 
continues  indefinitely,  without  any  kind  of  pattern. 

2.  The  set  of  negative  integers  is  not  closed  under  multiplication  since,  for 
example,  ( — 2)( — 3)  =  6  is  not  a  negative  integer.  The  set  of  natural  numbers 
is  closed  under  multiplication  since  the  product  of  natural  numbers  is  a  nat¬ 
ural  number.  The  natural  numbers  are  not  closed  under  subtraction  since,  for 
example,  4  —  10  =  —6  is  not  a  natural  number. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Use  the  properties  given  in  paragraph  4.1.1  to 
derive  a  second  distributive  law:  (a  +  b)c  =  ac  +  be 
for  any  real  numbers  a.b.c. 

2.  True  or  false?  If  false,  give  a  counterexample. 

(a)  [BB]  Subtraction  is  a  closed  operation  on  the  real 
numbers. 

(b)  Subtraction  of  real  numbers  is  commutative. 

(c)  Subtraction  of  real  numbers  is  associative. 

3.  Show  (by  means  of  a  counterexample)  that  Property  9 

of  paragraph  4.1.1  does  not  hold  if  c  <  0. 


4.  [BB]  Find  the  quotient  q  and  the  remainder  r  as  defined 
by  the  division  algorithm,  4.1.5,  in  each  of  the  following 
cases. 

(a)  a  =  500;  b  =  17 

(b)  a  =  -500;  b  =  17 

(c)  a  =  500;  b  =  -17 

(d)  a  =  -500;  b  =  -17 

5.  Find  q  and  r  as  defined  by  the  division  algorithm,  4.1.5, 
in  each  of  the  following  cases. 

(a)  a  =  5286;  6=19 

(b)  a  =  -5286;  6  =  19 


104  Chapter  4  The  Integers 


(c)  a  =  5286;  b  =  - 19 

(d)  a  =  -5286;  b  =  - 19 

(e)  a  =  19;  b  =  5286 

(f )  a  =  -19;  b  =  5286 

6.  Find  integers  q  and  r,  with  0  <  r  <  \b\,  such  that 
a  =  bq  +  /•  in  each  of  the  following  cases. 

(a)  a  =  12,345;  b  =  -39 

(b)  a  =  -27,361;  b  =  -977 

(c)  a  =  -102,497;  b  =  -1473 

(d)  a  =  98,764;  b  =  4789 

(e)  a  =  -41,391;  b  =  -755 

(f)  a  =  555,555, 123;  b  =  111,111,111 

(g)  a  =  81,538,416,000;  b  =  38,754 

7.  [BB]  Fix  a  natural  number  n  >  1  and  define  / :  Z  — >  Z 
by  setting  f  (a)  equal  to  the  quotient  when  a  is  divided  by 
n;  that  is,  f(a)  =  q  where  a  =  qn  +  r  with  0  <  r  <  n. 

(a)  Find  the  domain  and  range  of  f. 

(b)  Is  f  one-to-one? 

(c)  Is  /  onto? 

Explain  your  answers. 


8.  Suppose  n  >  1  is  a  natural  number  and  / ;  Z  — >  N  U  (0) 
is  that  function  which  associates  with  each  a  €  Z  its 
remainder  upon  division  by  n\  thus,  if  a  =  qn  +  r  with 
0  <  r  <  n.  then  f(a)  =  r. 

(a)  Find  the  domain  and  range  of  /. 

(b)  Is  /  one-to-one? 

(c)  Is  /  onto? 

Explain  your  answers. 

9.  [BB]  Complete  the  proof  of  Proposition  4.1.6  by  show¬ 
ing  that  if  a  =  qb  +  r  with  0  <  r  <  |fc|  and  b  <  0,  then 
q  =  \a/b  ]. 

10.  Find  the  binary,  octal,  and  hexadecimal  representations 
for  each  of  the  following  integers  (given  in  base  10). 

(a)  [BB]  4034 

(b)  57,483 

(c)  185,178 

11.  (a)  Suppose  the  natural  number  N  is  (a„-\  . .  .ao)b  (in 

base  b).  Prove  that  n  —  1  =  |_log;,  ?VJ  (and,  hence, 
that  N  has  1  +  Llog^  ?VJ  digits  in  base  b). 

(b)  How  many  digits  does  7254  have  in  its  base  10  rep¬ 
resentation? 

(c)  How  many  digits  does  the  number  319566  have  in 
its  base  10  representation? 


4.2  Divisibility  and  the  Euclidean  Algorithm 

When  we  say  that  one  integer  divides  another,  we  mean  “divides  exactly,”  that 
is,  with  a  remainder  of  0. 


4.2.1  DEFINITION 


Given  integers  a  and  b  with  b  ^  0,  we  say  that  b  is  a  divisor  or  a  factor  of  a 
and  that  a  is  divisible  by  b  if  and  only  if  a  =  qb  for  some  integer  q.  We  write 
b\a  to  signify  that  a  is  divisible  by  b  and  say  “ b  divides  a.” 


For  example,  3  is  a  divisor  of  18,  —7  is  a  divisor  of  35,  16  |  —64,  —4  |  38. 
(As  always,  a  slash  through  a  symbol  negates  the  meaning  of  that  symbol.  Just 
as  ^  means  “not  equal”  and  ^  means  “does  not  belong  to,”  so  \  means  “does  not 
divide”) 

Note  that  1  |  n  for  any  integer  n  because  n  —  n  ■  1. 

Show  that  n  J  0  for  any  integer  n  /  0. 


PROBLEM  6.  Given  three  consecutive  integers  a,  a  +  1,  a  +  2,  prove  that  one 
of  them  is  divisible  by  3. 

Solution.  By  the  division  algorithm,  we  can  write  a  =  3q  +  r  with  0  <  r  <  3. 
Since  r  is  an  integer,  we  must  have  r  —  0,r  —  1  or  r  —  2.  If  r  =  0,  then  a  =  3q 
is  divisible  by  3.  If  r  =  1,  then  a  =  3q  +  1,  so  a  +  2  =  3q  +  3  is  divisible  by  3. 
If  r  =  2,  then  a  =  3q  +  2,  so  o  +  1  =  3q  +  3  is  divisible  by  3.  I 
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The  following  proposition  is  elementary  but  very  important;  it  is  used  (often 
implicitly)  all  the  time. 


4.2.2  PROPOSITION 


Suppose  a,  b,  and  c  are  integers  such  that  c  \  a  and  c  \  b.  Then  c  |  (xa  +  yb)  for 
any  integers  x  and  y. 


Proof  Since  c  |  a,  we  know  that  a  =  q\c  for  some  integer  q\ .  Since  c\b,  we  also  have 
b  =  q2C  for  some  integer  qi-  Thus,  xa  +  yb  =  xq\c  +  yq2C  =  (q\x  +  qiy)c. 
Since  q\x  +  q^y  is  an  integer,  c  |  (xa  +  yb)  as  required.  I 

The  most  important  properties  of  “divides”  are  summarized  in  the  next  propo¬ 
sition. 


4.2.3  PROPOSITION 


The  binary  relation  1Z  on  N  defined  by  (a,  b)  e  7Z  if  and  only  if  a  |  b  is  a  partial 
order. 


Proof  We  have  to  show  that  the  relation  is  reflexive,  antisymmetric,  and  transitive. 
Reflexive:  For  any  a  e  N,  a  |  a  because  a  =  1  a. 

Antisymmetric:  Suppose  a,b  e  N  are  such  that  a  \  b  and  b  \  a.  Then  b  =  q\a  for 
some  natural  number  q\  and  a  —  qjb  for  some  natural  number  qi-  Thus, 
a  =  q2(q\a)  =  (q\q2)a.  Since  a  ^  0,  q\qi  =  1,  and,  since  q\  and  q2  are 
natural  numbers,  we  must  have  q\  =  q2  =  1;  thus,  a  =  b. 

Transitive:  If  a,  b,  c  e  N  are  such  that  a  \  b  and  b  |  c,  then  b  =  q\a  and  c  =  q2b 
for  some  natural  numbers  q\  and  q2 .  Thus,  c  =  q2b  =  q2(q\a)  = 
with  q\q2  an  integer.  So  a  |  c.  I 

The  proposition  says  that  (N,|)  is  a  partially  ordered  set.  In  fact,  (A,|)  is  a 
poset  for  any  set  A  of  natural  numbers.  At  this  point,  we  encourage  you  to  review 
Section  2.5  and,  in  particular,  the  terminology  associated  with  posets. 


EXAMPLE  7  Let  A  =  {1,  2,  3,  4,  5,  6}.  In  the  poset  (A,  |),  the  element  4  is  maximal  because 
there  is  no  a  €  A  satisfying  4  ]  a  except  a  =  4.  This  element  is  not  a  maximum 
however,  since,  and  for  example,  5  \  4.  Similarly,  5  and  6  are  maximal  elements 
which  are  not  maximum.  A 

Draw  the  Hasse  diagram  for  this  poset  and  find  all  minimal,  minimum,  maximal, 
and  maximum  elements. 


The  Greatest  Common  Divisor 


4.2.4  DEFINITION 


Let  a  and  b  be  integers  not  both  of  which  are  0.  An  integer  g  is  the  greatest 
common  divisor  ( gcd )  of  a  and  b  if  and  only  if  g  is  the  largest  common  divisor 
of  a  and  b\  that  is,  if  and  only  if 

1.  g\a,  g\b  and, 

2.  if  c  is  any  integer  such  that  c  \  a  and  c  j  b ,  then  c  <  g. 

We  write  g  =  gcd(o,  b )  to  signify  that  g  is  the  greatest  common  divisor  of  a 
and  b. 
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EXAMPLES  8 


4.2.5  LEMMA 


Proof 


4.2.6  EUCLIDEAN 
ALGORITHM 


•  the  greatest  common  divisor  of  15  and  6  is  3, 

.  gcd(— 24,  18)  =  6, 

.  gcd(756,  210)  =  42, 

.  gcd(— 756,  210)  =  42, 

.  gcd(— 756,  -210)  =  42.  ▲ 

If  a  and  b  are  integers  such  that  a  |  b,  what  is  the  greatest  common  divisor  of  a 
and  bl 

Suppose  a  is  a  nonzero  integer.  What  is  gcd(a,  0)? 

It  follows  almost  immediately  from  the  definition  that  two  integers,  not  both 
0,  have  exactly  one  greatest  common  divisor.  (See  Exercise  14.)  Note  also  that 
since  1  is  a  common  divisor  of  any  two  integers,  the  greatest  common  divisor  of 
two  integers  is  always  positive. 

The  seventh  book  of  Euclid’s  Elements  (300  B.C.)  describes  a  procedure  now 
known  as  the  Euclidean  algorithm  for  finding  the  gcd  of  two  integers  a  and  b. 
It  is  based  on  the  fact  that  when  a  =  qb  +  r,  then  gcd(a,  b )  =  gcd (b,  r).  For 
instance,  since  58  =  3(17)  +  7,  it  must  be  that  gcd(58,  17)  =  gcd(17,  7).  Also, 
since  75  =  3(21)  +  12,  it  must  be  that  gcd(75,  21)  =  gcd(21,  12). 

If  a  =  qb  +  r  for  integers  a,  b,  q ,  and  r,  then  gcd(a,  b)  =  gcd(b,  r). 

If  a  =  b  =  0,  then  a  =  qb  +  r  says  r  =  0.  Similarly,  if  b  =  r  =  0,  then  a  =  0. 
In  either  case,  the  result  is  true  since  neither  gcd(n,  b)  nor  gcd(b,  r)  is  defined. 
Thus,  it  remains  to  consider  the  case  that  g\  =  gcd(n,  b)  and  g2  =  gcd (b,  r)  are 
both  well  defined  integers.  First,  g2  \  b  and  #2  |  A  so  g2  \  (qb  +  r);  that  is,  g2  \  a. 
Thus,  g2  is  a  common  divisor  of  a  and  b  and,  since  g\  is  the  greatest  common 
divisor  of  a  and  b,  we  have  g2  <  gi- 

On  the  other  hand,  since  gi  |  a  and  gi  |  b,  we  know  that  g \  |  («  —  qb)\  that  is, 
g\  |  r.  As  a  common  divisor  of  b  and  r,  it  cannot  exceed  the  gcd  of  these  numbers. 
Thus,  gi  <  g2,  so  gi  =  g2  as  desired.  I 

The  Euclidean  algorithm  involves  nothing  more  than  a  repeated  application 
of  this  lemma. 

Let  a  and  b  be  natural  numbers  with  b  <  a.  To  find  the  greatest  common  divisor 
of  a  and  b,  write 

a  =  q\b  +  r\,  with  0  <  r\  <  b. 

If  r\  ^  0,  write  b  —  <720  +  r2,  with  0  <  r2  <  r\ . 

If  n  7^  0,  write  r\  =  <73 r 2  +  >'3,  with  0  <  <  n. 

If  /-3  34  0,  write  r2  =  q 4^3  +  r 4,  with  0  <  r\  <  r^. 

Continue  this  process  until  some  remainder  />+i  =  0.  Then  the  greatest 
common  divisor  of  a  and  b  is  the  last  nonzero  remainder. 


Why  does  this  process  work,  and  why  must  some  remainder  be  0?  The  answer 
to  the  second  question  is  that  either  r\  =  0  or  the  set  of  nonzero  remainders  is 
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a  nonempty  set  of  natural  numbers  and  hence  has  a  smallest  element,  rk,  by  the 
Well-Ordering  Principle.  So  the  next  remainder,  being  smaller  than  rk,  must  be  0. 
It  follows  by  Lemma  4.2.5  that  rk  is  the  gcd  of  a  and  b  because 

gcd(a,  b)  =  gcd (b,  r\)  =  gcd(n,  r2) 

=  gcd(r2,  r3)  =  •  •  ■  =  gcd (rk,  rk+ 1)  =  gcd(r*,  0)  =  rk. 


PROBLEM  9.  Find  the  greatest  common  divisor  of  630  and  196. 

Solution.  We  have  630  =  3(196)  +  42 
196  =  4(42) +  28 
42  =  1(28)  +  14 
28  =  2(14) +  0. 

The  last  nonzero  remainder  is  14,  so  this  is  gcd(630,  196).  I 


Let  us  write  down  these  same  equations  in  a  different  way,  expressing  each 
of  the  remainders — r\  =  42,  r2  =  28,  r3  =  14 — in  terms  of  a  =  630  and  b  =  196. 
We  have 

42  =  630-3(196)  =  a  -  3b 

28  =  196  -  4(42) 

(1)  =  b  —  4r\  =  b  —  4  (a  —  3  b)  =  —4a  +  13  b 

14  =  42  -  28 

=  r\  —  r2  =  (a  —  3  b)  —  (—4  a  +  13b)  =  5a  —  16b. 

Recording  only  the  remainders  and  the  coefficients  a  and  b,  these  equations 
can  be  neatly  coded  by  the  array 

42  1  -3 
28  -4  13 
14  5  -16. 

Adding  to  the  top  of  this  array  the  rows 

630  1  0 
196  0  1, 

which  correspond  to  the  equations  630  =  la  +  0b  and  196  =  0a  +  lb,  respectively, 
we  obtain  the  array 


630 

1 

0 

196 

0 

1 

42 

1 

-3 

28 

-4 

13 

14 

5 

-16, 

which  is  easy  to  remember.  Each  row  after  the  first  two  is  of  the  form  x  —  qy, 
where  x  and  y  are  the  two  rows  preceding  it  and  q  is  the  quotient  when  the  first 
number  in  x  is  divided  by  the  first  number  in  y.  When  630  is  divided  by  196,  the 
quotient  is  3,  so  the  third  row  is 

(630  1  0)  -  3(196  0  1)  =  42  1  -  3. 
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When  196  is  divided  by  42,  the  quotient  is  4,  so  the  fourth  row  is 

(196  0  1)  -  4(42  1  -  3)  =  28  -  4  13. 

When  42  is  divided  by  28,  the  quotient  is  1 ;  so  the  last  row  is 

(42  1  -  3)  -  (28  -  4  13)  =  14  5  -  16. 

Since  the  last  remainder,  14,  divides  the  previous  remainder,  28,  the  next  remain¬ 
der  is  0.  So  gcd(630,  196)  =  14,  the  last  nonzero  remainder. 

Here  is  another  example  of  this  procedure. 

PROBLEM  10.  Find  gcd(  1 800,  756) . 

Solution. 


1800 

1 

0 

756 

0 

1 

288 

1 

-2 

180 

-2 

5 

108 

3 

-7 

72 

-5 

12 

36 

8 

-19 

Since  the  last  nonzero  remainder  is  36.  gcd(1800,  756)  =  36.  I 


Nonzero  integers  are  relatively  prime  if  and  only  if  their  greatest  common  divisor 
is  1;  in  other  words,  if  and  only  if  1  is  the  only  positive  integer  which  divides 
both  the  given  integers. 

Show  that  17,369  and  5472  are  relatively  prime. 

There  is  a  very  important  property  of  the  greatest  common  divisor  which 
we  have  so  far  overlooked.  As  seen  most  clearly  in  equations  (1),  each  remain¬ 
der  which  arises  in  the  course  of  applying  the  algorithm  is  an  integral  linear 
combination  of  the  given  two  numbers  a  and  b;  that  is,  each  remainder  can  be 
written  in  the  form  ma  +nb  for  integers  m  and  n.  In  particular,  the  greatest  com¬ 
mon  divisor  of  a  and  b,  being  the  last  nonzero  remainder,  is  an  integral  linear 
combination  of  a  and  b.  For  example,  as  we  saw  in  (1),  gcd(630,  196)  =  14  = 
5 a  —  16 b  =  5(630)  —  16(196).  Similarly,  the  calculations  of  Problem  10  show 
that  gcd(1800,  756)  =  36  =  8(1800)  -  19(756). 

In  Pause  7  we  asked  you  to  show  that  the  integers  17,369  and  5472  are  relatively 
prime.  Find  integers  m  and  n  so  that  17,369w  +  5472n  =  1. 

Since,  by  definition,  the  greatest  common  divisor  of  two  integers  a  and  b ,  which 
are  not  both  0,  is  a  positive  number,  it  is  clear  that  gcd(n,  b)  =  gcd(|n|.  \b\).  For 
example,  gcd(15,  —36)  =  gcd(15,  36)  =  3  and  gcd(— 18,  —99)  =  gcd(  1 8,  99)  = 
9.  Since  |a|  and  \b\  are  natural  numbers,  the  Euclidean  algorithm  in  fact  can  be 
used  to  find  the  greatest  common  divisor  of  any  pair  of  nonzero  integers  a  and  b. 

We  have  been  illustrating  through  examples  a  general  fact:  The  greatest  com¬ 
mon  divisor  of  two  natural  numbers  a  and  b  is  a  linear  combination  of  a  and  b.  If 
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4.2.9  THEOREM 


4.2.10  COROLLARY 


Proof 


4.2.11  COROLLARY 


Proof 


a  =  0  and  b  >  1,  then  gcd(a,  b)  =  b  =  0(a)  +  1(b)  is  also  a  linear  combination 
of  a  and  b.  Using  our  remark,  it  is  clear  that  if  gcd(|a|,  \b\ )  is  a  linear  combi¬ 
nation  of  |a|  and  \b\,  then  so  is  gcd(a,  b).  For  example,  since  gcd(1800,  756)  = 
36  =  8(1800)  -  19(756),  gcd(1800,  -756)  =  36  =  8(1800)  +  19(— 756).  The 
following  theorem,  the  proof  of  which  follows  by  noting  that  each  remainder  in 
the  Euclidean  algorithm  is  a  linear  combination  of  the  previous  two,  describes  the 
most  important  property  of  the  greatest  common  divisor.  (See  also  Exercise  24.) 

The  greatest  common  divisor  of  integers  a  and  b  is  an  integral  linear  combination 
of  them;  that  is,  if  g  =  gcd(a,  b),  then  there  are  integers  m  and  n  such  that 
g  =  ma  +  nb. 

The  corollaries  which  follow  illustrate  ways  in  which  this  theorem  is  used. 

Suppose  a ,  b  and  x  are  integers  such  that  a\bx.  If  a  and  b  are  relatively  prime, 
then  a\x. 

We  know  that  there  are  integers  m  and  n  so  that  ma+nb  =  1  [because  gcd(a,  b)  = 
1].  Multiplying  by  jc,  we  obtain  max  +  nbx  =  x.  But  a  |  max  and  a  |  nbx 
because  nbx  =  n(bx)  and  we  are  given  that  a  \bx.  Thus,  a  divides  the  sum 
max  +  nbx  =  x.  I 

The  greatest  common  divisor  of  nonzero  integers  a  and  b  is  divisible  by  every 
common  divisor  of  a  and  b. 

Let  g  =  gcd(a,  b).  By  Theorem  4.2.9,  g  =  ma  +  nb  for  integers  m  and  n.  Thus, 
by  Proposition  4.2.2,  if  c  is  a  common  divisor  of  a  and  b,  then  c  I  g.  I 


PROBLEM  11.  Suppose  a,  b  and  c  are  three  nonzero  integers  with  a  and  c 
relatively  prime.  Show  that  gcd(a,  be)  =  gcd(a,  b). 

Solution.  Let  gi  =  gcd(a,  be)  and  g2  =  gcd(a,  b).  Since  g2  \  b,  we  know  that 
g2  j  be.  Since  also  g2  \  a ,  we  have  g2  <  g i-  On  the  other  hand  (and  just  as 
in  Corollary  4.2.10),  since  gcd(a,c)  =  1,  there  are  integers  m  and  n  such  that 
ma  +  nc  =  1.  Multiplying  by  b,  we  obtain  mab  +  nbc  =  b.  Since  gi  |  a  and 
gi  |  be,  it  must  also  be  the  case  that  g\  |  b.  Then,  since  gi  |  a,  gi  <  g2 ■  Therefore, 
gl  =  g2-  ■ 


The  Least  Common  Multiple 

In  paragraph  2.5.6,  the  greatest  lower  bound  of  two  elements  a  and  b  in  a  partially 
ordered  set  (A,  ;<)  was  defined  to  be  an  element  g  =  a  A b  e  A  with  the  properties 

1.  g  ^  a,  g  <  b,  and 

2.  if  c  <  a  and  c  <  b  for  some  c  e  A,  then  c  <  g. 

Corollary  4.2.11  therefore  shows  that  every  pair  of  natural  numbers  has  a  gib  in 
the  poset  (N,|),  namely,  their  greatest  common  divisor:  For  a,b  G  N, 

a  A  b  —  gcd(a,  b). 
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EXAMPLES  12 


EXAMPLES  13 


4.2.13  PROPOSITION 


It  is  also  true  that  every  pair  of  natural  numbers  has  a  least  upper  bound. 


If  a  and  b  are  nonzero  integers,  we  say  that  £  is  the  least  common  multiple  (1cm) 
of  a  and  h  and  write  t  =  lcm(a,  b)  if  and  only  if  £  is  a  positive  integer  satisfying 

1.  a  |  £,  b  |  £  and, 

2.  if  m  is  any  positive  integer  such  that  a  | m  and  b\m,  then  i  <  m. 

•  The  least  common  multiple  of  4  and  14  is  28. 

.  lcm(-6,  21)  =  42. 

.  lcm(— 5,  -25)  =  25.  ▲ 


Since  \ab\  is  a  common  multiple  of  a  and  b,  the  least  common  multiple  always 
exists  and  does  not  exceed  \ab\.  Remember  also  that  a  least  common  multiple  is 
always  positive  (by  definition). 

In  the  exercises  (see  also  Exercise  29  of  Section  4.3),  we  ask  you  to  derive 
the  formula 


(2) 


gcd(a,  b)  lcm(a,  b )  =  \ab\. 


which  holds  for  any  nonzero  integers  a  and  b  and  gives  a  quick  way  to  compute 
least  common  multiples. 


Since  gcd(6,  21)  =  3,  it  follows  that  lcm(6,  21)  =  =  42. 

Since  gcd(630,  —196)  =  14  (Problem  9),  it  follows  that  lcm(630,  -196)  = 
630[‘96)  =  8820.  A 


Just  as  the  greatest  common  divisor  of  a  and  b  is  divisible  by  all  common 
divisors  of  a  and  b,  we  can  show  that  the  least  common  multiple  of  a  and  b  is 
a  divisor  of  all  common  multiples  of  a  and  b  (see  Exercise  27).  Thus,  the  least 
common  multiple  of  natural  numbers  a  and  b  is  their  least  upper  bound  in  the 
poset  (N,  | )  (see  again  paragraph  2.5.6).  For  a,b  e  N, 

a  v  b  =  lcm(a,  b). 

Remembering  that  a  lattice  is  a  partially  ordered  set  in  which  every  two  elements 
have  a  greatest  lower  bound  and  a  least  upper  bound,  the  following  proposition 
is  immediate. 


The  poset  (N,|)  is  a  lattice. 

The  Lattice  of  Divisors  of  a  Natural  Number 

Implicit  in  the  definition  of  lattice  is  the  fact  that  the  greatest  lower  bound  and  least 
upper  bound  of  every  pair  of  elements  should  lie  in  the  set.  The  poset  ({2,  3,  4} ,  | ), 
for  instance,  is  not  a  lattice  because  while,  for  example,  2  and  3  have  a  gib  in  N, 
this  element  is  not  in  {2,  3,  4}. 

We  have  seen  that  (N,|)  is  a  lattice.  Also,  for  many  subsets  A  of  N.  (A,|)  is 
a  lattice.  For  example,  if  n  is  any  natural  number  and  A  =  {d  e  N  |  d  j  n]  is  the 
set  of  positive  divisors  of  n,  then  (A,|)  is  a  lattice.  (See  Exercise  33.) 
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EXAMPLE  14 


Figure  4.2 


Answers  to  Pauses 


With  A  =  {1,  2,  3,  5,  6,  10,  15,  30),  the  set  of  positive  divisors  of  n  =  30,  (A,|) 
is  a  lattice  whose  Hasse  diagram  is  shown  in  Fig  4.2.  Notice  that  every  pair  of 
elements  of  A  has  a  gib  and  a  lub.  For  every  pair  of  elements  a,  b  e  A,  a  A  b  is 
the  unique  element  below  (and  connected  with  lines  to)  both  a  and  b  and  a  v  b 
is  the  unique  element  above  (and  connected  with  lines  to)  both  a  and  b.  A 


30 


The  Hasse  diagram  for  (A,|),  where  A  is  the  set  of  divisors  of  30.  Compare  with  Fig  2.4,  p.  65. 


3.  For  any  integer  n,  we  have  0  =  qn  for  the  integer  q  —  0. 

4.  Since  1  |  a  for  all  a  €  A,  no  a  ^  1  can  be  min¬ 
imal.  The  only  minimal  element  is  1,  and  it  is 
a  minimum.  The  elements  4,  5,  and  6  are  maxi¬ 
mal.  For  instance,  4  is  maximal  because  there  is 
no  a  e  A  such  that  4  |  a,  except  a  =  4.  There 
are  no  maximum  elements;  for  example,  4  is  not 
maximum  because  a  |  4  for  all  a  €  A  is  not  true. 

5.  The  largest  divisor  of  a  is  |o|.  Since  |a|  is  a  common  divisor  of  a  and  b,  this 
must  be  their  gcd. 

6.  Since  a  is  a  common  divisor  of  a  and  0,  the  greatest  common  divisor  of  these 


numbers 

is  \a\. 

17,369 

1 

0 

5472 

0 

1 

953 

1 

-3 

707 

-5 

16 

246 

6 

-19 

215 

-17 

54 

31 

23 

-73 

29 

-155 

492 

2 

178 

-565 

1 

-2647 

8402 

The  last  nonzero  remainder — 1  in  this  case — is  the  gcd.  So  17,369  and  5472 
are  relatively  prime. 

8.  1  =  -2647(17,369)  +  8402(5472). 


4  6  5 
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EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  We  have  seen  in  this  section  that  (N,  |)  is  a  partially 
ordered  set. 

(a)  Is  it  totally  ordered? 

(b)  Does  it  have  a  maximum?  A  minimum? 

Explain  your  answers. 

2.  Consider  the  partially  ordered  set  ({2,4,  6,  8},|). 

(a)  Explain  why  every  pair  of  elements  in  this  poset  has 
a  greatest  lower  bound. 

(b)  Does  every  pair  of  elements  have  a  least  upper 
bound? 

(c)  Is  the  poset  a  lattice? 

Explain  your  answers. 

3.  Draw  the  Hasse  diagrams  for  each  of  the  following  par¬ 
tially  ordered  sets. 

(a)  [BB]  ({2,  3, 4,  5,  6,  7],  |) 

(b)  ((1,2,  3,  4,  5,  6.  7,  8,  9,  10], |) 

4.  List  all  minimal,  minimum,  maximal,  and  maximum  ele¬ 
ments  for  each  of  the  posets  in  Exercise  3. 

5.  [BB]  Let  n  be  a  natural  number.  Given  n  consecutive 
integers,  a.  a  +  1 ,  a  +  2,  ...  ,  a  +n  —  1 ,  show  that  one 
of  them  is  divisible  by  n. 

6.  [BB]  Prove  that  n 2  —  2  (n  an  integer)  is  never  divisible 
by  4. 

7.  Given  that  a  and  x  are  integers,  a  >  1,  a  I  (I  lx  +  3), 
and  a  |  (55.v  +  52),  find  a. 

8.  [BB]  Suppose  a  and  b  are  integers  with  the  same  remain¬ 
der  upon  division  by  some  natural  number  n.  Prove  that 
n  [  (a  —  b). 

9.  True  or  false?  In  each  case,  justify  your  answer  with  a 
proof  or  a  counterexample  (all  variables  represent  inte¬ 
gers). 

(a)  [BB]  if  a  |  b  and  b  |  — c,  then  a  \  c. 

(b)  if  a  |  b  and  c  |  b ,  then  ac  |  b. 

(c)  if  a  |  b  and  a  j  c,  then  a  |  be. 

(d)  if  a  |  b  and  c  [  d.  then  ac  |  bd. 

i  |  b  ,  |  b 

(e)  if  alb  and  cl-,  then  c  b  and  a  — . 

a  c 


10.  Suppose  a  and  b  are  relatively  prime  integers  and  c  is 
an  integer  such  that  a  |  c  and  b  |  c.  Prove  that  ab  |  c. 

11.  In  each  of  the  following  cases,  find  the  greatest  common 
divisor  of  a  and  h  and  express  it  in  the  form  ma  +  nb 
for  suitable  integers  in  and  n. 

(a)  [BB]  a  =  93.  b  =  1 19 

(b)  [BB]  a  =  -93,  b  =  1 19 

(c)  [BB]  a  =  -93,  b  =  -1 19 

(d)  a  =  1575,  b  =  231 

(e)  a  =  1575,  b  =  -231 

(f)  a  =  -1575,  b  =  -231 

(g)  a  =  -3719,  b  =  8416 

(h)  a  =  100,996,  b  =  20,048 

(i)  a  =  28,844,  b  =  -15,712 

(j)  a  =  12,345,  b  =  54,321 

12.  Which  of  the  pairs  of  integers  in  Exercise  1 1  are  rela¬ 
tively  prime? 

13.  [BB]  If  a  and  b  are  relatively  prime  integers,  prove  that 
gcdffl  +  b,  a  —  b)  =  I  or  2. 

14.  [BB]  Prove  that  integers  a  and  b  can  have  at  most  one 
greatest  common  divisor. 

15.  Prove  that  for  integers  a  and  b,  ged (a,  a  +  b)  = 
gcd(«.  b).  [Hint:  Mimic  the  proof  of  Lemma  4.2.5.] 

16.  (a)  [BB]  Find  a  pair  of  integers  x  and  y  such  that 

1 7,369x  +  5472y  =  4.  (See  Pause  8.) 

(b)  Find  integers  x  and  y  such  that  154x  +  260y  =  4. 

(c)  [BB]  Show  that  no  integers  x,  y  satisfy  154x  4- 
260y  =  3. 

(d)  Show  that  no  integers  x,  y  satisfy  196x+260y  =  14. 

17.  (a)  [BB]  Given  integers  d,  x  and  y,  suppose  there  exist 

integers  m  and  n  such  that  d  =  mx  +ny.  Prove  that 
gcd(x,  y)  \d. 

(b)  Is  the  converse  of  (a)  true?  If  gcd(x,y)  j  d.  need 
there  exist  integers  m  and  n  such  that  d  =  mx+ny ? 

18.  If  k  €  N.  prove  that  gcd(3£  +  2,  5k  +  3)  =  1 . 

19.  [BB]  Let  a.  b,  c  €  N.  Prove  that  gcd(c/e,  be)  = 
c(gcd(o,  b)). 

20.  If  a  e  N.  prove  that  gcd(a,  a  4-  2)  =  J  '  a  Is 

F  [  2  if  a  is  even. 

21.  [BB]  Prove  that  gcd(/7,  n  +  1)  =  1  for  any  n  e  N.  Find 
integers  x  and  y  such  that  nx  +  (/;+!  )y  =  1 . 
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22.  Prove  that  if  a,  b,  and  c  are  natural  numbers,  gcd(a,  c)  = 
1  and  b  |  c,  then  gcd(a,  b)  =  1. 

23.  Let  n  and  5  be  positive  integers  and  suppose  k  is  the 
least  positive  integer  such  that  n  |  ks.  Prove  that  k  = 

n 

gcd(«,  s) 

24.  [BB]  Use  the  Well-Ordering  Principle  to  prove  The¬ 
orem  4.2.9.  [Hint:  Consider  the  set  of  positive  linear 
combinations  of  a  and  b.] 

25.  [BB]  Find  lcm(63,  273)  and  lcm(56,  200).  [Hint:  An 
easy  method  takes  advantage  of  formula  (2)  of  this 
section.] 

26.  [BB;  (a)]  Find  the  1cm  of  each  of  the  pairs  of  integers 
given  in  Exercise  1 1 . 

27.  Let  a  and  b  be  nonzero  integers  with  lcm(a,  b )  =  l.  Let 
m  be  any  common  multiple  of  a  and  b.  Prove  that  t\  m. 

28.  [BB]  Prove  that  gcd(a,i>)  |  lcm(a,jb)  for  any  nonzero 
integers  a  and  b. 

29.  Establish  formula  (2)  of  this  section  by  proving  that  the 
least  common  multiple  of  nonzero  integers  a  and  b  is 
\ab\/ gcd{a.  b). 

30.  (a)  [BB]  Let  x  be  an  integer  expressed  in  base  10.  Sup¬ 

pose  the  sum  of  the  digits  of  x  is  divisible  by  3. 
Prove  that  x  is  divisible  by  3. 

(b)  [BB]  Prove  that  if  an  integer  x  is  divisible  by  3, 
then,  when  written  in  base  10,  the  sum  of  the  digits 
of  .v  is  divisible  by  3. 

(c)  Repeat  (a)  and  (b)  for  the  integer  9. 


4.2.14  DEFINITION 


The  greatest  common  divisor  of  n  integers  a\ ,  a2,  . . .  ,a„, 
not  all  zero,  is  a  number  g  which  is  a  common  divisor 
of  these  integers  (that  is,  g  \  a\,  g  |  (12, . . .  ,  g  \  a„ )  and 
which  is  the  largest  of  all  such  common  divisors  (that  is, 
if  c  j  a\,c  |  a2,  . . .  ,  c  |  a„ ,  then  c  <  g).  It  is  denoted 
gcd(ai - ,an). 

31.  (a)  Suppose  a.b.c  are  nonzero  integers.  Show  that 
gcd(«,  b ,  c)  =  gcd(gcd(a,  b),  c). 

(b)  Show  that  the  gcd  of  nonzero  integers  a.b.c  is  an 
integral  linear  combination  of  them. 

(c)  Find  gcd(105, 231.  165)  and  express  this  as  an 
integral  linear  combination  of  the  given  three 
integers. 


(d)  Answer  (c)  for  the  integers  6279,  8580,  and  2873. 

(e)  Answer  (c)  for  the  integers  5577,  18,837,  and 
25,740. 

32.  Suppose  that  (A],  ^1)  and  (A2,  <2)  are  partial  orders. 

(a)  Show  that  the  definition 

(x\ .  x2)  <  (y  1 ,  y2)  +*  x\  y\  and  x2  ^2  >’2 

for  (jci,  x2),  (y\,  y2)  €  A  1  x  A2  makes  A 1  x  A 2  a 
partially  ordered  set. 

(b)  Let  A\  =  Aj  =  [2,  3,  4}.  Assign  to  A\  the  partial 
order  <  and  to  A2  the  partial  order  | .  Partially  order 
Aj  x  A2  as  in  part  (a).  Show  all  relationships  of  the 
form  {x],x2)  ^  (yi,y2). 

(c)  Draw  the  Hasse  diagram  for  the  partial  order 
in  (b). 

(d)  Find  any  maximal,  minimal,  maximum,  and  mini¬ 
mum  elements  which  may  exist  in  the  partial  order 
of  (b). 

(e)  With  A 1  and  A2  as  in  part  (b),  find  those  gibs  and 
lubs  which  exist  for  each  of  the  following  pairs  of 
elements. 

i.  (2,2),  (3.3) 

ii.  (4,  2),  (3,  4) 

Hi.  (3.2),  (2,4) 
iv.  (3,  2),  (3,  4) 

(f)  Show,  by  example,  that  if  (A  1,^1)  and  (A2,  <2) 
are  total  orders,  then  ( A 1  x  A2,  <)  need  not  be  a 
total  order. 

33.  (a)  Let  n  be  a  natural  number,  n  >  1 .  Let  A  =  {a  e  N  | 

a\n}.  Prove  that  (A.|)  is  a  lattice. 

(b)  [BB]  Draw  the  Hasse  diagram  associated  with  the 
poset  given  in  (a)  for  n  =  12. 

(c)  Draw  the  Hasse  diagram  associated  with  the  poset 
given  in  (a)  for  n  =  36. 

(d)  Draw  the  Hasse  diagram  associated  with  the  poset 
given  in  (a)  for  n  =  90. 

34.  (a)  [BB]  Let  g  be  the  greatest  common  divisor  of 

integers  m  and  n,  not  both  0.  Show  that  there 
are  infinitely  many  pairs  a,b  of  integers  such  that 
g  =  am  +  bn. 

(b)  Suppose  m  and  n  are  relatively  prime  integers  each 
greater  than  1 . 

i.  Show  that  there  exist  unique  integers  a ,  b  with 
0  <  b  <  m  such  that  am  +  bn  =  1. 

ii.  Show  that  there  exist  unique  integers  a.  b  with 
0  <  a  <  n  and  0  <  b  <  m  such  that  am  = 
bn  +  1. 
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4.3  Prime  Numbers 

In  Section  4.2,  we  saw  what  it  means  to  say  that  one  integer  “divides”  another. 
The  basic  building  blocks  of  divisibility  are  the  famous  “prime  numbers”.  We 
examine  some  of  the  properties  of  prime  numbers  in  this  section. 


4.3.1  DEFINITION 


A  natural  number  p  >  2  is  called  prime  if  and  only  if  the  only  natural  numbers 
which  divide  p  are  1  and  p.  A  natural  number  n  >  1  which  is  not  prime  is  called 
composite.  Thus,  n  >  I  is  composite  if  it  =  ab  where  a  and  b  are  natural  numbers 
with  1  <  a,b  <  n. 


Thus,  2,  3,  and  5  are  primes  while  4,  6,  and  10  are  composite.  Note  that 
the  natural  number  1  is  neither  prime  nor  composite.  It  is  interesting  to  note  that 
exactly  one  quarter  of  the  numbers  between  1  and  100  (inclusive)  are  prime.  These 
primes  are  shown  in  Table  4.1. 


Table  4.1  The  primes 
less  than  100. 


2 

3 

5 

7 

11 

13 

17 

19 

23 

29 

31 

37 

41 

43 

47 

53 

59 

61 

67 

71 

73 

79 

83 

89 

97 

You  may  have  needed  more  than  a  few  seconds  to  verify  that  some  of  the 
entries  in  Table  4.1  are,  in  fact,  prime.  Such  verification  gets  increasingly  more 
difficult  as  the  numbers  get  bigger.  One  of  the  great  challenges  of  mathematics 
and  an  active  area  of  research  today  is  that  of  finding  efficient  algorithms  for 
checking  whether  or  not  large  integers  are  primes.  By  the  end  of  1988,  the  largest 
known  prime  number  was  2216,091  —  1.  This  was  discovered  in  1985  by  computer 
scientists  working  at  Geosciences  Corporation.  Houston,  who  required  just  three 
hours  on  a  Cray  X-MP  computer  to  verify  that  this  number  was,  indeed,  prime. 
Not  everyone  was  impressed.  A  vice  president  of  Chevron  Oil  was  apparently 
quoted  as  saying,  “The  results  are  interesting,  if  true,  but  they  are  certainly  not 
going  to  help  me  find  oil.” 

The  discovery  of  new  primes  is  viewed  as  both  a  test  of  humankind’s  ingenuity 
and  the  reliability  of  new  computers,  so,  when  a  new  prime  is  discovered,  it 
is  widely  publicized.  Early  in  1992,  David  Slowinski  and  Paul  Gage  of  Cray 
Research  Inc.  announced  that  2756,839  —  1  is  prime.  Then  in  1994  the  same  people 
announced  the  primality  of  2859’433  —  1,  a  number  with  a  mere  258,716  digits. 
On  November  13,  1996  Joel  Armengaud.  a  29-year-old  computer  programmer 
working  in  France,  discovered  that  21-398  269  -  1  is  prime.  A  year  later,  Gordon 
Spence  of  Hampshire,  England,  with  a  Pentium  PC  running  a  program  downloaded 
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over  the  World  Wide  Web,2  found  that  22,976,221  —  1  is  prime.  This  number,  with 
895,932  digits,  would  fill  a  450-page  book  all  by  itself!  New  primes  are  currently 
being  discovered  at  such  a  rate  that  it  is  difficult  for  the  authors  of  textbooks 
to  keep  pace.  In  June  1999,  using  the  “Lucas-Lehmer  Test,”  Nayan  Hajratwala 
discovered  that  26,972,59-  —  1,  a  number  with  2,098,960  digits,  was  prime,  and  a 
year  later  this  number  had  not  been  beaten.  The  search  for  the  next  largest  prime 
presents  a  challenge  that  will  never  end  because  it  has  been  known  since  the  time 
of  Euclid  that  the  number  of  primes  is  infinite.  To  prove  this  fact,  we  first  need  a 
short  lemma. 


Given  any  natural  number  n  >  1,  there  exists  a  prime  p  such  that  p  n. 


Proof  We  give  a  proof  by  contradiction.  Thus,  we  suppose  the  lemma  is  false.  In  this 
case,  the  set  of  natural  numbers  greater  than  1  which  are  not  divisible  by  any 
prime  is  not  empty.  By  the  Well-Ordering  Principle,  the  set  contains  a  smallest 
element  m.  Since  m\m,  but  m  is  not  divisible  by  any  prime,  m  itself  cannot 
be  prime,  so  m  is  divisible  by  some  natural  number  a ,  with  1  <  a  <  m.  By 
minimality  of  m,  a  must  be  divisible  by  a  prime  p.  Since  p  I  a  and  a  \m,  we  have 
p  |  m,  a  contradiction.  I 

Now  we  can  easily  show  that  there  are  an  infinite  number  of  primes.  The 
wonderfully  simple  argument  we  present  has  been  known  since  300  B.C.  and  is 
commonly  attributed  to  Euclid  since  it  appears  in  Euclid’s  Elements.  It  is  a  model 
of  a  proof  by  contradiction. 


4.3.3  THEOREM 


There  are  infinitely  many  primes. 


Proof  If  the  theorem  is  not  true,  there  are  just  a  finite  number  of  primes  p\,  pi,  ...  , 
pt.  Let  n  =  (p\  p2  ■  ■  ■  Pi)  +  1.  By  Lemma  4.3.2,  n  is  divisible  by  some  prime 
and  hence  by  some  p,.  Since  piP2--pr  is  also  divisible  by  Pi,  the  number 
n  —  (p\p2  ■  ■  •  Pt)  must  also  be  divisible  by  p, .  Thus,  1  is  divisible  by  p,-,  a 
contradiction.  I 


Although  the  number  of  primes  is  infinite,  the  number  of  natural  numbers  which 
are  not  prime  is  infinite  too.  Find  an  easy  way  to  see  this. 


We  now  return  to  the  question  of  determining  whether  or  not  a  given  integer 
is  prime.  Several  elementary  observations  can  be  made  at  the  outset.  Any  even 
integer  is  divisible  by  2  and  so  is  not  prime  (unless  it  equals  2).  Similarly,  any 
integer  larger  than  5  whose  units  digit  is  0  or  5  is  divisible  by  5  and  hence  not 
prime.  If  n  is  not  prime.  Lemma  4.3.2  tells  us  that  n  is  divisible  by  some  prime. 
So,  to  verify  that  n  is  not  prime,  it  is  enough  just  to  test  the  prime  numbers  less 
than  n  (rather  than  all  numbers  less  than  n)  when  searching  for  a  divisor  of  n. 

Here  is  another  fact  which  decreases  considerably  the  amount  of  testing  that 
must  be  done  when  checking  for  divisors  of  an  integer. 


2The  World  Wide  Web  is  full  of  information  about  large  primes.  For  instance,  there  is  lots  of  up-to-the- 
minute  news  in  “The  Prime  Pages:”  http://www.utm.edu/research/primes/. 
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4.3.4  LEMMA 


Proof 


4.3.5  THE  SIEVE  OF 
ERATOSTHENES 


EXAMPLE  15 


If  a  natural  number  n  >  1  is  not  prime,  then  n  is  divisible  by  some  prime  number 
p  <  yfn. 

Since  n  is  not  prime,  n  can  be  factored  n  =  ab  with  1  <  a  <  b  <  n.  Then 
a  <  y/ n  since  otherwise,  ab  >  y/tiy/n  =  n,  a  contradiction.  As  a  natural  number 
greater  than  1,  it  follows  that  a  is  divisible  by  some  prime  p.  Since  p  \  a  and  a\n, 
p  | n  by  transitivity.  Also,  since  a  <  y/n  and  p\a,  we  must  have  p  <  y/n.  Thus, 
p  is  the  desired  prime  factor  of  n.  I 

As  a  consequence  of  Lemma  4.3.4,  when  testing  a  natural  number  n  for 
primality,  we  need  only  consider  the  possibility  of  a  prime  divisor  less  than 
or  equal  to  the  square  root  of  n.  For  example,  to  verify  that  97  is  prime,  we 
need  only  check  the  prime  numbers  less  than  or  equal  to  \/97.  Since  none  of 
2,  3,  5,  7  is  a  divisor  of  97,  we  are  assured  that  97  is  a  prime  number.  It  is 
apparent  that  Lemma  4.3.4  reduces  dramatically  the  work  involved  in  a  primality 
check. 

One  simple  procedure  which  goes  back  over  2000  years  for  finding  all  the 
primes  less  than  or  equal  to  some  given  integer  is  named  after  the  Greek  Eratos¬ 
thenes  (ca.  200  B.C.),  chief  librarian  of  the  great  library  at  Alexandria  and  a 
contemporary  of  Archimedes.  Reportedly,  Eratosthenes  was  the  first  person  to 
estimate  the  circumference  of  the  earth. 

Who  was  Archimedes? 


To  find  all  the  prime  numbers  less  than  or  equal  to  a  given  natural  number  n, 

•  list  all  the  integers  from  2  to  n, 

•  circle  2  and  then  cross  out  all  multiples  of  2  in  the  list, 

•  circle  3,  the  first  number  not  yet  crossed  out  or  circled,  and  then  cross  out  all 
multiples  of  3, 

«  circle  5,  the  first  number  not  yet  crossed  out  or  circled,  and  cross  out  all 
multiples  of  5, 

•  at  the  general  stage,  circle  the  first  number  which  is  neither  crossed  out  nor 
circled  and  cross  out  all  its  multiples, 

•  continue  until  all  numbers  less  than  or  equal  to  yfn  have  been  circled  or 
crossed  out. 

When  the  procedure  is  finished,  those  integers  which  are  not  crossed  out  are  the 
primes  not  exceeding  n. 

As  an  example  of  this  procedure,  we  verify  that  the  list  of  primes  p  <  100  given 
in  Table  4.1  is  correct.  All  the  integers  from  2  to  100  are  listed  in  Figure  4.3. 
Initially,  2  was  circled  and  then  all  even  integers  were  crossed  out  with  a  single 
stroke.  At  the  second  stage,  3  was  circled  and  all  multiples  of  3  not  yet  crossed 
out  were  crossed  out  with  two  strokes.  Then  5  was  circled  and  all  multiples  of  5 
not  yet  crossed  out  were  crossed  out  with  three  strokes.  Finally,  7  was  circled  and 
all  multiples  of  7  not  previously  crossed  out  were  crossed  out  with  four  strokes. 
The  primes  less  than  100  are  those  which  are  not  crossed  out.  ▲ 
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Figure  4.3 


4.3.6  THEOREM 


Proof 


EXAMPLE  16 


4.3.7  PROPOSITION 
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The  Sieve  of  Eratosthenes  used  to  determine  the  primes  p  <  100. 

The  prime  numbers  are  considered  “building  blocks”  because  of  the  following 
important  theorem,  which  generalizes  observations  such  as  8  =  2-  2- 2,  15  =  3-5 
and  60  =  2  •  2  •  3  •  5. 

Every  natural  number  n  >  2  can  be  written  n  =  p\pi  •  ■  ■  pr  as  the  product  of 
prime  numbers  p\,  p2, . . .  ,  pr  or,  by  grouping  equal  primes,  in  the  form  n  = 
q^'q®2  ' ' '  Qs*  as  product  of  powers  of  distinct  primes  q\ ,  qi,  ■  ■  .  ,  qs- 

If  the  result  is  false,  then  the  set  of  integers  n  >2  which  cannot  be  written  as  the 
product  of  primes  is  not  empty  and  so,  by  the  Well-Ordering  Principle,  contains  a 
smallest  element  m.  This  number  cannot  be  prime,  so  m  =  ab  with  1  <  a,  b  <  m. 
By  minimality  of  m,  each  of  a  and  b  is  the  product  of  primes,  hence  so  is  m,  a 
contradiction.  I 


100  =  2  •  2  •  5  •  5  =  2252 

1 176  =  2-  2-  2-  3-  7-  7  =  23-3-72 

21340  =  2-2-5  •  11 -97  =  22 -5- 11 -97.  A 

Theorem  4.3.6  can  be  strengthened.  There  is,  in  fact,  just  one  way  to  express 
a  natural  number  as  the  product  of  primes,  but  in  order  to  prove  this,  we  need  a 
preliminary  result  which  is  actually  a  special  case  of  something  we  proved  earlier 
(see  Corollary  4.2.10).  Notice  that  3  |  12  and  regardless  of  how  12  is  factored — 
12  =  2(6)  =  3(4) — 3  always  divides  one  of  the  factors.  The  next  proposition 
describes  the  most  important  property  of  a  prime  number. 

Suppose  a  and  b  are  integers  and  p  is  a  prime  such  that  p  |  ab.  Then  p  j  a  or  p  j  b. 

This  indeed  follows  from  Corollary  4.2.10  because,  if  p  |  a,  then  p  and  a 
are  relatively  prime. 

Now  suppose  a  prime  p  divides  the  product  a\ai  ---a*  of  k  integers.  By 
Proposition  4.3.7,  p\ci\  or  p  |  ai  ■  •  •  a*.  In  the  latter  case,  applying  the  proposition 
again,  we  see  that  p  \  aj  or  p  \  ■  ■  ■  a^.  In  this  way,  we  obtain  the  following 

corollary. 
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4.3.8  COROLLARY 


4.3.10  DEFINITION 


If  a  prime  p  divides  the  product  cuaj-  ■  ■ a *  of  integers,  then  p  divides  one  of 
the  cij . 

Now  to  prove  that  a  natural  number  n  >  1  can  be  factored  in  just  one  way 
as  the  product  of  primes,  assume  that  n  can  be  factored  in  two  different  ways: 

(3)  n  =  p\P7  ■  ■  ■  Pk  =  q\qi---  qe 

with  all  the  p,  and  q,  primes.  After  canceling  equal  factors  on  each  side  of 
this  equation,  we  either  have  an  equation  which  says  that  the  product  of  certain 
primes  is  1  (an  absurdity)  or  another  equation  like  (3),  where  none  of  the  primes 
Pi  is  among  the  primes  qj.  Then,  since  p  \  \  p\pz---  Pk,  it  would  follow  that 
P\  |  q\qi  •  ■  qi-  By  Corollary  4.3.8,  p\  |  qj  for  one  of  the  primes  qj.  Since  both 
p\  and  qj  are  primes,  this  forces  p\  =  q  j,  a  contradiction. 

The  fact  that  every  integer  n  >  1  can  be  written  uniquely  as  the  product  of 
prime  numbers  is  called  the  “Fundamental  Theorem  of  Arithmetic."  Because  of 
its  importance,  we  state  it  again. 

Every  integer  n  >2  can  be  written  in  the  form 

(4)  n=pip2-pr 

for  unique  primes  p\,  P2,  ,  Pr\  equivalently,  every  integer  n  >2  can  be  written 

(5)  n  =  q('  c/“2  •  •  •  c/“'  (the  prime  decomposition  of  n) 

as  the  product  of  powers  of  distinct  prime  numbers  qi,qz . qs-  These  primes 

and  the  exponents  a\,az.  ,as  are  unique. 

As  noted,  the  decomposition  in  (5)  is  called  the  prime  decomposition  of  n. 
For  example,  the  prime  decomposition  of  120  is  2?  •  3  •  5. 

The  prime  factors  or  prime  divisors  of  an  integer  n  >  2  are  the  prime  numbers 
which  divide  n.  The  multiplicity  of  a  prime  divisor  p  of  n  is  the  largest  number 
a  such  that  pa  |  n. 

Thus,  the  prime  factors  of  an  integer  n  are  the  primes  pi  or  qi  given  in  (4)  and 
(5)  and  the  multiplicities  of  q\ , . . .  ,  qs  are  the  exponents  a\ , . . .  ,ax.  respectively. 
The  prime  factors  of  14  are  2  and  7  and  each  has  multiplicity  1.  The  prime  factors 
of  120  are  2,  3,  and  5;  2  has  multiplicity  3,  while  3  and  5  each  have  multiplicity  1. 


PROBLEM  17.  Find  the  prime  decomposition  of  the  greatest  common  divisor 
of  nonzero  integers  a  and  b. 

Solution.  By  the  Fundamental  Theorem  of  Arithmetic,  a  and  b  can  be  expressed 
in  the  form 

a  =  ±p“'  p*2  ■  ■  ■  pa/ .  b  =  ±pf 1  p(f  ■  ■  ■  p?r 

for  certain  primes  p\,  pz,  ,  pr  and  integers  a\,az, ...  .  ar,  ff\ ,  fc, .  ■ .  ,  fir  -  (By 
allowing  the  possibility  that  some  of  the  a ,•  or  ft  are  0,  we  can  assume  that  the 
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same  r  primes  occur  in  the  decompositions  of  both  a  and  b.)  We  claim  that  the 
greatest  common  divisor  of  a  and  b  is 

minfori  ,0| )  min(a2,&)  „min(ffr,0r) 
o  P\  F2  *  *  ’  rr  ’ 

where  min  (o',,  fy)  denotes  the  smaller  of  the  two  nonnegative  integers  a,  and  fy. 

Note  that  g  |  a  since  the  exponent  min(a;- ,  fi,  )  does  not  exceed  a, ,  the  exponent 
of  the  prime  p,  in  a.  Similarly,  g  |  b.  Next,  assume  that  c  |  a  and  c  |  b.  Since  any 
prime  which  divides  c  also  divides  a  and  b,  the  only  primes  dividing  c  must 
be  among  p\.  p2, . . .  ,  pr.  Thus,  c  =  p^2  ■  ■  ■  Prr  for  some  integers  y,  .  Since 
c  |  a,  however,  we  must  have  y,-  <  a,  for  each  /  and,  similarly,  y,  <  ft,  for 
each  i  because  c  |  b.  Hence,  y,-  <  min(a,-,  /?,)  for  each  i,  so  c  |  g  and  the  result 
follows.  I 


Prime  numbers  have  held  a  special  fascination  for  humankind  ever  since  the 
Greeks  realized  there  were  infinitely  many  of  them.  They  are  so  familiar,  yet  there 
are  many  questions  concerning  them  which  are  easy  to  state  but  hard  to  answer. 
Many  remain  unsolved  today.  In  the  rest  of  this  section,  we  briefly  survey  some 
of  what  is  known  and  some  of  what  is  unknown  about  primes,  but  few  details 
and  no  proofs  will  be  given.  For  more  information,  the  interested  reader  might 
consult  Silverman’s  book,3  for  example. 

Mersenne  Primes 


4.3.11  OPEN 
PROBLEM 


While  it  is  very  difficult  to  determine  whether  or  not  a  given  large  integer  is  prime, 
the  problem  may  be  easier  for  special  classes  of  integers.  For  example,  integers  of 
the  form  In,  n  e  N,  are  never  prime  if  n  >  1  since  they  are  obviously  divisible  by 
7.  What  about  integers  of  the  form  n2  —  1?  Well,  since  n2  —  1  =  (n  —  l)(n  + 1),  these 
cannot  be  prime  either  as  long  as  n  >  2.  Father  Marin  Mersenne  (1588-1648)  was 
interested  in  integers  of  the  form  2”  —  1 .  He  showed  that  these  could  only  be  prime 
if  n  itself  was  prime.  (See  Exercise  17.)  Then  he  noted  that  it  wasn’t  sufficient  just 
for  n  to  be  prime  since  21 1  —  1  =  2047  =  23-89  is  not  prime.  He  conjectured  that 
2P  —  1  is  prime  if  p  is  any  of  the  primes  2,  3,  5,  7,  13,  17,  19,  31, 67,  127,  257  and 
composite  for  the  other  primes  p  <  251.  Unfortunately,  he  was  wrong  on  several 
counts.  For  example,  261  —  1  is  prime  while  22il  —  1  is  not.  (Resist  the  urge  to 
factor!)  By  the  end  of  1997,  just  35  primes  of  the  form  2P  —  1  had  been  found. 
Interestingly,  all  the  large  primes  which  have  been  discovered  in  recent  years  are 
in  this  list.  In  Mersenne’s  honor,  primes  of  the  form  2P  —  1  are  called  Mersenne 
primes.  While  Mersenne  primes  continue  to  be  discovered,  the  following  problem 
is  still  unresolved. 

Are  there  infinitely  many  Mersenne  primes? 


Fermat  Primes 


Another  interesting  class  of  prime  numbers  is  the  set  of  so-called  Fermat  primes, 
these  being  prime  numbers  of  the  form  22"  +  1.  For  n  =  0,  1,  2,  3,  4,  indeed  21"  +  1 


3  Joseph  H.  Silverman,  A  Friendly  Introduction  to  Number  Theory ,  second  edition  (Prentice  Hall,  2001). 
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is  prime  and  it  was  a  guess  of  the  seventeenth-century  lawyer  Pierre  de  Fermat 
(1601-1665),  perhaps  the  most  famous  amateur  mathematician  of  all  time,  that 
2“  +  1  is  prime  for  all  n  >  0.  In  1732,  however,  the  great  Swiss  mathematician 

>5 

Leonhard  Euler  (1707-1783)  showed  that  2L  -Elis  not  prime — it  is  divisible  by 
641 — and,  to  this  day,  no  further  Fermat  primes  have  been  discovered. 

When  a  Cray  supercomputer  tires  of  hunting  for  Mersenne  primes,  it  turns 
to  a  search  for  Fermat  primes!  In  a  number  of  cases,  complete  factorizations  of 
22'  +  1  are  known;  in  other  cases,  one  or  two  prime  factors  are  known.  There  are 
Fermat  numbers — numbers  of  the  form  22"  +  1 — known  to  be  composite,  though 
no  one  knows  a  single  factor!  As  of  1995,  the  smallest  Fermat  number  whose 

^24 

primality  was  unsettled  was  2~  +  1 .  More  details,  together  with  an  indication  as 
to  how  testing  huge  numbers  for  primality  is  used  to  check  the  reliability  of  some 
of  the  world’s  largest  computers,  can  be  found  in  an  article  by  Jeff  Young  and 
Duncan  A.  Buell.4 


4.3.12  OPEN 
PROBLEM 


What  are  the  first  five  Fermat  primes? 


Are  there  more  than  five  Fermat  primes? 


How  Many  Primes  Are  There? 


We  have  discussed  Euclid’s  observation  that  there  are  infinitely  many  primes 
(Theorem  4.3.3),  so  what  does  the  question  just  posed  really  mean? 

There  is  great  interest  in  the  proportion  of  natural  numbers  which  are  prime. 
Students  who  have  enjoyed  advanced  calculus  will  know  that  the  series 


1111 

I+2+3+4+ 


diverges  (the  partial  sums  increase  without  bound  as  more  and  more  terms  are 
added),  while  the  series 


72  +  52+52  +  ^2  + 


converges  (to  n2/6  in  fact).5  Notice  that  the  second  sum  here  is  part  of  the  first. 
So  the  fact  that  the  second  sum  is  finite  while  the  first  is  infinite  shows  that  the 
sequence  of  perfect  squares  l2,  22,  32, . . .  forms  only  a  tiny  part  of  the  sequence 
of  all  natural  numbers  1 , 2,  3, . . .  . 

Which  of  these  sequences  does  the  set  of  primes  most  resemble?  Does  the 
sum  of  reciprocals  of  the  primes 


1111 

2+3+5+7+" 


converge  or  diverge?  As  a  matter  of  fact,  it  diverges,  so,  in  some  sense,  there 
really  are  a  “lot”  of  prime  numbers,  many  more  than  there  are  perfect  squares, 
for  example. 


4‘The  Twentieth  Fermat  Number  is  Composite,”  Mathematics  of  Computation  50  (1988),  261-263. 

5The  reader  who  has  not  studied  advanced  calculus  can  relax!  The  ideas  to  which  we  make  reference 
here  are  not  critical  to  an  overall  appreciation  of  the  mysteries  of  the  primes. 
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4.3.13  PRIME  NUMBER 
THEOREM 


A  sophisticated  result  concerning  the  number  of  primes  is  the  important 
“Prime  Number  Theorem,”  which  was  proven  independently  by  Jacques  Hadamard 
and  Charles-Jean  de  la  Vallee-Pousin  in  1896.  It  gives  an  approximation  to  the 
function  n(x ),  which  is  the  number  of  primes  p  <  x.  For  example,  n{5)  =  3 
since  there  are  three  primes  p  <  5.  We  have  earlier  noted  that  there  are  25  primes 
p  <  100;  thus,  tt(100)  =  25. 

Let  n{x)  denote  the  number  of  primes  p  <  x.  Then 

7r(x)  x 

hm  - =  1;  equivalently,  n(x)  ~ - . 

*->-oox/lnx  lnx 


Students  uncertain  about  limits  might  wish  a  translation!  First  of  all,  we  say 
“is  asymptotic  to”  for  ~.  So  the  second  statement  of  the  theorem  reads  “ n(x )  is 
asymptotic  to  x/lnx,”  from  which  we  infer  that  n(x)  is  approximately  equal  to 
x/ lnx  for  large  x,  the  approximation  getting  better  and  better  as  x  grows.  Setting 
x  =  100,  Theorem  4.3.13  asserts  that  the  number  of  primes  p  <  100  is  roughly 
100/  In  100  «  21.715.  Note  that 


3r(100)  _  25 

100/  In  100  ~  21.715 


1.151. 


[The  symbol  %  means  “approximately.”]  Setting  x  =  1,000,000,  the  theorem  says 
that  the  number  of  primes  under  1  million  is  roughly  1,000,000/ In  1,000,000  % 
72,382.  In  fact,  there  are  78,498  such  primes.  Note  that 

_ %ZM98%1o84 

1,000,000/ In  1,000,000  72,382 


4.3.14  FERMAT'S  LAST 
THEOREM 


As  x  gets  larger,  the  fraction  gets  closer  and  closer  to  1 . 

On  June  23,  1993,  at  a  meeting  at  the  Isaac  Newton  Institute  in  Cambridge, 
England,  Andrew  Wiles  of  Princeton  University  announced  a  proof  of  “Fermat’s 
Last  Theorem,”  arguably  the  most  famous  open  mathematics  problem  of  all  time. 

For  any  integer  n  >  2,  the  equation  an  +  bn  =  c"  has  no  nonzero  integer  solutions 
a ,  b ,  c. 


Notice  that  it  is  sufficient  to  prove  this  theorem  just  for  the  case  that  n  is  a 
prime.  For  example,  if  we  knew  that  a3  +  b3  =  c3  had  no  integral  solutions,  then 
neither  would  A3'1  +  B3n  =  C3n .  If  the  latter  had  a  solution,  so  would  the  former, 
with  a  =  A",  b  =  B ",  c  =  C". 

When  we  first  learned  the  Theorem  of  Pythagoras  for  right-angled  triangles, 
we  discovered  that  there  are  many  triples  a,  b,c  of  integers  which  satisfy  a2+b2  = 
c~\  for  example,  3,4,5  and  5,  12,  13,  but  are  there  triples  of  integers  a,b,c 
satisfying  a3  +  b3  =  c3  or  a1  +  b1  —  c1  or  a'1  +  bn  =  c"  for  any  values  of  n 
except  n  =  2? 

Pierre  de  Fermat  was  notorious  for  scribbling  ideas  in  the  margins  of  whatever 
he  was  reading.  In  1637,  he  wrote  in  the  margin  of  Diophantus’s  book  Arithmetic 
that  he  had  found  a  “truly  wonderful"  proof  that  an  +  bn  =  c"  had  no  solu¬ 
tions  in  the  positive  integers  for  n  >  2,  but  that  there  was  insufficient  space  to 
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write  it  down.  Truly  wonderful  it  must  have  been  because  for  over  350  years, 
mathematicians  were  unable  to  bnd  a  proof,  though  countless  many  tried! 

Amateur  and  professional  mathematicians  alike  devoted  years  and  even  life¬ 
times  to  working  on  Fermat’s  Last  Theorem.  The  theorem  owes  its  name,  by  the 
way,  to  the  fact  that  it  is  the  last  of  the  many  conjectures  made  by  Fermat  dur¬ 
ing  his  lifetime  to  have  resisted  resolution.  By  now,  most  of  Fermat’s  unproven 
suggested  theorems  have  been  settled  (and  found  to  be  true). 

In  1983,  Gerd  Faltings  proved  that,  for  each  n  >  2,  the  equation  an  +bn  =  cn 
could  have  at  most  a  finite  number  of  solutions.  While  this  was  a  remarkable 
achievement,  it  was  a  long  way  from  showing  that  this  finite  number  was  zero. 
Then,  in  1985,  Kenneth  Ribet  of  Berkeley  showed  that  Fermat’s  Last  Theorem 
was  a  consequence  of  a  conjecture  first  proposed  by  Yutaka  Taniyama  in  1955  and 
clarified  by  Goro  Shimura  in  the  1960s.  It  was  a  proof  of  the  Shimura-Taniyama 
conjecture  which  Andrew  Wiles  announced  on  June  23,  1993,  a  truly  historic 
day  in  the  world  of  mathematics.  The  months  following  this  announcement  were 
extremely  exciting  as  mathematicians  all  over  the  world  attempted  to  understand 
Wiles’s  proof.  Not  unexpectedly  in  such  a  complex  and  lengthy  argument,  a  few 
flaws  were  found.  By  the  end  of  1994,  however,  Wiles  and  one  of  his  former  grad¬ 
uate  students,  Richard  Taylor,  had  resolved  the  remaining  issues  to  the  satisfaction 
of  all. 

Among  the  many  exciting  accounts  of  the  history  of  Fermat’s  Last  Theorem 
and  of  Wiles’s  work,  we  draw  special  attention  to  an  article  by  Barry  Cipra,  “Fer¬ 
mat’s  Theorem — at  Last!”,  which  was  the  leading  article  in  What's  Happening 
in  the  Mathematical  Sciences ,  Vol.  3  (1995-1996),  published  by  the  American 
Mathematical  Society.  Faltings  himself  wrote  “The  Proof  of  Fermat's  Last  The¬ 
orem  by  R.  Taylor  and  A.  Wiles”  for  the  Notices  of  the  American  Mathematical 
Society,  Vol.  42  (1995),  No.  7.  In  fact,  many  excellent  accounts  have  been  written. 
We  cite  just  a  few.  There  is  one  entitled  “The  Marvelous  Proof’  by  Fernando  Q. 
Gouvea,  which  appeared  in  the  American  Mathematical  Monthly,  Vol.  101  (1994), 
and  others  by  Ram  Murty,  Notes  of  the  Canadian  Mathematical  Society,  Vol.  25 
(September  1993)  and  by  Keith  Devlin,  Fernando  Gouvea,  and  Andrew  Granville, 
Focus,  Vol.  13,  Mathematical  Association  of  America  (August  1993). 

More  Open  Problems 

So  far,  we  have  only  peeked  into  the  Pandora’s  box  of  fascinating  but  unanswered 
problems  concerning  prime  numbers.  Here  are  a  few  more. 

There  are  intriguing  questions  concerning  prime  “gaps,”  the  distances  between 
consecutive  primes.  On  the  one  hand,  there  are  arbitrarily  long  gaps  in  the  list  of 
primes.  One  way  to  see  this  is  to  observe  that  if  the  first  n  +  1  primes  are  p\,  p2, 
. . .  ,  pn+i,  then  all  the  numbers  between  p\p2  ■  ■  ■  pn+ 2  and  p\ pi  ■  ■  ■  p„+Pn+  \  - 1 
are  composite.  (See  Exercise  26.)  On  the  other  hand,  there  are  also  very  small 
gaps  in  the  primes,  for  example,  gaps  of  length  two  such  as  between  3  and  5  or 
1 1  and  13. 

Integers  p  and  p  +  2  which  are  both  prime  are  called  twin  primes.  For 
example,  3  and  5,  5  and  7,  11  and  13,  41  and  43  are  twin  primes.  So  are 
4,648,619,711,505  x  260’000  ±  1,  a  1999  discovery  of  Heinz-Georg  Wassing, 
Antal  Jarai,  and  Karl-Heinz  Indlekofer.  As  of  mid-2000,  these  numbers,  each 
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4.3.15  THE  TWIN 
PRIME  CONJECTURE 


with  18,075  digits,  remained  the  largest  known  pair  of  twin  primes.  It  is  possible, 
though  not  likely,  that  there  are  no  more.  Unlike  prime  numbers,  it  is  not  known 
whether  or  there  are  infinitely  many  twin  primes.  One  tantalizing  result,  proved 
by  Viggo  Brun  in  1921  using  a  variation  of  the  Sieve  of  Eratosthenes,  is  that  the 
sum  of  the  reciprocals  of  just  the  twin  primes  converges.  Having  noted  that  the 
sum  of  the  reciprocals  of  all  the  primes  diverges,  B  run’s  result  is  evidence  that 
the  number  of  twin  primes  is  “small.”  On  the  other  hand,  there  are  224,376,048 
twin  prime  pairs  less  than  100  billion  and  this  seems  like  a  pretty  big  number.  Is 
the  total  number  finite?  No  one  knows. 

There  are  infinitely  many  twin  primes. 


4.3.16  THE  60LDBACH 
CONJECTURE 


Observe  that  4  =  2+2,  6  =  3+3,  8  =  5  +  3,  28  =  17  +  11,  and  96  =  43  +  53. 
Every  even  integer  appears  to  be  the  sum  of  two  primes.  Is  this  really  right?  The 
conjecture  that  it  is,  first  made  by  Christian  Goldbach  in  1 742  in  a  letter  to  Euler, 
has  proven  remarkably  resistant  to  solution.  In  1937,  I.  M.  Vinogradov  showed 
that  every  sufficiently  large  integer  can  be  written  as  the  sum  of  at  most  four 
primes.  Sufficiently  large  means  that  there  is  a  positive  integer  no  such  that  every 
integer  larger  than  no  satisfies  the  condition.  In  1966,  J.  Chen  showed  that  every 
sufficiently  large  even  integer  can  be  written  as  x  +  y  where  jc  is  prime  and  y  is 
either  prime  or  the  product  of  two  primes.  So  we  seem  to  be  close  to  a  solution 
of  Goldbach’ s  conjecture,  though  the  last  step  is  often  the  hardest. 

Every  even  integer  greater  than  2  is  the  sum  of  two  primes. 


Answers  to  Pauses 


9.  For  instance,  numbers  of  the  form  In  are  not  prime,  and  there  are  infinitely 
many  of  these. 

10.  Archimedes  was  a  Greek  scientist  of  the  third  century  B.C.  perhaps  best 
known  for  the  “Principle  of  Archimedes.”  This  states  that  the  weight  of  the 
fluid  displaced  by  a  floating  object  is  equal  to  the  weight  of  the  object  itself. 

11.  The  first  five  Fermat  primes  are  2<20)  + 1  =  21  + 1  =3,  2(2l) + 1  =  22+ 1  =5, 
2(22)  +  1  =  24  +  1  =  17,  2(23->  +  1  =  28  +  1  =  257,  and  2(24)  +  1  =  216  +  1  = 
65537. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Determine  whether  or  not  each  of  the  following  integers 
is  a  prime. 

(a)  [BB]  157 

(b)  [BB]  9831 

(c)  9833 

(d)  55,551,111 


(e)  2216'090  -  1 

2.  Can  Lemma  4.3.4  be  strengthened?  In  other  words,  does 
there  exist  a  number  r  <  yfn  such  that  if  n  is  not  prime, 
then  n  has  a  prime  factor  p  <  r? 

3.  (a)  [BB]  Suppose  p  is  the  smallest  prime  factor  of  an 

integer  n  and  p  >  sfnfp.  Prove  that  n/ p  is  prime. 
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(b)  [BB]  Express  16,773,121  as  the  product  of  primes 
given  that  433  is  this  number’s  smallest  prime 
factor. 

4.  Find  the  prime  decomposition  of  each  of  the  following 
natural  numbers. 

(a)  [BB]  856 

(b)  2323 

(c)  6647 

(d)  9970 

(e)  [BB]  (28  -  l)20 
(F)  55,551,111 

5.  [BB]  Use  the  Fundamental  Theorem  of  Arithmetic  to 
prove  that  for  no  natural  number  n  does  the  integer  14" 
terminate  in  0. 

6.  Let  A  =  [f  eQ|3|»}. 

(a)  List  five  different  elements  of  A  at  most  one  of 
which  is  an  integer. 

(b)  [BB]  Prove  that  A  is  closed  under  addition. 

(c)  Prove  that  A  is  closed  under  multiplication. 

7.  Let  A  be  any  subset  of  Z  \  [0]  and,  for  a,b  e  A,  define 
a  ~  b  if  ab  is  a  perfect  square  (that  is,  the  square  of 
an  integer).  Show  that  ~  defines  an  equivalence  relation 
on  A. 

8.  True  or  false?  Explain  your  answers. 

(a)  For  all  n  e  N,  n  >  1,  there  exists  a  prime  p  such 
that  p  |  n. 

(b)  [BB]  There  exists  a  prime  p  such  that  p\n  for  all 
n  e  N,  n  >  1. 

(The  position  of  the  universal  quantifier  makes  a  world 
of  difference!) 

9.  Define  / :  N  \  { 1 }  — <■  N  by  setting  f(n)  equal  to  the 
largest  prime  divisor  of  n. 

(a)  Find  the  range  of  /. 

(b)  Is  f  one-to-one? 

(c)  Is  /  onto? 

(d)  Why  did  we  not  express  /  as  a  function  N  — N? 
Explain  your  answers. 

10.  Determine  whether  or  not  each  of  the  following  func¬ 
tions  N  x  N  — »  N  is  one-to-one.  Explain  your  answers. 

(a)  [BB]  f (n,  m)  =  2m6" 

(b)  /(n,m)  =  36"' 6" 

11.  (a)  [BB]  Find  7r ( 1 0)  and  approximate  values  of 

10/  In  10  and  (three  decimal  place  accuracy). 


(b)  Find  n (50 )  and  approximate  values  of  50/  In  50  and 

*(50) 

50/ In  50' 

(c)  Find  7t(95)  and  approximate  values  of  95/  In  95  and 

?r(95) 

95/ In  95’ 

12.  (a)  Use  the  Sieve  of  Eratosthenes  to  list  all  the  primes 

less  than  200.  Find  7t(200)  and  the  values  of 
200/  In  200  and  2oo/2inC->oo  (t0  three  decimal  places), 
(b)  Use  the  Sieve  of  Eratosthenes  to  list  all  the  primes 
less  than  500.  Find  7r(500)  and  the  values  of 
500/  In  500  and  (t0  three  decimal  places). 

13.  Estimate  the  number  of  primes  less  than  5000,  less  than 
50,000,  less  than  500,000,  and  less  than  5,000,000. 

14.  [BB]  Let  p  and  q  be  distinct  primes  and  n  a  natural 
number.  If  p  |  n  and  q\n,  why  must  pq  divide  n? 

15.  [BB]  Given  distinct  positive  integers  a  and  b,  show  that 
there  exists  an  integer  n  >  0  such  that  a  +  n  and  b  +  n 
are  relatively  prime. 

16.  For  any  natural  number  n,  let  d(n )  denote  the  number  of 
positive  divisors  of  n.  For  example,  d{ 4)  —  3  because 
4  has  three  positive  divisors;  namely,  1 ,  2,  and  4. 

(a)  Describe  those  natural  numbers  n  for  which 

d(n )  =  2. 

(b)  [BB]  Describe  those  natural  numbers  n  for  which 
d(n)  —  3. 

(c)  Describe  those  natural  numbers  n  for  which 

d(n)  =  5. 

17.  (a)  [BB]  Is  215  —  1  prime?  Explain  your  answer. 

(b)  Is  291  —  1  prime?  Explain  your  answer. 

(c)  Show  that  if  2"  —  1  is  prime,  then  necessarily  n  is 
prime. 

(d)  Is  the  converse  of  (c)  true?  (If  n  is  prime,  need  2"  —  1 
be  prime?) 

18.  (a)  [BB]  Show  that  26  +  1  is  not  prime. 

(b)  Show  that  220  +  1  is  not  prime. 

(c)  Show  that  if  2"  +  1  is  prime,  then  necessarily  n  is 
a  power  of  2. 

19.  (a)  Show  that  the  sum  of  two  odd  prime  numbers  is 

never  prime. 

(b)  Is  (a)  true  if  the  word  odd  is  deleted? 

20.  [BB]  Show  that  the  sum  of  two  consecutive  primes  is 
never  twice  a  prime. 

21.  If  n  is  an  odd  integer,  show  that  jr2  —  y2  =  2 n  has  no 
integer  solutions. 

22.  LBB]  True  or  false:  [w  €  N  |  n  >  2  and  an  +  bn  = 
c "  for  some  a,  b ,  c  €  N)  =  0? 


4.3  Prime  Numbers  125 


23.  [BB]  Suppose  that  a ,  b,  and  c  are  integers  each 
two  of  which  are  relatively  prime.  Prove  that 
gcd  (ab,bc,ac)  =  1. 

24.  Let  a,  b ,  and  c  be  integers  each  relatively  prime  to 
another  integer  n.  Prove  that  the  product  abc  is  rela¬ 
tively  prime  to  n. 

25.  Given  that  p  is  prime,  gcd(a,  p 2)  =  p  and  gcd (b,  p 3)  = 
p2,  find 

(a)  [BB]  gcd (ab,  p4) 

(b)  gcd(a  +  b ,  p 4) 

26.  Let  pi,  p2,  ....  Pn+i  denote  the  first  n  +  1  primes  (in 
order).  Prove  that  every  number  between  p\ p2  ■  ■  ■  pn  +2 
and  p\ p2  ■  ■  ■  p,,+Pn+  i  —  l  (inclusive)  is  composite.  How 
does  this  show  that  there  are  gaps  of  arbitrary  length  in 
the  sequence  of  primes? 

27.  If  the  greatest  common  divisor  of  integers  a  and  b  is  the 
prime  p,  what  are  the  possible  values  of 

(a)  [BB]  gcd(a2,  b)l 

(b)  gcd (a\b)l 

(c)  gcd (a2,b2)l 

28.  [BB]  Let  a  and  b  be  natural  numbers  with  gcd(u,  b)  =  1 
and  ab  a  perfect  square ;  that  is,  ab  —  x2  for  some 
natural  number  x.  Prove  that  a  and  b  are  also  perfect 
squares. 

29.  Let  a  and  b  be  natural  numbers. 

(a)  Find  the  prime  decomposition  of  lcm(a,  b)  in  terms 
of  the  prime  decompositions  of  a  and  b  and  prove 
your  answer.  (See  Problem  17,  p.  118.) 

(b)  Use  (a)  to  prove  formula  (2)  of  Section  4.2  in  the 
case  a,  b  >  0:  gcd(a,  b)  lcm(a,  b )  =  ab. 

30.  [BB]  Prove  that  an  integer  which  is  both  a  square  ( a 2 
for  some  a )  and  a  cube  ( b 3  for  some  b )  is  also  a  sixth 
power. 

31.  Let  a  and  b  be  integers.  Let  p  be  a  prime.  Answer  true 
or  false  and  explain: 

(a)  [BB]  If  /? ( <7 1 1 ,  then  p\a. 

(b)  If  p  |  a  and  p  |  (a2  +  b2),  then  p  j  b. 

(c)  If  p  |  (a9  +  a11),  then  p  |  a. 

32.  [BB]  Show  that  there  are  infinitely  many  triples  of  inte¬ 
gers  a,  b,c  which  satisfy  a1  +  b2  =  c2. 

33.  (a)  [BB]  Prove  that  every  odd  positive  integer  of  the 

form  3 n  +  2,  n  e  N,  has  a  prime  factor  of  the 


same  form.  What  happens  if  the  word  odd  is 
omitted? 

(b)  Repeat  (a)  for  positive  integers  of  the  form  4n  +  3. 

(c)  Repeat  (a)  for  positive  integers  of  the  form  6 n  +  5. 

(d)  Prove  that  there  are  infinitely  many  primes  of  the 
form  6 n  +  5. 

34.  Suppose  p  and  p  +  2  are  twin  primes  and  p  >  3.  Prove 
that  6 1  (p  +  I). 

35.  Let  Q '  denote  the  set  of  positive  rational  numbers.  If 
f  is  in  Q+,  then 

•  either  m  =  1  or  m  =  pV  pef  ■  ■  ■  pek  is  the  product  of 
powers  of  distinct  primes  p\,  p2,  ■ . .  .  pk\ 

•  either  n  =  1  or  n  =  q(lq{2  ■  ■  ■  qf  is  the  product  of 

powers  of  distinct  primes  q\,  qi . qt 

and,  in  the  case  m  f  1  and  n  ^  1,  we  may  assume  that 
no  pi  equals  any  qj.  Now  define  / :  Q+  — »■  N  by 


/O)  =  1 


/(f) 


if  n  =  1 ,  m  /  1 
if  m  —  1 ,  n  ^  1 
if  m  ^  1,  n  ^  1 


(a)  Find  /( 8)  [BB],  /(£),  /(100)  and  /(g). 

(b)  Find  x  such  that  f(x)  =  1,000,000  [BB],  t  such 
that  f(t)  =  10,000,000  and  s  such  that  such  that 
f(s)  =  365,040. 

(c)  Show  that  /  is  a  bijection. 


(This  exercise,  which  gives  a  direct  proof  that  the  posi¬ 
tive  rationals  are  countable,  is  due  to  Yoram  Sagher.  See 
the  American  Mathematical  Monthly,  Vol.  96  (1989), 
no.  9,  p.  823.) 

36.  For  positive  integers  a  and  b,  define  a  ~  b  if  there  exist 
integers  n  >  1  and  m  >  1  such  that  am  =  bn . 


(a)  Prove  that  ~  defines  an  equivalence  relation  on  N. 

(b)  Find  3,  4,  and  144. 

(c)  Find  the  equivalence  class  of  a  6  N.  [ Hint :  Write 


a?]  ap  oik 

a  =  P\  Pi  Pk  ■ 


37.  (a)  Write  a  computer  program  which  implements  the 
Sieve  of  Eratosthenes  as  described  in  the  text. 

(b)  Use  your  program  to  enumerate  all  primes  less  than 
1000. 

(c)  What  is  7r(1000)?  Compare  this  with  approximate 
values  (three  decimal  place  accuracy)  of 
1000/ln  1000  and 
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4.4  Congruence 


4.4.1  DEFINITION 


EXAMPLES  18 


4.4.2  DEFINITION 


EXAMPLE  19 


If  it  is  11  p.m.  in  Los  Angeles,  what  time  is  it  in  Toronto?  If  he  or  she  were 
aware  that  Los  Angeles  is  three  time  zones  west  of  Toronto,  a  person  might 
well  respond  (correctly)  “2  a.m.”  The  process  by  which  the  time  in  Toronto  was 
obtained  is  sometimes  called  clock  arithmetic,  more  properly,  it  is  addition  modulo 
12:  11+3  =  14  =  2  (modulo  12).  It  is  based  upon  the  idea  of  congruence ,  the 
subject  of  this  section. 

Let  n  >  1  be  a  fixed  natural  number.  Given  integers  a  and  b,  we  say  that  a  is 
congruent  to  b  modulo  n  (or  a  is  congruent  to  b  mod  n  for  short)  and  we  write 
a  =  b  (mod  n),  if  and  only  if  n  |  {a  —  b).  The  number  n  is  called  the  modulus  of 
the  congruence. 

3  =  17  (mod  7)  because  3  —  17  =  —14  is  divisible  by  7;  —2  =  13  (mod  3) 
because  —2  —  13  =  —15  is  divisible  by  3;  60  =  10  (mod  25);  —4  =  —49 
(mod  9).  ▲ 

As  a  binary  relation  on  Z,  congruence  is 
reflexive:  a  =  a  (mod  n)  for  any  integer  a, 

symmetric:  if  a  =  b  (mod  n),  then  b  =  a  (mod  n)  and 

transitive:  if  a  =  b  (mod  n)  and  b  =  c  (mod  n ),  then  a  =  c 

(mod  n). 

Congruence  is  reflexive  because  a  —  a  —  0  is  divisible  by  az.  It  is  symmetric 
because  if  n\(a  —  b),  then  n\(b  —  a)  because  b  -  a  =  —(a  -  b).  It  is  transitive 

because  if  both  a  —  b  and  b  —  c  are  divisible  by  n,  then  so  is  their  sum,  which  is 

a  —  c.  Thus,  for  any  n  >  1,  congruence  mod  n  is  an  equivalence  relation  on  Z. 

We  urge  you  to  review  the  basic  features  of  equivalence  relations  which  were 
discussed  in  Section  2.4.  Recall,  for  instance,  that  an  equivalence  relation  parti¬ 
tions  the  underlying  set  into  subsets  called  equivalence  classes,  the  equivalence 
class  of  an  element  being  the  set  of  those  elements  to  which  it  is  equivalent.  The 
equivalence  classes  of  congruence  mod  n  are  called  congruence  classes. 

The  congruence  class  mod  n  of  an  integer  a  is  the  set  of  all  integers  to  which  a 
is  congruent  mod  n.  It  is  denoted  a.  Thus, 

a  =  {beZ\a  =  b  (mod  n)}. 

[Since  congruence  mod  n  is  symmetric,  we  do  not  have  to  remember  whether  to 
write  a  =  b  (mod  n)  or  b  =  a  (mod  n)  in  this  definition.  You  can’t  have  one 
without  the  other!] 

Let  n  =  5.  Since  — 8  —  17  =  —25  is  divisible  by  5,  we  have  —8  =  17  (mod  5). 
Thus,  —8  belongs  to  the  congruence  class  of  17;  in  symbols,  —8  e  17.  Notice  also 
that  17  e  —8.  In  fact,  you  should  check  that  — 8  =  17.  (See  Proposition  4.4.3.) 


Let  us  find  all  congruence  classes  of  integers  mod  5.  To  begin. 
0  =  {beZ\b  =  0  (mod  5)} 


4.4  Congruence  127 


=  {&  e  Z  |  5 1  (£>  —  0)} 

=  [b  €  Z  |  b  =  5k  for  some  integer  k ) 
and 

l={bGZ\b=l  (mod  5)} 

—  {b  el\  5\{b  —  \)} 

=  {b  e  Z  |  &  -  1  =  5k  for  some  integer  k } 

=  [b  G  Z  |  &  =  5k  4-  1  for  some  integer  &). 

In  Section  2.4,  we  introduced  the  notation  5Z  and  5Z  +  1  for  these  sets  0  and  1, 
respectively.  Continuing,  we  find  that 

2  =  \b  e  Z  |  =  5£  +  2  for  some  £  e  Z}  =  5Z  +  2 

3  =  {b  e  Z  |  =  5&  +  3  for  some  k  e  Z}  =  5Z  +  3 

and 


4  =  {b  €  1  |  b  =  5k  +  4  for  some  k  e  1}  =  5Z  +  4. 

It  is  useful  to  observe  that  the  five  congruence  classes  determined  so  far  par¬ 
tition  the  integers  in  the  sense  that  they  are  pairwise  disjoint  and  their  union 
is  Z. 

Why  are  they  pairwise  disjoint?  If  a  e  (5Z  +  r)  n  (5Z  +  s)  for  0  <  r,  s  <  4, 
then  a  =  5k  +  r  =  51  +  5  for  some  integers  k  and  t  and  so  r  -  s  =  5(£  —  k) 
would  be  a  multiple  of  5.  For  r,  s  between  0  and  4  this  can  only  happen  if 
r  =  s. 

Why  is  their  union  Z?  For  any  a  €  Z,  by  the  Division  Algorithm  (Theo¬ 
rem  4.1.5),  we  can  write  a  =5 q  +  r  with  q,  r  integers  and  0  <  r  <  5.  Since 
r  e  (0,  1,  2,  3,  4),  the  integer  a  is  in  5Z,  5Z  +  1,  5Z  +  2,  5Z  +  3,  or  5Z  +  4. 

Since  congruence  classes  are  equivalence  classes  and,  in  general,  any  two 
equivalence  classes  are  disjoint  or  equal  (Proposition  2.4.4),  it  follows  that  for 
any  integer  n,  n  must  be  one  of  0,  1,  2,  3,  or  4.  With  which  of  these  classes,  for 
example,  does  5  coincide?  Observe  that  5  e  5,  since  5  =  5  (mod  5).  Also,  as 
noted  previously,  5  €  0.  Since  the  classes  0  and  5  are  not  disjoint,  they  are  equal: 
5=0. 

Similarly,  since  6  belongs  to  both  6  and  1,  we  have  6=1.  You  should  also 
see  that  7  =  2,  8  =  3,  — 14  =  1,  and  so  on. 

Our  next  few  results  are  almost  immediate  consequences  of  the  theory  of 
equivalence  relations. 

Let  a ,  b,  and  n  be  integers  with  n  >  1.  Then  the  following  statements  are 
equivalent. 

(1)  n  |  (a  —  b). 

(2)  a  =  b  (mod  n). 

(3)  a  e  b. 
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Proof 


4.4.4  COROLLARY 


4.4.5  PROPOSITION 


Proof 


(4)  be  a. 

(5)  a  =  b. 

Each  of  the  implications  (1)  —>  (2)  (3)  is  a  direct  consequence  of  definitions 

and  (3)  — »  (4)  follows  from  the  symmetry  of  congruence.  We  may  therefore 
complete  the  proof  by  establishing  (4)  — >■  (5)  and  (5)  ->■  (1).  To  see  that  (4)  -> 
(5),  let  b  e  a.  Then  b  =  a  (mod  n)  so  a  =  b  by  Proposition  2.4.3.  To  see  that 
(5)  — >  (1),  suppose  that  a  =  b.  Then  a  e  b  because  a  e  a,  so  a  =  b  (mod  n) 
and  n  |  (a  —  b).  I 

Of  all  the  equivalences  established  in  Proposition  4.4.3,  we  emphasize  perhaps 
the  most  important  one. 

For  integers  a ,  b,  and  n  with  n  >  1, 


a  =  b  (mod  n)  if  and  only  if  a  =  b. 


The  next  proposition  generalizes  the  special  case  n  =  5,  which  we  have 
already  investigated. 

Any  integer  is  congruent  mod  n  to  its  remainder  upon  division  by  n.  Thus,  there 
are  n  congruence  classes  of  integers  mod  n  corresponding  to  each  of  the  n  possible 
remainders 


0  =  nZ 

1  =  nl  +  1 

2  =  nl  +  2 


n  —  1  =  nZ  +  (n  —  1). 

These  congruence  classes  partition  Z;  that  is,  they  are  disjoint  sets  whose  union 
is  the  set  of  all  integers. 

Suppose  a  is  an  integer.  The  remainder  when  a  is  divided  by  n  is  the  number 
r,  0  <  r  <  n,  obtained  when  we  write  a  =  qn  +  r  according  to  the  Division 
Algorithm  (Theorem  4.1.5).  Since  a  —  r  =  qn  is  divisible  by  n,  we  obtain  a  =  r 
(mod  «),  as  claimed. 

Since  0  <  r  <  n,  the  integer  r  is  one  of  0,  1.  2, . . .  ,  n  —  1.  Thus,  a  belongs 
to  one  of  the  specified  classes.  It  remains  only  to  show  that  these  classes  are 
disjoint.  For  this,  note  that  if  r\,  r2  €  {0,  1, . . .  ,  n  —  1},  then  r\  ^  n  (mod  n). 
Thus,  r i  e  7\,  but  r\  £  rT,  so  the  congruence  classes  7\  and  rT  are  not  equal.  Since 
congruence  classes  are  equivalence  classes,  FflTr^  =  0.  by  Proposition  2.4.4.  Thus, 
the  classes  0 . n  —  1  are  indeed  disjoint.  I 

Suppose  n  =  32.  There  are  32  congruence  classes  of  integers  mod  32;  namely, 

0  =  32Z,  I  =  32Z  +  1 ,  ...,  3~F  =  32Z  +  31. 


EXAMPLE  20 
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To  determine  the  class  to  which  a  specific  integer  belongs,  say  3958,  we  use  the 
Division  Algorithm  to  write  3958  =  123(32)  +  22. 6  Thus,  22  is  the  remainder,  so 
3958  =  22  (mod  32)  by  Proposition  4.4.5  and  3958  e  22.  ▲ 

Find  an  integer  r,  0  <  r  <  18,  such  that  3958  =  r  (mod  18).  Do  the  same  for 
-3958. 

Proposition  4.4.5  says  that  every  integer  is  congruent  modulo  n  to  one  of  the 
n  integers  0,  1,  2, . . .  ,  n  —  1.  Therefore,  when  working  mod  n,  it  is  customary  to 
assume  that  all  integers  are  between  0  and  n  -  1  (inclusive)  and  to  replace  any 
integer  a  outside  this  range  by  its  remainder  upon  division  by  n.  This  remainder 
is  called  “ a  (mod  «)”  and  the  process  of  replacing  a  by  a  (mod  n)  is  called 
reduction  modulo  n. 


.  4.4.6  DEFINITION 


If  n  >  1  is  a  natural  number  and  a  is  any  integer,  a  (mod  n)  is  the  remainder  r, 
0  <  r  <  n,  obtained  when  a  is  divided  by  n. 


EXAMPLES  21 


—  17  (mod  5)  =  3,  28  (mod  6)  =  4  and  —30  (mod  9)  =  6. 

The  integer  29  reduced  modulo  6  is  5.  A 


Just  as  for  equations,  to  “solve”  a  congruence  or  a  system  of  congruences 
involving  one  or  more  unknowns  means  to  find  all  possible  values  of  the  unknowns 
which  make  the  congruences  true,  always  respecting  the  convention  given  in 
Definition  4.4.6.  Without  this  convention,  any  even  number  is  a  solution  to  the 
congruence  2x  =0  (mod  4).  With  the  convention,  however,  we  give  only  x  =  0 
and  x  =  2  as  solutions. 


PROBLEM  22.  Solve  each  of  the  following  congruences  if  possible.  If  no  solu¬ 
tion  exists,  explain  why  not. 

(a)  3x  =  1  (mod  5) 

(b)  3jc  =  1  (mod  6) 

(c)  3.x  =  3  (mod  6) 

Solution.  Simple  congruences  like  these  with  small  moduli  are  probably  best 
solved  by  trying  all  possible  values  of  x  mod  n. 

(a)  If  *  =  0,  3x  =  0  =  0  (mod  5). 

If  x  =  1,  3x  —  3  =  3  (mod  5). 

If  x  =  2,  3x  =  6  s  1  (mod  5). 

If  x  —  3,  3.x  —  9  =  4  (mod  5). 

If*  =  4,  3jc  =  12  =  2  (mod  5). 


Since  the  modulus  is  5,  we  want  x  in  the  range  0  <  x  <  5.  Thus,  the  only 
solution  to  the  congruence  is  x  =  2. 


^Remember  the  easy  way  to  see  this.  Use  a  calculator  to  compute  =  1 23.68  . . .  ,  The  integer  part  of 
this  number  (the  floor  of  )  is  the  quotient  given  by  the  Division  Algorithm.  Refer  to  Proposition  4. 1 .6. 
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(b)  There  is  no  solution  to  this  congruence  because  the  values  of  3x  (mod  6) 
are  just  0  and  3: 

3(0)  =  0,  3(1)  =  3,  3(2)  =  6e0, 

3(3)  =  9  =  3,  3(4)  =  12  =  0,  3(5)  =15  =  3. 

(c)  The  calculations  in  part  (b)  show  that  3jc  =  3  (mod  6)  has  solutions  x  —  l, 

x  —  3,  and  x  —  5.  I 

Suppose  we  want  to  find  the  sum  1017  +  2876  (mod  7).  This  can  be  accom¬ 
plished  in  two  ways.  We  could  evaluate  1017  +  2876  =  3893  =  1  (mod  7), 
but  equally,  we  could  reduce  the  integers  1017  and  2876  modulo  7  first  like 
this: 


1017  =  2  (mod  7) 

2876  =  6  (mod  7) 

1017  +  2876  =  2  +  6  =  8  =  1  (mod  7). 

The  second  approach  is  particularly  useful  when  forming  products  since  it 
keeps  the  numbers  involved  small.  Observe: 

(101 7)(2876)  =e  (2) (6)  =  12  =  5  (mod  7). 

Computing  powers  of  an  integer  modulo  a  natural  number  n  is  often  just 
a  mental  (rather  than  calculator)  exercise  if  we  continually  work  mod  n.  For 
example, 

(1017)2  =  22  =  4  (mod  7) 

( 101 7)3  =  (101 7)2(  1017)  s  4(2)  =  8  =  1  (mod  7) 

(1017)4  =  (101 7)3  ( 1017)  =  1(2)  =  2  (mod  7) 

( 101 7)5  =  ( 1017)4(1017)  =  2(2)  =  4  (mod  7) 
and  so  on.  Here  is  an  easy  way  to  compute  (1017)  . 

(1017) 12  =  (( 1017)4)3  =  23  =  8  =  1  (mod  7). 

The  following  proposition  describes  the  general  principles  which  guarantee 
that  the  sorts  of  calculations  we  have  been  performing  are  valid. 


4.4.7  PROPOSITION 


If  a  =  x  (mod  n)  and  b  =  y  (mod  n ),  then 

(a)  a  +  b  s  x  +  y  (mod  n)  and 

(b)  ab  =  xy  (mod  n). 


Proof  Direct  proofs  of  each  part  are  suggested,  and  these  we  provide. 

(a)  We  have  to  check  that  (a  +b)  —  (x  +  y)  is  divisible  by  n.  This  difference 
is  ( a  —  x)  +  (b  —  y),  which  is  the  sum  of  two  numbers  each  divisible  by 
n,  so  the  difference  is  itself  divisible  by  n. 
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(b)  We  have  to  check  that  ab  —  xy  is  divisible  by  n.  Subtracting  and  adding 
ay,  we  notice  that 

ab  —  xy  =  ab  —  ay  +  ay  —  xy  =  a(b  —  y)  +  (a  —  x)y, 

each  term  on  the  right  again  being  divisible  by  n.  Thus,  ab  —  xy  is 
divisible  by  n.  I 


PROBLEM  23 

and  3  |  b. 

Solution.  We 

false,  then  a  = 


For  examp 
(mod  3)  becauk 
6x  =  3  (mod 
second  congrui: 
multiplying  a  c 
the  congruence) 
(mod  9)  are  di 


PROBLEM  24 


Suppose  a  and  b  are  integers  and  3  |  (a2  +  b2).  Show  that  3  |  a 

wish  to  prove  that  a  =  0  (mod  3)  and  b  =  0  (mod  3).  If  this  is 
1  or  2  (mod  3)  and  so,  by  part  (b)  of  Proposition  4.4.7,  a2  =  1 
or  4  (mod  3).  Since  4  =  1  (mod  3),  we  must  have  a2  =  1  (mod  3).  Similarly, 

b2  =  1  (mod  3)  and  so,  by  (a)  of  the  proposition,  a2  +  b2  =  1  +  1=2  (mod  3), 

a  contradiction 

While  addition  and  multiplication  of  congruences  behave  as  we  would  hope, 
we  must  be  exceedingly  careful  when  dividing  each  side  of  a  congruence.  Dividing 
each  side  of  the  congruence  30  =  12  (mod  9)  by  3,  for  example,  produces  the 
false  statement  10  =  4  (mod  9).  In  general,  we  can  only  divide  a  congruence 
by  an  integer  tyhich  is  relatively  prime  to  the  modulus,  as  asserted  by  the  next 
proposition.  (We  leave  its  proof  to  the  exercises.) 

If  ac  =  be  (mod  n )  and  gcd(c,  n)  —  1,  then  a  =  b  (mod  n). 


e,  given  28  =  10  (mod  3),  we  can  divide  by  2  and  obtain  14  =  5 
,e  gcd(2,  3)  =  1 .  The  linear  congruences  2x  =  1  (mod  7)  and 
})  have  the  same  solutions  since  we  can  divide  each  side  of  the 
nee  by  3,  this  being  relatively  prime  to  7.  On  the  other  hand, 
congruence  by  a  number  not  relatively  prime  to  the  modulus  changes 
For  example,  the  solutions  to  2x  =  1  (mod  9)  and  6x  =  3 
liferent! 


Solve  2x  =  1  (mod  9)  and  6x  =  3  (mod  9). 


Solve  each  of  the  following  pairs  of  congruences,  if  possible. 


If  no  solution  exists,  explain  why  not. 

(a)  2x  +  3y  e=  1  (mod  6) 

x  +  3y  ee  4  (mod  6) 

(b)  2x  +  3y  e=  1  (mod  6) 

v  +  3 y  e=  5  (mod  6) 

Solution.  We  solve  simple  systems  of  linear  congruences  by  the  ad  hoc  methods 
with  which  we  first  solved  systems  of  linear  equations,  by  adding  and  subtracting 
and  occasionally  multiplying,  though  only  by  numbers  relatively  prime  to  the 
modulus  (for  a  reason  illustrated  by  Pause  13). 

(a)  Adding  the  two  congruences  gives  3x  +  6y  =  5  (mod  6).  Since  6y  =  0 
(mod  6),  we  have  3x  =  5  (mod  6).  This  congruence  has  no  solution  because 
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4.4.9  PROPOSITION 


Proof 


the  values  of  3x  (mod  6)  are  0  and  3.  Thus,  no  x ,  y  satisfy  the  given  pair 
of  congruences. 

(b)  This  time,  adding  the  two  congruences  gives  3jc  =  0  (mod  6)  and  hence 
x  =  0,  x  =  2  or  x  =  4  (mod  6).  If  x  =  0,  then  the  second  congruence 
says  3y  =  5  (mod  6)  to  which  there  is  no  solution.  If  x  =  2,  the  second 
congruence  says  2  +  3 y  =  5,  hence  3y  =  3  (mod  6),  so  y  =  1  (mod  6), 
y  =  3  (mod  6)  or  y  =  5  (mod  6).  If  x  =  4,  the  second  congruence  reads 
4  +  3y  =  5,  so  3v  =  1  (mod  6)  and  there  is  no  solution.  The  pair  of 
congruences  has  three  solutions:  x  =  2  (mod  6)  and  y  =  1  (mod  6),  y  =  3 
(mod  6)  or  y  =  5  (mod  6).  I 

Every  integer  a  has  an  additive  inverse  modulo  n,  that  is,  there  exists  an 
integer  x  satisfying  a  +  x  =  0  (mod  «);  for  example,  take  x  =  —a  or  x  =  n  —  a 
to  respect  the  convention  in  4.4.6  above.  The  existence  of  additive  inverses  means 
that  congruences  of  the  form  a  +x  =  b  (mod  n)  always  have  a  solution  obtained 
by  adding  the  additive  inverse  of  a  to  each  side:  x  =  b  +  (—  a)  =  b  —  a.  On  the 
other  hand,  not  every  congruence  of  the  form  ax  =  b  (mod  n)  has  a  solution. 

The  congruence  3x  =  b  (mod  6)  may  not  have  a  solution  (for  example,  when 
b  =  1).  Contrast  this  with  the  congruence  3 x  =  b  (mod  7),  which  has  a  solution 
for  any  b  since  each  of  the  integers  0,  1, 2,  3,  4,  5,  6  is  3x  (mod  7)  for  some  x: 

3(0)  =  0,  3(1)  =  3,  3(2)  =  6,  3(3)-9s2, 

3(4)  =12  =  5,  3(5)  =  15  =  1,  3(6)  =  18  =  4  (mod  7). 

A  solution  to  3x  =  1  (mod  7),  for  instance,  is  x  =  5.  The  difference  between 
3x  =  b  (mod  6)  and  3x  =  b  (mod  7)  is  the  modulus.  In  the  second  case,  3  is 
relatively  prime  to  the  modulus,  7,  whereas  in  the  first  case,  it  is  not. 

Let  n  >  1  be  a  natural  number  and  let  a  be  an  integer  with  gcd(o,  n)  =  1. 

(a)  There  exists  an  integer  s  such  that  sa  =  1  (mod  n).  [We  call  s  a  multi¬ 
plicative  inverse  of  a  (mod  ft).] 

(b)  For  any  integer  b.  the  congruence  ax  =  b  (mod  n )  has  a  solution. 

(c)  The  solution  to  ax  =  b  (mod  n )  is  unique  mod  n  in  the  sense  that  if 

ax\  =  b  (mod  n)  and  ax 2  =  b  (mod  n),  then  x\  =  xi  (mod  n). 

(a)  By  Theorem  4.2.9,  the  greatest  common  divisor  of  two  integers  is  a  linear 
combination  of  them.  Here,  since  gcd(a,»)  =  1,  we  know  there  are 
integers  s  and  t  such  that  sa  +  tn  =  1  .  Since  sa  —  1  is  divisible  by  n, 
sa  —  1  (mod  /;)• 

(b)  By  part  (a),  we  know  that  a  has  an  inverse  .v  (mod  n).  Multiplying  the 
congruence  ax  =  b  by  s,  we  have  sax  =  sb  (mod  n)  and  so  x  =  sb.  It 
is  straightforward  to  verify  that  x  =  sb  is  indeed  a  solution  to  ax  =  b 
(mod  n )  since  a{sb)  =  ( as)b  =  1(b)  =  b  (mod  n ). 

(c)  Uniqueness  follows  directly  from  Proposition  4.4.8.  Since  gcd(r/,  n)  —  1, 

if  ax  1  =  0x2  (mod  «),  then  x\  =  X2  (mod  n).  I 

It  is  useful  to  observe  that  our  proof  [in  part  (b)]  that  ax  =  b  (mod  n)  has 

a  solution  which  essentially  repeats  the  steps  by  which  we  solve  the  equation 
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ax  =  b  in  real  numbers.  To  solve  ax  =  b,  with  a,  b  real  numbers  and  a  ^  0,  we 
multiply  both  sides  by  the  (multiplicative)  inverse  of  a  to  obtain  x  =  a~]b.  We 
solve  the  congruence  ax  =  b  (mod  n)  exactly  the  same  way.  The  only  problem 
is  to  find  the  inverse  of  a  (mod  n),  if  this  exists. 

PROBLEM  25.  Solve  the  congruence  20x  =101  (mod  637). 

Solution.  We  use  the  method  described  in  the  proof  of  Proposition  4.4.9.  Write 
-7(637)  +  223(20)  =  1  and  obtain  223(20)  =  1  (mod  637).  Thus,  223  is  the 
inverse  of  20  (mod  637).  Multiplying  each  side  of  the  given  congruence  by  223 
gives  x  =  223(101)  =  22,523  =  228  (mod  637).  Thus,  x  =  228  provides  a  solu¬ 
tion  and  one  which  is  unique  mod  637:  Any  other  solution  is  congruent  mod  637 
to  228.  I 


PROBLEM  26.  Part  (a)  of  Proposition  4.4.9  says  that  if  gcd(o,  n)  =  1,  then  a 
has  an  inverse  mod  n.  Show  that  the  converse  is  also  true:  If  a  has  an  inverse 
mod  n ,  then  gcd(a,  n)  =  1. 

Solution.  If  a  has  an  inverse  mod  n,  there  is  an  integer  s  such  that  sa  =  1 
(mod  n),  so  sa  —  1  =  qn  for  some  integer  q.  Writing  this  as  1  =  sa  -  qn,  it 
follows  that  any  natural  number  which  divides  both  n  and  a  must  divide  1  and 
hence  be  1.  So  gcd (a.n)  —  1.  I 


4.4.10  FERMAT'S 
LITTLE  THEOREM 


As  another  application  of  Proposition  4.4.8,  we  derive  Fermat’s  Little  Theorem. 
What  was  Fermat’s  “big”  theorem? 


If  p  is  a  prime  and  p  Jf  c,  then  cp  1  =  1  (mod  p). 


Proof  We  have  gcd(c,  p)  —  1.  Thus,  by  Proposition  4.4.8,  no  two  of  the  integers  c,  2c, 
. . .  ,  {p  —  l)c  are  congruent  mod  p.  The  same  proposition  also  shows  that  none 
of  the  elements  c,2c, . . .  ,  (p  —  l)c  is  0  (mod  p).  Thus,  modulo  p ,  the  p  —  1 
integers  c,  2c, . . .  ,  (p  —  l)c  are  precisely  1,2,...  ,  p  —  1,  in  some  order.  Thus, 

c  ■  2c  ■  3c  •  (p  —  l)c  =  1  •  2  •  3  •  •  •  (p  —  1)  (mod  p). 

Letting  x  =  1  ■  2  •  3  •  •  •  ip  —  1  ),7  this  equation  reads 

xcp~'  —  x  (mod  p) 

and,  since  gcd (p,x)  =  1,  we  can  cancel  x  and  obtain  cp~]  =  1,  as  required.  I 

Because  of  Fermat’s  Little  Theorem,  none  of  the  congruences  22  =  1  (mod  3), 
46  =  1  (mod  7),  910  =  1  (mod  11)  are  surprises.  Similarly,  since  13,331  is 
prime,  Fermat’s  Little  Theorem  shows  413’330  =  1  (mod  13,331),  413-331  =  4 
(mod  13,331),  and  413-332  =  16  (mod  13,331). 


7The  product  of  all  the  integers  from  I  to  n  inclusive  is  commonly  denoted  n\ — read  “n  factorial” — so 
we  could  also  write  ( p  —  1)1  instead  of  x.  See  Definition  5.1.2. 
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Answers  to  Pauses 


Explain  the  last  two  congruences  here. 

12.  An  integer  a  is  congruent  (mod  18)  to  its  remainder  when  it  is  divided  by 
18.  Since  =  219.8,  J  =  219  and  we  find  3958  =  219(18)  +  16. 
The  remainder  is  16,  so  3958  =  16  (mod  18).  Similarly,  |_— =  —220, 
so  -3958  =  -220(18)  +  2  and  -3958  =  2  (mod  18). 

13.  2.x  =  1  (mod  9)  implies  x  =  5,  but  6r  s  3  (mod  9)  implies  x  =  2,  x  =  5 
or  x  =  8. 

14.  Theorem  4.3.14. 

15.  4 13,331  =  4(413,330)  =  4(1)  =  4  (mod  13,331); 

413,332  =  4(4)(413.330)  =  16(1)  =  16  (mod  13,331),  using  Fermat’s  Little 
Theorem  in  each  case  to  deduce  that  413,330  =  1  (mod  13,331). 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  This  question  concerns  congruence  mod  7. 

(a)  List  three  positive  and  three  negative  integers  in  5 
and  in  —3. 

(b)  What  is  the  general  form  of  an  integer  in  5  and  of 
an  integer  in  -3  ? 

2.  This  question  concerns  congruence  mod  13. 

(a)  List  four  positive  and  five  negative  integers  in  3  and 
in  —2. 

(b)  What  is  the  general  form  of  an  integer  in  3  and  of 
an  integer  in  —2? 

3.  Find  a  (mod  n)  in  each  of  the  following  cases. 

(a)  [BB]  a  =  1286,  n  =  39 

(b)  a  =  43,197,  n  =  333 

(c)  [BB]  a  =  -545,608,  n  =  51 

(d)  a  =  -125,617,  n  =  315 

(e)  a  =  11,  111,  111,  111,  n=  1111 

4.  True  or  false?  Give  a  reason  for  each  answer. 

(a)  [BB]  2  =  T8  (mod  10) 

(b)  7  G  — 13  (mod  5) 

(c)  [BB]  -8  =  44  (mod  13) 

(d)  With  respect  to  congruence  mod  29,  17  D  423  =  0 

(e)  — 18  ^  400  (mod  19) 

5.  List  all  congruence  classes  giving  the  most  usual  and 
one  other  name  for  each. 

(a)  [BB]  congruence  mod  3 

(b)  congruence  mod  5 

(c)  congruence  mod  8 


6.  Carry  out  each  of  the  indicated  calculations,  giving  the 
answer  mod  n. 

(a)  [BB]  21,758,623  +  17,123,055,  n  =6 

(b)  (21,758,623X17,123,055),  n  =  6 

(c)  (17, 123)50,  n  =  6 

(d)  104,  108,  10'2,  1020,  1024,  n  =  7 

(e)  [BB]  2,  22,  23,  . . .  ,  210,  n  =  1 1 

(f)  4,  42,  43,  44,  45,  46.  47,  48,  49,  410,  n  =  1 1 

7.  Find  a  +  b  (mod  n),  ab  (mod  n)  and  (a+b)2  (mod  n) 
in  each  of  the  following  situations: 

(a)  a  =  4003,  b  =  -127,  n  =  85 

(b)  a  =  17,891,  b  =  14,485,  n  =  143 

(c)  a  -  -389,221,  b  =  123,450,  n  =  10,000 

8.  [BB]  If  a  G  Z  and  a  ^  0  (mod  7),  show  that  a  =  5k 
(mod  7)  for  some  integer  k. 

9.  Find  all  integers  x,  0  <  x  <  n,  satisfying  each  of  the  fol¬ 
lowing  congruences  mod  n.  If  no  such  x  exists,  explain 
why  not. 

(a)  [BB]  3x  =  4  (mod  n),  n  =  6 

(b)  4x  =  2  (mod  /?),  n  =  6 

(c)  4x  =  3  (mod  n),  n  =  7 

(d)  4.x  =  3  (mod  n),  n  =  6 

(e)  [BB]  2x  =  18  (mod  n ),  n  =  50 

(f)  [BB]  5x  =  1  (mod  w),  n  =  11 

(g)  5x  =  5  (mod  n),  n  =  25 

(h)  4.x  =  301  (mod  n ),  n  =  592 

(i)  65x  =  27  (mod  n),  n  =  169 

(j)  4x  =  320  (mod  n),  n  =  592 
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(k)  [BB]  16x  =  301  (mod  n),  n  =  595 

(l)  79x  =  15  (mod  n),  n  =  722 

(m)  155x  =  1 185  (mod  «),  n  =  1404 

10.  (a)  Given  integers  a,  b,  c,  d ,  x,  and  a  prime  p,  suppose 

(ax-\~b)(cx+d)  =  0  (mod  p ).  Prove  thatax+6  =  0 
(mod  p)  or  cx  +  d  =  0  (mod  p). 

(b)  Find  all  integers  x,  0  <  x  <  n,  which  satisfy  each 
of  the  following  congruences.  If  no  x  exists,  explain 
why  not. 

i.  [BBJ  x2  =  4  (mod  n),  n  =  13 

ii.  (2x  +  l)(3x  +  4)  =  0  (mod  n),  n  =  17 

iii.  3x2  +  14x  —  5  =  0  (mod  n),  n  =97 

iv.  [BB]  x2  =  2  (mod  n),  n  =  6 

v.  x1  =  —2  (mod  n),  n  =  6 

vi.  4x2  +  3x  +  7  =  0  (mod  «),  «  =  5 

11.  Find  all  integers  x  and  y,  0  <  x,  y  <  n,  which  satisfy 
each  of  the  following  pairs  of  congruences.  If  no  x,  y 
exist,  explain  why  not. 


(a) 

[BB] 

2x 

+  y  =  1  (mod  n) 

n 

x  +  3y  =  3  (mod  n) 

(b) 

[BB] 

X 

+  5y  =  3  (mod  n) 

ii  - 

4x 

+  5y  =  1  (mod  n ) 

(c) 

x  + 

5y 

s  3  (mod  n)  n  = 

8 

4x  + 

=  1  (mod  n) 

(d) 

lx  + 

2y 

=  3  (mod  n)  n  = 

15 

9x  + 

4  y 

—  6  (mod  n) 

(e) 

3x  + 

5  y 

=  14  (mod  n)  n  - 

=  28 

5x  +  9y  s  6  (mod  n) 


12.  [BB]  Prove  Proposition  4.4.8. 

13.  [BB]  If  a  =  b  (mod  n),  show  that  gcd(a,n)  = 
gcd (b,  n). 

14.  Let  a  and  n  be  natural  numbers  with  n  >  1.  Let  r  and  .v 
be  integers  and  suppose  r  =  s  (mod  n).  True  or  false? 
Explain. 

(a)  [BB]  ar  =  as  (mod  n) 

(b)  ra  =  sa  (mod  n) 

15.  Suppose  a  and  b  are  integers,  n  >  1  is  a  natural  number, 
and  a  =  b  (mod  n).  True  or  false?  In  each  case,  prove 
or  give  a  counterexample. 

(a)  [BB]  3 a  =  b2  (mod  n) 

(b)  a 2  =  b 2  (mod  n) 

(c)  a 2  =  b3  (mod  n) 

(d)  a 2  s  b2  (mod  n2) 


16.  (a)  [BB]  Use  the  result  of  Problem  23  to  prove  that  \fl 

is  irrational;  that  is,  show  that  it  is  impossible  to 
write  Jl  =  |  for  integers  a,  b.  [Hint:  Without  loss 
of  generality,  assume  that  a  and  b  have  no  com¬ 
mon  factors,  other  than  ±1.  Why  is  there  no  loss  of 
generality?] 

(b)  Let  n  be  a  natural  number,  n  =  2  (mod  3).  Use 
Problem  23  to  prove  that  -/n  is  irrational. 

17.  (a)  Find  all  integers  x,  0  <  x  <  n,  which  satisfy  each 

of  the  following  congruences. 

i.  x2  =  1  (mod  n),  n  =  5 

ii.  x2  =  1  (mod  n),  n  =  1 

iii.  x2  =  1  (mod  n),  n  =  13 

(b)  [BB]  Suppose  p  ^  2  is  a  prime.  Find  all  integers  x, 
0  <  x  <  p,  such  that  x2  =  1  (mod  p). 

18.  (a)  Find  all  integers  x,  0  <  x  <  n,  which  satisfy  each 

of  the  following  congruences. 

i.  [BB]  x2  =  1  (mod  n),  n  =  52 

ii.  x2  s  1  (mod  n),  n  =  53 

iii.  x2  =  1  (mod  n),  n  =  72 

(b)  Suppose  p  /  2  is  a  prime  and  k  is  any  natural 
number.  Find  all  integers  x,  0  <  x  <  pk ,  which 
satisfy  x2  =  1  (mod  pk). 

19.  (a)  Find  all  integers  x,  0  <  x  <  n,  which  satisfy  each 

of  the  following  congruences. 

i.  x2  =  1  (mod  n),  n  =  2 

ii.  x2  =  1  (mod  n),  n  =  22 

iii.  x2  =  1  (mod  n ),  n  =  2? 

iv.  x2  =  1  (mod  n),  n  =  24 

(b)  Let  k  be  a  natural  number.  Find  all  integers  x,  0  < 
x  <  2k,  which  satisfy  x2  =  1  (mod  2k). 

20.  In  each  of  the  following,  the  given  integer  p  is  a  prime. 

(a)  [BB]  Find  1 88970,  188971  and  188972,  each  mod  p, 
p  =  897 1 . 

(b)  Find  5320,592,  5320,593,  5320,594,  each  mod  p,  p  = 
20,593. 

(c)  Find  3508,  3509,  3512,  each  mod  p,  p  =  509. 

(d)  Find  610,588,  610'589,  610-594,  each  mod  p,  p  = 
10,589. 

(e)  Find  84056  ,  84058  ,  84060,  each  mod  p,  p  =  4057. 

(f)  Find  23948  and  23941  (mod  p),  p  =  3943. 

21.  [BB]  Show  that  x97  —  x  +  1  =0  (mod  97)  has  no  solu¬ 
tions. 

22.  Let  A  be  the  set  of  congruence  classes  of  integers  mod¬ 
ulo  some  natural  number  n.  For  a,  b  e  A,  define  a  <  b  if 
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ab  =  a 2  (mod  n).  Prove  or  disprove  that  ■<  is  a  partial 
order  in  each  of  the  following  cases. 

(a)  n  =  p  is  a  prime. 


(b)  n  =  pq  is  the  product  of  two  distinct  primes. 

(c)  n  is  divisible  by  the  square  of  a  prime.  [Mint:  It 
might  be  helpful  first  to  consider  the  case  n  =  12.] 


4.5  Applications  of  Congruence 

international  Standard  Book  Numbers 

Since  1968,  most  published  books  have  been  assigned  a  ten-digit  number  called 
the  International  Standard  Book  Number — abbreviated  ISBN—  which  identifies 
the  country  of  publication,  the  publisher,  and  the  book  itself.  In  fact,  all  relevant 
information  is  stored  in  the  first  nine  digits;  the  tenth  digit  is  a  check  digit  whose 
sole  purpose  is  to  give  us  confidence  that  the  first  nine  digits  are  correct. 

When  a  university  department  wishes  to  place  an  order  for  textbooks  for  a 
forthcoming  semester,  it  is  common  for  each  faculty  member  to  send  to  some 
administrative  person  the  ISBNs  of  the  books  he  or  she  will  use  and  then  for  a 
single  list  of  all  desired  ISBNs  to  be  produced  and  sent  to  the  bookstore.  There 
lists  from  many  departments  are  collected,  grouped  in  various  ways,  and  sent  to 
publishers.  It  is  not  hard  to  see  that  there  are  many  opportunities  for  numbers  to 
be  copied  incorrectly.  Thus,  some  easy  way  to  encourage  accuracy  is  essential.  If 
the  digits  of  an  ISBN  are  denoted  a\ ,  02, . . .  ,  aio,  with  the  first  nine  in  the  range 
0-9,  then  a\o  is  chosen  in  the  range  0-10  so  that 

(6)  a\  +  2a2  +  303  +  •  •  •  -1-  9aq  +  lOajo  =  0  (mod  1 1). 

If  am  happens  to  be  10,  it  is  recorded  as  an  X. 

For  example,  if  the  first  nine  digits  are  0-936031-03,  the  tenth  digit  is  chosen 
so  that 

1(0) +  2(9) +  3(3) +  4(6) +  5(0) 

+  6(3) +7(1) +  8(0) +9(3)+  10aio  =  0  (mod  11); 

thus,  103  +  lOaio  =  0  (mod  11),  so  a\o  —  4.  The  ISBN  would  be  recorded  as 
0-936031-03-4.  If  this  number  were  copied  to  another  list  with  an  error  in  the  fourth 
digit,  say  as  0-935031-03-4,  a  computer  could  easily  check  that 

1(0)  +  2(9)  +  3(3)  +  4(5)  +  5(0)  +  6(3)  +  7(1)  +  8(0)  +  9(3)  +  10(4) 

=  139s  7^0  (mod  11) 

and  hence  the  existence  of  a  mistake  would  come  to  light.  Note  that  it  would 
be  necessary  to  check  previous  lists  to  determine  the  precise  error.  Obtaining  7 
instead  of  0  as  the  result  of  our  calculation  provides  no  clue  as  to  which  digit 
was  in  error.  For  instance,  the  number  0-936031-53-4  also  differs  from  the  correct 
ISBN  in  one  digit  and  for  it  we  again  have 

1  (0)  +  2(9)  +  3(3)  +  4(6)  +  5(0)  +  6(3) 


+  7(1)  +  8(5)  +  9(3)  +  10(4)  =  183  =  7  (mod  11). 
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We  can  show  that  the  test  given  in  (6)  always  detects  errors  in  single  digits 
(see  Exercise  5),  although  it  is  quite  possible  for  errors  in  two  digits  to  cancel 
each  other  and  so  to  go  undetected. 

Change  two  digits  in  0-936031-03-4  so  that  a\  +  laj  +  3«3  -j - h9ag  +  1 0a i o  =  0 

(mod  11)  (and  hence  the  changes  would  not  be  detected  by  our  test). 

On  the  other  hand,  when  a  number  is  copied,  a  common  error  is  for  consecu¬ 
tive  digits  to  be  transposed:  We  might  dial  the  phone  number  754-3628  instead  of 
754-3268,  for  instance. 

PROBLEM  27.  Show  that  the  test  in  (6)  detects  transpositions  of  consecutive 
digits. 

Solution.  If  a  i«2  •  •  •  £  10  is  a  correct  ISBN  number,  then 

a  =  a\  +  lci2  +  •  •  •  +  idj  +  (i  +  \  )cii+\  +  ■  ■  ■  +  lOaio  =  0  (mod  11). 

Suppose  that  the  digits  a,  and  a,+\  are  transposed.  We  claim  that  the  miscopied 
number  a\a2  ■  ■  •  a,  \  \ a,  ■  ■  ■  a\ o  does  not  satisfy  (6).  To  see  this,  let 

b  —  a\  H-  2<32  +  ■  ■  •  +  idi+i  +  ( i  +  1)<2;  +  •  •  •  +  lOcrio 

and  note  that  a  —  b  =  i(aj  —  a,+i)  +  (/  +  l)(fl/+i  —  a,)  =  a/+i  —  a,.  Since 
0  <  a,-,  al  +  \  <  9,  the  difference  o!  +  i  -a,-  cannot  be  0  (mod  1 1)  (unless  a,  =  a,  +  i , 
in  which  case  there  was  no  transposition  error)  and  so  a  ^  b  (mod  1 1 )  and  b  =£  0 
(mod  11).  The  miscopied  number  does  not  pass  the  test.  I 

We  conclude  this  section  by  observing  that  the  test  given  in  (6)  can  be  abbre¬ 
viated  by  writing  it  in  the  form  w  ■  a  =  0  (mod  11),  where  a  is  the  vector 
(a i,  «2>  •  •  •  ,  «io)»  w  the  weight  vector  (1,2,...  ,  10)  and  •  denotes  dot  product. 
For  readers  unfamiliar  with  the  concepts  of  vector  and  dot  product,  a  vector  is  an 
n-tuple  of  numbers,  (fl| ,  02.  •  •  •  ,  an)  (n  can  be  any  natural  number),  and  the  dot 
product  of  a  =  (ai ,  <J2»  •  -  -  ,  «n)  and  b  =  (b\ ,  &2,  •  •  •  .  bn)  is  the  number 

a  •  b  =  a\b\  +  ^£2  H - V  a„bn. 


These  notions  are  not  critical  to  an  understanding  of  this  section  but  useful,  per¬ 
haps,  to  students  with  some  linear  algebra  background. 

Universal  Product  Codes 

Check  digits  permeate  today’s  society.  Besides  forming  part  of  an  ISBN,  they  are 
attached  to  identification  numbers  of  airline  tickets,  money  orders,  credit  cards, 
bank  accounts,  drivers’  licenses,  and  most  items  found  in  stores. 

Figure  4.4  shows  the  kind  of  universal  product  code  (UPC)  with  which  North 
American  consumers  are  familiar.  Most  goods  for  sale  today  can  be  identified 
uniquely  by  a  special  number  called  a  universal  product  number.  A  universal 
product  code  is  a  way  to  represent  a  universal  product  number  as  a  pattern  of 
black  and  white  stripes  of  various  thicknesses.  For  obvious  reasons,  a  universal 
product  code  is  also  called  a  bar  code.  (Later  we  shall  see  how  this  code  works.) 
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64200 


1  15  8  9 


Figure  4.4  A  universal  product  code. 

The  universal  product  numbers  we  want  to  discuss  are  12-digit  numbers  of 
the  form  x-xxxxx-xxxxx-x,  where  each  x  stands  for  a  single  digit  between  0  and 
9;  for  example,  0-12345-67890-1.  The  last  digit  is  a  check  digit  which  serves  to 
affirm  (with  some  degree  of  confidence)  the  accuracy  of  the  preceding  eleven.  For 
example,  the  universal  product  number  shown  in  Fig  4.4  is  0-64200-11589-6  and  6 
is  the  check  digit.  It  is  not  unusual  for  a  scanning  device  to  misread  a  bar  code. 
The  use  of  a  check  digit,  therefore,  provides  a  way  to  alert  the  cashier  that  the 
number  must  be  read  again. 

The  check  digit  is  determined  by  the  rule 

(7)  3(sum  of  digits  in  odd  positions) 

+  (sum  of  digits  in  even  positions)  =  0  (mod  10). 
For  the  number  encoded  by  the  bar  code  in  Fig  4.4,  for  instance,  we  have 
sum  of  digits  in  odd  positions  =  0  +  44-0+1  +  5  +  9=  19 

sum  of  digits  in  even  positions  =  6  +  2  +  0 +1+8  +  6  =  23 
and  3(19)  +  23  =  80  =  0  (mod  10). 

As  with  ISBNs,  the  rule  for  determining  the  check  digit  of  a  universal  product 
number  can  be  phrased  in  the  language  of  vectors.  If  a\-a2  ■  ■  ■  a^-a-j  ■  ■  ■  a\\-ci\2 
is  a  universal  product  number  and  a  denotes  the  vector  {a\ ,  a2,  ■  ■  ■  ,  a  12),  then  (7) 
just  says  w  •  a  =  0  (mod  10)  for  a  certain  weight  vector  w. 

What  is  w? 

Finally,  we  consider  the  nature  of  a  bar  code  itself.  Notice  that  the  bar  code 
in  Fig  4.4  begins  and  ends  with  a  black-white-black  sequence  of  thin  stripes  and 
is  separated  into  distinct  halves  by  a  white-black-white-black-white  sequence  of 
thin  stripes.  In  each  half  of  the  code,  each  of  the  digits  0-9  corresponds  to  a 
sequence  of  four  stripes  of  varying  thicknesses.  In  the  left  half,  the  pattern  is 
white-black-white-black;  in  the  right  half,  it  is  black-white-black-white. 

Using  0  to  denote  a  thin  white  stripe  and  00,  000,  0000  to  denote  increasingly 
thicker  white  stripes  and,  similarly,  using  1  to  denote  a  thin  black  stripe  and  1 1 , 
111,  1 1 1 1  to  denote  increasingly  thicker  black  stripes.  Table  4.2  shows  the  right 
and  left  stripe  sequences  to  which  each  digit  in  the  range  0-9  corresponds.  For 
example,  on  the  left,  6  is  encoded  0101  111,  that  is,  by 

thin  white  (0) - thin  black  (1) - thin  white 

(0) - thickest  black  (1111). 


4.5  Applications  of  Congruence  139 


Table  4.2  The  pattern  of  stripes 
used  to  encode  each  of  the  digits 
0-9  on  the  left  and  right  sides  of  a 
bar  code. 


Encoding 


Digit 

On  left  side 

On  right  side 

0 

0001101 

1110010 

1 

0011001 

1100110 

2 

0010011 

1101100 

3 

0111101 

1000010 

4 

0100011 

1011100 

5 

0110001 

1001110 

6 

0101111 

1010000 

7 

0111011 

1000100 

8 

0110111 

1001000 

9 

0001011 

1110100 

Note  that  the  stripe  sequence  which  encodes  a  digit  on  the  right  can  be 
obtained  from  the  stripe  sequence  on  the  left  by  interchanging  black  and  white. 
(Also  note  that  there  is  no  connection  between  the  sequences  used  to  encode  a 
digit  and  the  base  2  representation  of  that  digit.) 

Only  for  convenience  of  the  consumer  is  the  universal  product  number  some¬ 
times  written  beneath  the  sequence  of  stripes  which  encodes  it.  The  scanning 
device  used  by  the  cashier  to  read  the  code  is,  of  course,  only  able  to  detect  light 
and  dark  stripes. 

How  does  such  a  device  know  whether  a  bar  code  has  been  passed  over  it 
from  left  to  right  or  from  right  to  left?  Suppose  the  first  four  stripes  read  by  the 
scanner  are  0011101.  At  first  thought,  we  might  assume  that  this  was  the  code 
for  a  digit  on  the  left  (after  all,  it  begins  with  a  white  stripe),  but  it  could  also 
be  the  code  for  a  digit  on  the  right  read  backward  (since  it  ends  with  a  black 
stripe).  Notice,  however,  that  there  is  no  digit  whose  left  code  is  0011101;  thus, 
the  scanner  has  read  from  right  to  left  and  the  last  digit  in  the  product  number 
has  been  coded  1011 100  (the  digits  of  0011101  in  reverse);  the  last  digit  in  the 
product  number  must  be  4.  Since  the  sequence  of  stripes  used  to  encode  a  digit, 
when  taken  in  reverse  order,  is  not  the  sequence  of  any  other  encoding,  the  scanner 
always  knows  the  direction  in  which  it  is  reading  the  code. 

There  is  a  simpler  way  to  tell  whether  the  code  is  being  read  left  to  right  or 
right  to  left.  Can  you  find  it?  (Study  the  sum  of  the  digits  in  the  codes  given  in 
Table  4.2.) 


We  refer  readers  who  wish  to  learn  more  about  check  digits  to  a  very  inter¬ 
esting  article  by  Joseph  A.  Gallian  which  appeared  in  1991  in  an  issue  of  The 
College  Mathematics  Journal ,8 

The  Chinese  Remainder  Theorem 

The  notion  of  congruence  is  so  basic  and  useful  that  some  people  just  have  to 
spread  the  word.  In  the  lunch  room  the  other  day,  a  discrete  math  student  (DMS) 
boasted  to  his  friend  as  follows: 

DMS:  Calculators  are  neat,  but  there  are  lots  of  ways  to  work  with 
numbers  without  having  to  use  one.  The  trick  is  to  keep  the  numbers 
small  by  working  with  remainders. 

Friend:  What  do  you  mean? 

DMS:  I’  m  thinking  of  a  three  digit  number.  When  I  divide  it  by  12,  I 
get  a  remainder  of  4. 

Friend:  So  what? 

DMS:  When  I  divide  the  same  number  by  25, 1  get  a  remainder  of  15. 
Friend:  So  what? 

DMS:  What  was  my  number?  I’ll  give  you  a  hint.  It’s  bigger  than  200 
but  smaller  than  500. 

Friend  (10  minutes  later):  I  give  up. 

DMS:  It’s  easy.  I’ll  show  you  how  to  do  it.  {See  Exercise  19.) 

We  conclude  this  section  by  considering  certain  systems  of  simultaneous  con¬ 
gruences.  The  system 

x  =  1  (mod  4) 
x  =  0  (mod  30), 

for  example,  has  no  solution  because  the  first  congruence  says  x  is  odd  while  the 
second  says  x  is  even.  This  situation  occurs  precisely  because  the  moduli,  4  and 
30,  are  not  relatively  prime. 

Suppose  m  and  n  are  relatively  prime  natural  numbers.  Then,  for  any  integers 
a  and  b,  the  pair  of  congruences 

x  =  a  (mod  m ) 
x  =  b  (mod  n) 

has  a  solution.  One  way  to  see  this  is  to  note  that  there  are  integers  s  and  t  such 
that  sm  +tn  =  I,  by  Theorem  4.2.9.  Thus,  x  =  a(tn)  +b(sm)  is  a  solution  to  (8) 
since  In  =  1  (mod  m)  and  sm  =  1  (mod  n).  Moreover,  x  is  unique  mod  mn  in 
the  sense  that,  if  x'  is  another  solution,  then  x  =  x'  (mod  mn).  For  this,  observe 
that  if  x  and  x'  both  satisfy  (8),  then  x  —  x'  is  divisible  by  both  m  and  n  and 
hence  by  mn.  (See  Exercise  10  of  Section  4.2.) 


PROBLEM  28.  Solve  the  system  x  =  2  (mod  4) 

x  =  6  (mod  7). 


8Joe  Gallian.  “The  Mathematics  of  Identification  Numbers,”  The  College  Mathematics  Journal  22,  No. 
3  (May  1991). 
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Solution.  We  write  1  =  (— 1)(7)+2(4)  and  obtain  x  =  2(-l)(7)+6(2)(4)  =  34. 
Thus,  mod  28,  34  =  6  is  the  unique  solution  to  the  given  pair  of  congruences.  I 

We  have  been  investigating  a  special  case  of  the  Chinese  Remainder  Theorem, 
named  after  the  country  where  it  was  first  discovered  (circa  A.D.  350). 

Suppose  mi, m2, ...  , m,  are  pairwise  relatively  prime  integers,  that  is,  any  two 
of  them  are  relatively  prime.  Then,  for  any  integers  a\,a,2, ...  ,  at,  the  system  of 
congruences 

x  =  a\  (mod  m\) 
x  =  02  (mod  m2) 


x  =  a,  (mod  mt) 

has  a  solution  which  is  unique  modulo  the  product  mi  m2  •  •  •  mt. 

The  proof,  which  is  just  an  extension  of  the  t  =  2  case  already  discussed  and 
a  straightforward  exercise  in  mathematical  induction,  is  left  to  the  exercises  of 
Section  5.1. 

The  Chinese  Remainder  Theorem  has  many  practical  consequences,  two  of 
which  we  now  describe.  Each  provides  a  convincing  example  of  how  “pure” 
mathematics  can  have  very  relevant  applications. 

Determining  Numbers  by  Their  Remainders 

Let  n  be  a  natural  number.  By  the  Fundamental  Theorem  of  Arithmetic  (Theo¬ 
rem  4.3.9),  n  can  be  written  n  =  /?“' p“2  ■  ■  ■  p*,' ,  where  p\,  P2,  ••  •  ,  p,  are  unique 
distinct  primes  and  the  exponents  oq  >  0.  Let  a  >  1  be  some  integer  and  let  a,  be 
the  remainder  when  a  is  divided  by  By  Proposition  4.4.5,  a  =  at  (mod  p“'); 
in  other  words, 

a  =  a\  (mod  p“‘) 
a  =  02  (mod  p“2) 

a  =  at  (mod  /?“')• 

Since  the  numbers  /?“* , . . .  ,  pat'  are  relatively  prime,  the  Chinese  Remainder 
Theorem  says  that  a  is  unique  mod  n.  Thus,  if  0  <  a  <  n,  then  the  integer  a 
itself  is  uniquely  determined.  This  idea  is  crucial  for  the  manipulation  of  “large” 
numbers  in  computers  since  it  implies  that  in  order  to  store  a  number  a  larger 
than  a  given  computer’s  capacity,  it  is  sufficient  to  store  the  set  of  remainders 
which  a  leaves  upon  division  by  a  set  of  prime  powers.  We  illustrate. 


PROBLEM  29.  Let  n  =  90  =  2  ■  5  ■  9.  Suppose  a  is  an  integer,  0  <  a  <  90, 
which  has  remainders  1,  3,  and  4  upon  division  by  the  prime  powers  2,  5,  and  9, 
respectively.  We  claim  that  a  has  been  identified  uniquely.  Why? 
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EXAMPLE  30 


Solution.  We  are  given  that  a  is  a  solution  to  the  system 

x  =  \  (mod  2) 
x  -  3  (mod  5) 
x  =  4  (mod  9). 

The  Chinese  Remainder  Theorem  says  that  x  is  unique  mod  90.  Thus,  v  is  con¬ 
gruent  to  exactly  one  a  in  the  range  0  <  a  <  90. 

To  find  a,  we  solve  the  first  two  congruences  by  the  method  proposed  earlier. 
We  write  1(5)  —  2(2)  =  1  and  obtain  x  =  1(5)  —  3(2)(2)  =  —7  =  3  (mod  10). 
Solving  this  congruence  and  the  third  one,  we  write  1  (10)  —  1  (9)  =  1  and  determine 
a  =  4(10)  —  3(9)  =  13  (mod  90).  Thus,  a  =  13  +  901  for  some  l.  Since  0  <  a  < 
90,  we  must  have  a  =  13.  I 


Suppose  that  a  and  b  are  integers  and  that  m  \ ,  ...  ,  m,  are  pairwise  relatively 
prime.  (Typically,  the  m,  are  powers  of  distinct  prime  numbers.)  Let  a,  and  bi 
be  the  remainders  when  a  and  b  are  divided  by  mi,  respectively.  Since  a  =  a, 
(mod  nti)  and  b  =  b,  (mod  m,),  we  know  that  ab  =  a,  bi  (mod  /«,)  by  Proposi¬ 
tion  4.4.7.  The  Chinese  Remainder  Theorem  says  that  ab  is  determined  uniquely, 
mod  mn,  by  the  remainders  a\b\, . . .  ,  a,b,.  This  idea  can  be  exploited  when  a 
and  b  are  (very)  large  to  find  the  product  ab  in  shorter  time  than  is  possible  by 
conventional  methods.  Again,  we  illustrate  with  an  example. 

Let  a  =  64  and  b  =  79.  We  have 


a 

0 

(mod  4) 

b  =  3 

(mod  4) 

a  = 

1 

(mod  9) 

and  b  =  7 

(mod  9) 

a  = 

14 

(mod  25) 

b  =  4 

(mod  25). 

Thus,  ab  is  a  solution  to  the  set  of  congruences 
x  =  0(3)  =  0  (mod  4) 
x  =  1  (7)  =  7  (mod  9) 
x  =  14(4)  =  56  =  6  (mod  25). 

We  solve  the  first  two  congruences  by  writing  1  =9  —  2(4)  and  obtaining  x  = 
0(9)  -  7(2)(4)  =  -56  =  16  (mod  36).  Then  —9(36)  +  13(25)  =  1  gives  x  = 
6(— 9)(36)+ 16(13) (25)  =  3256  =  556  (mod  900)  as  a  solution  to  all  three.  Thus, 
ab  =  556+900?  for  some  integer  t.  Since  crude  estimates  give  4200  <  ab  <  5600, 
we  could  determine  ab  —  556  +  900(5)  =  5056,  this  being  the  only  value  of 
556  +  900 1  in  this  range.  A 

Cryptography 

Cryptography  (also  known  as  “cryptology”)  is  the  study  of  ways  in  which  mes¬ 
sages  can  be  coded  so  that  a  third  party,  intercepting  the  code,  will  have  great 
difficulty  recovering  the  original  text.  Such  coding  is  of  importance  not  just  to  the 
military.  Today,  when  so  much  information  is  transmitted  electronically,  secrecy 
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is  of  paramount  concern  to  us  all.  As  a  second  application  of  the  Chinese  Remain¬ 
der  Theorem  (and  of  other  ideas  introduced  in  this  section),  we  discuss  the  RSA 
Algorithm ,  a  method  of  encoding  a  message  discovered  in  1977  by  Ronald  Rivest, 
Adi  Shamir,  and  Leonard  Adleman. 

By  assigning  to  the  letters  of  the  alphabet  the  numbers  01,  02,  . . .  ,  26,  to 
a  space  the  number  27  and  to  various  punctuation  marks  numbers  beyond  27, 
it  is  apparent  that  any  message  determines  a  number  M.  “HELLO,”  for  instance, 
would  determine  the  number  M  —  0805121215.  Here  is  how  the  RSA  Algorithm 
converts  M  to  another  number  E. 

Choose  two  different  primes  p  and  q  and  a  natural  number  s  relatively  prime 
to  both  p—  1  and  q  —  1 .  Let  r  =  pq.  Let  E  be  the  remainder  when  Ms  is  divided 
by  r  and  use  E  to  encode  M;  that  is,  transmit  E  instead  of  M .  Since  E  =  Ms 
(mod  r),  we  have  E  =  Ms  (mod  p)  and  E  =  Ms  (mod  q). 

Using  the  RSA  Algorithm,  and  publicizing  the  values  of  r  and  s,  anybody  can 
send  you  an  encoded  message  which  you  can  decode  by  the  following  scheme. 
Since  gcd(s,  p  —  1)  =  1,  there  exist  integers  a  and  a:  such  that  as  +  x(p  —  1)  = 
1,  so  M  =  j^as+xip- 1)  _  ( Ms)a(Mp~x)x  =  Ea  (mod  p)  (by  Fermat’s  Little 
Theorem).  Similarly,  M  =  Eb  (mod  q)  for  an  integer  b  which  you  can  determine 
because  you  know  5  and  q.  Since  M  =  Ea  (mod  p)  and  M  =  Eb  (mod  q), 
by  the  Chinese  Remainder  Theorem,  M  is  uniquely  determined  modulo  r  and,  if 
M  <  r,  then  M  is  a  uniquely  determined  integer.  The  success  of  this  decoding 
procedure  lies  in  knowing  the  factorization  r  =  pq.  Finding  the  prime  factors  of 
a  large  integer  is  practically  impossible  if  r  is,  say,  a  100-digit  number.  Thus,  we 
have  described  a  procedure  which  can  be  public  knowledge  but  which  ensures 
that  messages  transmitted  to  you  are  secure. 


PROBLEM  31.  Suppose  r  =  17  •  59  =  1003  and  5  =  3.  A  secret  agent  wishes 
to  send  you  the  message  “GO.”  In  this  case,  M  =  715.  The  agent  calculates 
Ms  =  (7 1 5)3  and,  since  (715)3  =  579  (mod  1003),  she  transmits  the  number 
E  =  579.  You  receive  this  number  and  must  decode  it.  How  is  this  done? 

Solution.  First,  note  that  gcd(3,  16)  =  1  and  11(3)  +  (— 2)(16)  =  1,  so  a  =  11. 
Also,  39(3)  +  ( — 2) (58)  =  1,  so  b  =  39.  (It  is  always  possible  to  choose  a  and 
b  positive.  See  Exercise  21.)  Then  Ea  =  (579)”  =  l"  =  1  (mod  17),  while 
Eb  =  (579)39  =  (48)39  =  48(3) 19  ==  48(27)224  s  7  (mod  59).  The  problem  is 
then  reduced  to  solving  the  pair  of  congruences 

x=\  (mod  17) 

K  )  x  =  l  (mod  59). 

Using  the  methods  described  earlier,  we  obtain  x  =  715  (mod  1003),  which  was 
the  word  transmitted.  i 


What  “methods  described  earlier”?  Show  how  to  get  715  as  the  solution. 

As  a  final  remark,  we  note  that,  in  practice,  we  do  not  convert  a  message  to 
a  single  enormous  number  M  and  then  encode  M.  Instead  we  divide  the  message 
into  blocks  of  characters  of  a  fixed  length,  convert  each  block  to  a  number,  and 
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then  encode  these  numbers.  For  example,  the  message 

THE  PROJECT  IS  DELAYED  TWO  MONTHS 


might  be  divided  into  blocks  of  length  four, 

THEP  ROJE  CTIS  DELA  YEDT  WOMO  NTHS 

and  then  into  numbers 


20080516  18151005  03200919  04051201  25050420  23151315  14200819 


each  of  which  could  be  coded  as  described  previously. 

Readers  interested  in  learning  more  about  cryptography  might  consult  the 
excellent  little  book  Cryptology ,  by  Albrecht  Beutelspacher,  published  by  the 
Mathematical  Association  of  America  in  1994. 


Answers  to  Pauses 


16.  We  have  seen  that  changing  the  fourth  digit  from  6  to  5  gives  a  number  with 
a\  +  2a2  +  •  •  •  +  9cu)  +  lOajo  s  7  (mod  1 1),  so  if  we  also  change  the  first 
digit  from  0  to  4,  we  should  get  a\  +  la2  +  ■  ■  •  +  9a<)  +  10aio  =  0  (mod  1 1), 
as  desired.  It  is  easy  to  check  that  the  incorrect  ISBN  4-935031-03-4,  which 
has  errors  in  two  digits,  would  pass  our  test. 

17.  w=  (3,  1,3,  1,3,  1,3,  1,3,  1,3,  1). 

18.  The  sum  of  the  digits  which  code  a  number  on  the  left  is  odd,  whereas  the 
sum  of  the  digits  which  code  a  number  on  the  right  is  even.  If  the  scanner 
at  first  reads  001 1 101,  for  example,  the  sum  of  digits  being  even  implies  that 
the  number  is  on  the  right. 

19.  59  1  0 

17  0  1 

8  1  -3 

1  -2  7 


So  1  =  -2(59)  +  7(17)  and  l(-2)(59)  +  7(7)(17)  =  715  is  a  solution. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Which  of  the  following  are  valid  ISBNs  and  which  are 
not?  Explain. 

(a)  [BB]  0-123456-78-9 

(b)  [BB]  0-432091-05-5 

(c)  1-667132-42-6 

(d)  9-412883-19-6 

(e)  3-492166-27-X 

2.  Find  the  digit  A  so  that  each  of  the  following  numbers 
is  a  valid  ISBN  number. 


(a)  [BB]  3-416109-27-A 

(b)  A-461228-37-4 

(c)  9-123A45-51-X 

(d)  [BB]  2-7291 88-A6-2 

(e)  8-9A5398-28-4 

3.  Is  there  an  ISBN  number  of  the  form  A-31 5266-78-2? 
Explain. 

4.  [BB]  Show  that  the  test  given  in  (6)  to  detect  an  error  in 
an  ISBN  is  equivalent  to  the  test  a i  +  2a2  +  ■  ■  ■  +  9a<)  = 
a io  (mod  1 1). 
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5.  Show  that  an  error  in  any  single  digit  of  an  ISBN  will 
always  be  detected  by  the  test  in  (6). 

6.  (a)  [BB]  Change  the  third  and  sixth  digits  of  the  ISBN 

2-429783-29-0  so  that  the  resulting  number  is  not  a 
valid  ISBN. 

(b)  Change  the  third  and  sixth  digits  of  the  ISBN  in  (a) 
so  that  the  resulting  number  is  still  valid. 

(c)  Show  that  any  two  digits  of  an  ISBN  can  always  be 
changed  so  that  the  errors  would  not  be  detected  by 
the  test  in  (6). 


7.  Consider  the  following  alternative  check  for  correctness 
of  an  ISBN  number.  Instead  of  (6),  the  check  digit  aio 
is  determined  by  the  rule  a\  +  aj  +  •  •  -  +  £*9  +  «io  =  0 
(mod  11). 

(a)  [BB]  Express  this  rule  in  the  form  w  •  a  =  0 
(mod  11)  for  some  vectors  w  and  a. 

(b)  Show  that  this  rule  detects  an  error  in  a  single  digit. 

(c)  Show  that  this  rule  does  not  detect  a  transposition 
of  two  (different)  digits. 

8.  Try  to  identify  the  universal  product  numbers  defined 
by  each  of  the  bar  codes  in  Fig  4.5. 


(a)[BB] 


(b) 


Figure  4.5  Bar  codes  for  Exercise  8. 

9.  Which  of  the  following  are  valid  universal  product  num¬ 
bers  and  which  are  not?  Explain. 

(a)  [BB]  0-12345-67890-1 

(b)  1-66326-73551-5 

(c)  [BB]  2-52998-17394-9 

(d)  9-53889-22687-3 

(e)  8-41285-19384-2 

10.  In  each  of  the  following  cases,  find  the  digit  x  so  that 
the  given  number  is  a  valid  universal  product  number. 

(a)  [BB]  0-12x89-29109-4 


(b)  1-29347-49x26-8 
(C)  4-2921 7-1 0258-X 

(d)  5-91057-X9332-2 

(e)  x-791 54-91 937-6 

11.  [BB]  This  exercise  concerns  the  test  labeled  (7)  on 
page  138. 

(a)  Show  that  the  test  will  detect  a  single  error  in  an 
even  position  digit  of  a  universal  product  number. 

(b)  Show  by  example  that  the  test  will  not  necessarily 
detect  two  errors  in  even  positions. 
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12.  Repeat  both  parts  of  Exercise  1 1  for  odd  positions. 

13.  (a)  [BB]  Give  an  example  of  a  valid  universal  prod¬ 

uct  number  which  differs  from  the  valid  number 
1-23456-98732-6  in  the  fourth  and  fifth  positions. 

(b)  [BB]  Give  an  example  of  an  invalid  product  number 
which  differs  from  the  valid  number  1-23456-98732-6 
in  the  fourth  and  fifth  positions. 

14.  Will  the  test  in  (7)  detect  the  transposition  of  two  (dif¬ 
ferent)  adjacent  digits?  Explain. 

15.  A  certain  state  proposes  to  determine  the  check  digit  an 
of  a  12-digit  driver’s  license  number  a\a2  ■  ■  ■  an  by  the 
rule 

12ai  +  1 1  <32  +  10fl3  +  •  •  •  +  2a 1 1  -I-  an  =  0  (mod  10). 

(a)  Express  this  test  in  the  form  w-  a  =  0  (mod  10)  for 
vectors  w  and  a. 

(b)  [BB]  Does  this  test  detect  an  error  in  a  single  digit? 

(c)  Does  this  test  detect  the  error  resulting  from  the 
transposition  of  consecutive  digits? 

(d)  Does  it  detect  transposition  errors  in  general? 

Explain  your  answers. 

16.  Consider  the  general  test 

wa  =  0  (modn) 

for  correctness  of  a  £-digit  number  a\a2  . . .  <%,  0  <  a,  < 
9,  where  a  =  (a\,a 2,  ■■■  ,  a*),  w  =  (uq,  mo,  •  •  •  ,  u>t) 
is  a  weight  vector  and  n  >  I . 

(a)  [BB]  If  this  is  to  be  a  sensible  test,  we  should  have 
n  >  9.  Why? 

(b)  Assume  n  >  9.  If  Wj  is  relatively  prime  to  n  for  all 
i ,  show  that  the  test  will  detect  single-digit  errors. 

(c)  Assume  n  >  9.  Show  that  the  test  will  detect  the 
error  resulting  from  transposition  of  a,  and  aj  pro¬ 
vided  w,  —  wj  is  relatively  prime  to  n. 

17.  In  each  case,  find  the  smallest  nonnegative  integer  x 
which  satisfies  the  given  system  of  congruences. 

(a)  [BB]  x  =  3  (mod  5) 

x  =  4  (mod  7) 

(b)  a  =  1  (mod  4) 

a  =  8  (mod  9) 

(c)  a  =  3  (mod  5) 

a  =  7  (mod  8) 

(d)  [BB]  a  =  6  (mod  8) 

a  s  1 7  ( mod  25 ) 

(e)  a  =  3  (mod  1917) 

a  =  75  (mod  385) 


(f)  a  =  1003  (mod  17,369) 
x  =  2974  (mod  5472) 

(g)  a  =  1  (mod  4) 
a  =  8  (mod  9) 
a  =  10  (mod  25) 

18.  For  each  of  the  following,  find  the  smallest  positive 
integer  which  has  the  given  sequence  of  remainders 
when  divided  by  the  prime  powers  4,  3,  and  25,  respec¬ 
tively. 

(a)  [BB]  1,2,3 

(b)  2,  0.  6 

(c)  3,  1,  17 

(d)  [BB]  0,2,  10 

(e)  3,2,24 

19.  [BB]  Reflect  on  the  conversation  between  discrete  math 
student  and  friend  on  page  140.  What  was  the  num¬ 
ber? 

20.  In  each  of  the  following  cases,  find  a  positive  integer  a 
such  that  ab  =  x  modulo  a  suitable  integer.  Assuming 
that  ab  <  50,000,  find  ab  itself,  if  possible,  and  explain 
your  reasoning. 

(a)  [BB]  (t  =  3is3  (mod  4) 

a  =  0,is8  (mod  9) 
a  =  4,  b  =  1 8  (mod  25) 

(b)  a  =  7,  b  =  7  (mod  8) 
a  =  9,  b  =  8  (mod  27) 

a  =  29, b  =  18  (mod  125) 

(c)  a  =  1 ,  b  =  2  (mod  8) 
a  =  8,  b  =  I  (mod  27) 

a  =  55,6  =  82  (mod  125) 

(d)  a  =  I ,  b  =  6  (mod  8) 
a  =  2,  b  =  6  (mod  27) 

a  =  1 2,  b  =  97  (mod  1 25) 

(e)  [BB]  a  =  1 ,  b  =  3  (mod  4) 

a  =  3,  b  =  4  (mod  25) 
a  =  10,/?  =  79  (mod  343) 

(f)  a  =  3,  b  =  1  (mod  4) 

a  =  17,/?  =  6  (mod  25) 
a  =  78,  /?  =  122  (mod  343) 

(g)  a  =  3,  b  =  4  (mod  5) 

a  -  36,  b  =  42  (mod  49) 
a  =l,b  =  8  (mod  11) 
a  =  1,  b  =  3  (mod  13) 

(h)  a  =  2,  b  =  4  (mod  5) 

«  =  14,  b  =  36  (mod  49) 
rt  =  1 ,  b  =  10  (mod  1 1) 
a  =  11,6  =  7  (mod  13) 
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21.  [BB]  Suppose  m  and  n  are  relatively  prime  positive  inte¬ 
gers.  Show  that  there  exist  integers  ,v  and  t  with  s  >  0 
such  that  sm  +tn  =  1.  [ Hint :  It  might  be  helpful  first  to 
consider  some  specific  examples.  For  instance,  can  you 
find  s  and  t,  s  >  0.  such  that  Is  +  22 1  =  1?] 

The  remaining  exercises  are  based  upon  the  method  for 
encoding  messages  described  at  the  end  of  this  section. 

22.  Suppose  p  —  17,  q  —  23,  and  s  =  5.  How  would  you 
encode  each  of  the  following  “messages”? 

(a)  [BB]  X 

(b)  HELP 

(c)  [BB]  AIR 
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1.  Find  the  quotient  and  remainder  when 
11,109,999,999,997  is  divided  by  1111. 

2.  (a)  Convert  (1100101)2  to  base  10. 

(b)  Convert  32,145  to  octal. 

3.  (a)  The  sum  of  the  digits  of  the  number  8215  is  8  + 

2+1+5=16  =  7  (mod  9).  Observe  also  that 
8215  =  9(912)  +  7  =  7  (mod  9).  Does  this  hold 
for  any  number?  Explain. 

(b)  Suppose  we  want  to  compute  the  product  8215  x 
3567  modulo  9.  Replacing  these  numbers  by  those 
obtained  by  adding  their  digits  and  reducing  modulo 
9  gives  16  x  21  =  7x3  =  21=3  (mod  9).  Is  it 
true  that  8215  x  3567  =  3  (mod  9)?  Explain. 

4.  Find  gcd(2700,  —504)  and  express  it  as  an  integral  linear 
combination  of  the  given  integers. 

5.  Suppose  x,  a  and  b  are  integers  such  that  x  \  ab.  If  x 
and  a  are  relatively  prime,  prove  that  x  |  b. 

6.  For  any  k  e  N,  prove  that  gcd(4£  +  3,  Ik  +  5)  =  1. 

7.  (a)  Give  Euclid's  proof  that  there  are  infinitely  many 

primes. 

(b)  State  the  Fundamental  Theorem  of  Arithmetic. 


(d)  BYE 

(e)  NOW 

23.  Suppose  p  =  5,  q  —  7,  and  s  =  5.  Decode  each  of  the 
following  encoded  “messages.” 

(a)  [BB]  31 

(b)  24 

(c)  [BB]  7 

(d)  11 

(e)  23 

24.  Suppose  p  =  17,  q  =  59,  and  s  =  3. 

(a)  [BB]  If  you  receive  E  =  456,  what  is  the  message? 

(b)  If  you  receive  E  =  926,  what  is  the  message? 


8.  Is  2119  —  1  prime?  What  about  3 1 09  —  1?  What  about 
4109  -  1?  Explain  your  answers. 

9.  True  or  false  (and  justify): 

(a)  If  b  I  a  and  -  I  c,  then  be  I  a. 

b 

(b)  If  b  |  a  and  -  I  c,  then  a  I  be. 

I  jy  I 

10.  (a)  Compute  380  (mod  7). 

(b)  Find  all  integers  x  such  that  5x  =  1  (mod  100). 
Briefly  explain  your  answer. 

11.  Find  all  integers  x  and  y,  0  <  x,  y  <  10,  which  satisfy 

x  +  5y  =  5  (mod  10) 

5x  +  3y  =  1  (mod  10). 

12.  (a)  Find  the  digit  A  so  that  A-253228-46-7  is  a  valid  ISBN 

number. 

(b)  Find  the  digit  x  so  that  3-25814-39x75-6  is  a  valid 
universal  product  code. 

13.  Find  the  smallest  nonnegative  integer  which  satisfies 

x  =  5  (mod  341) 
jc  =  1 1  (mod  189). 
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5.1  Mathematical  Induction 

One  of  the  most  basic  methods  of  proof  is  mathematical  induction,  which  is  a  way 
to  establish  the  truth  of  a  statement  about  all  the  natural  numbers  or,  sometimes, 
all  sufficiently  large  integers.  Mathematical  induction  is  important  in  every  area 
of  mathematics.  In  addition  to  the  examples  presented  in  this  section,  other  proofs 
by  mathematical  induction  appear  elsewhere  in  this  book  in  a  variety  of  different 
contexts.  In  the  index  (see  “induction”),  we  draw  attention  to  some  theorems 
whose  proofs  serve  as  especially  good  models  of  the  technique. 


PROBLEM  1.  A  certain  store  sells  envelopes  in  packages  of  five  and  packages 
of  twelve  and  you  want  to  buy  n  envelopes.  Prove  that  for  every  n  >  44,  this 
store  can  fill  an  order  for  exactly  n  envelopes  (assuming  an  unlimited  supply  of 
each  type  of  envelope  package). 

Solution.  If  you  want  to  purchase  44  envelopes,  you  can  buy  two  packages  of 
twelve  and  four  packages  of  five.  If  you  want  to  purchase  45  envelopes,  you  can 
buy  nine  packages  of  five.  If  you  want  to  purchase  46  envelopes,  pick  up  three 
packages  of  twelve  and  two  packages  of  five.  If  you  want  to  buy  47  envelopes,  get 
one  package  of  twelve  and  seven  packages  of  five  and,  if  you  want  48  envelopes, 
purchase  four  packages  of  twelve. 

The  obvious  difficulty  with  this  way  of  attacking  the  problem  is  that  it  never 
ends.  Even  supposing  that  we  continued  laboriously  to  answer  the  question  for  n 
as  big  as  153,  say,  could  we  be  sure  of  a  solution  for  n  =  154?  What  is  needed  is  a 
general,  not  an  ad  hoc,  way  to  continue;  that  is,  if  it  is  possible  to  fill  an  order  for 
exactly  k  envelopes  at  this  store,  we  would  like  to  be  able  to  deduce  that  the  store 
can  also  fill  an  order  for  k  +  1  envelopes.  Then,  knowing  that  we  can  purchase 
exactly  44  envelopes  and  knowing  that  we  can  always  continue,  we  could  deduce 
that  we  can  purchase  exactly  45  envelopes.  Knowing  this,  and  knowing  that  we 
can  always  continue,  we  would  know  that  we  can  purchase  exactly  46  envelopes. 
And  so  on. 
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Suppose — just  suppose — that  it  is  possible  to  buy  exactly  k  envelopes  at 
this  store,  where  k  >  44.  If  this  purchase  requires  seven  packages  of  five,  then 
exchanging  these  for  three  packages  of  twelve  fills  an  order  of  exactly  k  +  1 
envelopes.  On  the  other  hand,  if  k  envelopes  are  purchased  without  including  seven 
packages  of  five,  then  the  order  for  k  envelopes  included  at  most  30  envelopes 
in  packages  of  five  and  so,  since  k  >  44,  at  least  two  packages  of  twelve  must 
have  been  required.  Exchanging  these  for  five  packages  of  five  then  fills  exactly 
an  order  for  k  +  1  envelopes.  We  conclude  that  any  order  for  n  >  44  envelopes 
can  be  filled  exactly.  I 


This  example  demonstrates  the  key  ingredients  of  a  proof  by  mathematical 
induction.  Asked  to  prove  something  about  all  the  integers  greater  than  or  equal 
to  a  particular  given  integer — for  instance,  that  any  order  of  n  >44  envelopes  can 
be  filled  with  packages  of  five  and  twelve — we  first  establish  truth  for  the  first 
integer — for  example,  n  =  44 — and  then  show  how  the  truth  of  the  statement  for 
n  =  k  enables  us  to  deduce  truth  for  n  —  k  +  1 . 


Figure  5.1  An  8  x  8  board. 


PROBLEM  2.  Chess  is  a  game  played  on  an  8  x  8  grid,  that  is,  a  board  consisting 
of  eight  rows  of  eight  small  squares.  (See  Fig  5. 1 .)  Suppose  our  board  is  defective 
in  the  sense  that  one  of  its  squares  is  missing.  Given  a  box  of  L-shaped  trominos 

like  this,  Q]  ,  each  of  which  covers  exactly  three  squares  of  a  chess  board,  is  it 

possible  to  tile  the  board  without  overlapping  or  going  off  the  board?  Show,  in 
fact,  that  it  is  possible  to  tile  any  2n  x  2"  defective  board. 


Figure  5.2  Three  defective  boards. 

Solution.  We  begin  our  solution  by  thinking  of  some  easier  situations.  Figure  5.2 
shows  defective  2  x  2,  4  x  4,  and  8x8  boards  (with  the  missing  square  highlighted 
in  each  case).  Certainly  the  2  x  2  board  can  be  tiled  because  its  shape  is  exactly 
that  of  a  single  tromino.  A  little  experimentation  would  show  how  to  tile  the 
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Figure  5.3 


5.1.1  THE  PRINCIPLE 
OF  MATHEMATICAL 
INDUCTION 


4x4  board.  Rather  than  proceeding  case  by  case,  however,  we  use  the  idea 
suggested  by  our  first  example  and  attempt  to  understand  how  a  solution  for 
one  particular  board  can  be  used  to  obtain  a  solution  for  the  next  bigger  board. 
Suppose  then  that  we  know  how  to  tile  any  2k  x  2k  defective  board.  How  might 
we  tile  a  defective  board  of  the  next  size,  2k+l  x  2k+x  ?  The  idea  is  to  realize  that  a 
2k+x  x2kn  board  can  be  divided  into  four  boards,  each  of  size  2k  x  2k ,  as  shown  in 
Fig  5.3. 

2k  2k 


2k 


2k 


r 

n 

J 

One  of  these  smaller  boards  contains  the  missing  square  and  so  is  defective. 
Now  place  a  tromino  at  the  center  so  as  to  cover  squares  in  each  of  the  three 
remaining  smaller  boards.  Each  of  the  boards  is  now  “defective”  and  so,  by 
assumption,  can  be  tiled  with  trominos.  So  we  have  tiled  the  larger  board!  I 

The  two  examples  discussed  so  far  made  assertions  about  infinitely  many 
consecutive  integers.  In  each  case,  we  adopted  the  following  strategy. 

•  Verify  that  there  is  a  solution  for  the  smallest  integer. 

•  Show  how  a  solution  for  one  integer  leads  to  a  solution  for  the  next. 

We  now  give  a  formal  statement  of  the  principle  which  has  been  at  work,  the 
Principle  of  Mathematical  Induction. 

Given  a  statement  CP  concerning  the  integer  n,  suppose 

1.  CP  is  true  for  some  particular  integer  no', 

2.  if  CP  is  true  for  some  particular  integer  k  >  no,  then  it  is  true  for  the  next 
integer  k  +  1 . 

Then  CP  is  true  for  all  integers  n  >  no- 

In  Step  2,  the  assumption  that  CP  is  true  for  some  particular  integer  is  known 
as  the  induction  hypothesis. 

In  our  first  example,  we  had  to  prove  that  any  order  of  n  envelopes,  n  >  44, 
could  be  filled  with  packages  of  five  and  of  twelve:  no  was  44  and  the  induction 
hypothesis  was  the  assumption  that  there  was  a  way  to  purchase  k  envelopes  with 
packages  of  five  and  twelve.  In  the  second  example,  we  had  to  demonstrate  that 
any  defective  board  of  size  2"  x  2”,  n  >  1,  could  be  covered  in  a  certain  way;  no 
was  1  and  the  induction  hypothesis  was  the  assumption  that  we  could  properly 
cover  a  2k  x  2k  board. 
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Our  next  example  is  suggested  by  the  following  pattern.  Notice  that 

1=  1  =  12 
1  +  3  =  4  =  22 
1+3  +  5=  9  =  32 


The  first  odd  integer  is  l2;  the  sum  of  the  first  two  odd  integers  is  22;  the  sum 
of  the  first  three  odd  integers  is  32  and  so  on.  It  appears  as  if  the  sum  of  the 
first  n  odd  integers  might  always  be  n2.  The  picture  in  Fig  5.4  adds  force  to  this 
possibility. 


Figure  5.4  The  sum  of  the  first  n  odd  integers  is  n2. 


PROBLEM  3.  Prove  that  for  any  integer  n  >  1,  the  sum  of  the  odd  integers 
from  1  to  2n  —  1  is  n2. 

Before  solving  this  problem,  we  remark  that  the  sum  in  question  is  often 
written 

(1)  1  +  3  +  5  +  •  •  •  +  (In  —  1), 

where  the  first  three  terms  here — 1  +  3  +  5 — are  present  just  to  indicate  that  odd 
numbers  are  being  added,  beginning  with  1,  and  the  last  term,  2n  —  1,  describes 
the  last  term  and  gives  a  formula  for  the  general  term:  The  second  odd  number  is 
2(2)  —  1;  the  third  odd  number  is  2(3)  —  1,  the  kth  odd  number  is  2/c  —  1 .  Do  not 
infer  from  this  expression  that  each  of  the  three  numbers  1,  3,  5  is  always  present. 
For  example,  when  n  =  2,  2n  —  1  =  3  and  so  the  sum  in  (1)  is,  by  definition,  1+3. 
We  can  also  describe  the  sum  (1)  with  sigma  notation, 

n 

(2)  1  +3  +  5  + •■•  +  (2//  -  1)  =  ^(2f  -  1), 

i=i 

so  named  because  the  capital  Greek  letter,  used  to  denote  summation,  is 
pronounced  “sigma.”  The  letter  i  is  called  the  index  of  summation ;  the  i  =  1  at 
the  bottom  and  the  n  at  the  top  mean  that  the  summation  starts  with  i  =  1  and 
continues  with  i  =  2,  3  and  so  on,  until  i  =  n.  Thus,  the  first  term  in  the  sum 
is  2 i  —  1  with  i  —  1;  that  is,  2(1)  —  1  =  1.  The  second  term  is  2 i  —  1  with 
i  =  2;  that  is,  2(2)  —  1=3.  Summing  continues  until  i  =  n\  The  last  term  is 
In  -  1. 
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4 

Write  ^P(2 i  —  1)  without  using  sigma  notation  and  evaluate  this  sum. 

/= l 

Problem  3  asks  us  to  prove  that,  for  all  integers  n  >  1, 

(3)  1  +  3  +  5  +  •  •  •  +  (2n  -  1)  =  n1 

or,  equivalently,  that 

Y(2i  —  1)  =  n2 . 
i=l 

Solution.  In  this  problem,  no  =  1.  When  n  =  1,  1  +  3  +  5  +  •  •  •  +  (2 n  —  1) 
means  “the  sum  of  the  odd  integers  from  1  to  2(1)  —  1  =  1.”  Thus,  the  sum  is 
just  1.  Since  l2  is  also  1,  the  statement  is  true  for  n  —  1.  Now  suppose  that  k  is 
an  integer,  k  >  1,  and  the  statement  is  true  for  n  =  k:  in  other  words,  suppose 

1  +  3  +  5  +  •  •  •  +  (2k  —  1)  =  k2  induction  hypothesis. 

We  must  show  that  the  statement  is  true  for  the  next  integer,  n  =  k  +  1 ;  namely, 
we  must  show  that 

1  +  3  +  5  +  •  •  •  +  (2 (k  +  1)  -  1)  =  (k  +  l)2. 

Since  2(k  +  l)  —  1  =  2k  +  1,  we  have  to  show 

1  +  3  +  5  +  •  •  •  +  (2k  -f-  1)  =  (k  +  1)“. 

!!!! - WAIT  A  SECOND - !!!! 

The  sum  on  the  left  is  the  sum  of  the  odd  integers  from  1  to  2k  +  1;  this  is  the 
sum  of  the  odd  integers  from  1  to  2k  -  1,  plus  the  next  odd  integer,  2k  +  1 : 

1  +  3  +  5  +  •  •  •  +  (2k  +1)  =  [1  +  3  +  5  + - (-  (2k  —  1)]  +  (2k  +  1). 

By  the  induction  hypothesis,  we  know  that 

1  +  3  +  5  +  •  ■  •  +  (2k  +  1 ) 

=  1  +  3  +  5  +  •  •  •  +  (2k  -  1)  +  (2k  +  1)  =  k2  +  (2k  +  1). 

Since  k2  +  (2k  +  !)  =  (&  +  I)2,  this  is  the  result  we  wanted.  By  the  Principle  of 
Mathematical  Induction,  statement  (3)  is  true  for  all  integers  n  >  1.  I 


Why  did  we  “wait  a  second”  in  the  preceding  argument?  It  has  been  the 
authors’  experience  that  students  sometimes  confuse  their  statement  of  what  is  to 
be  proven  when  n  =  k  +  1  with  the  start  of  their  actual  proof.  Consequently,  we 
strongly  recommend  the  following  approach  to  a  proof  by  mathematical  induction: 

•  verify  the  statement  for  n  =  n$\ 

•  write  down  the  induction  hypothesis  (the  statement  for  n  =  k)  in  the  form 
“Now  suppose  that  ...  ”  and  be  explicit  about  what  is  being  assumed; 

•  write  down  what  is  to  be  proven  (the  statement  for  n  =  k  +  1)  in  the  form 

“We  must  show  that  ...  ”  again  being  very  explicit  about  what  is  to  be 

shown;  and  finally  (after  waiting  a  second); 
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•  give  a  convincing  argument  as  to  why  the  statement  for  n  =  k  +  I  is  true 
(and  make  sure  this  argument  uses  the  induction  hypothesis). 

We  continue  with  several  examples  which  you  should  take  as  models  for 
proofs  by  mathematical  induction. 


PROBLEM  4.  Prove  that  for  any  natural  number  n  >  1 , 
1"  +  2~  +  3^  +  ■  ■  ■  + 


Solution.  When  n  =  1,  the  sum  of  the  integers  from  1 2  to  1 2  is  1.  Also 


1(1  +  1)  (2  •  1  +  1)  = 

6 

so  the  statement  is  true  for  n  =  1.  Now  suppose  that  k  >  1  and  the  statement  is 
true  for  n  =  k;  that  is,  suppose  that 


l2  +  22  +  32  +  •  •  ■  +  k2 


k(k  +  1 )  (2 k  4-  1) 
6 


We  have  to  show  that  the  statement  is  true  for  n  =  k  +  1 ;  that  is,  we  have  to  show 
that 


l2  +  22  +  32  +  •••  +  (*  +  l)2 


(k  +  1)[(A  +  1)  +  1  ][2  (k  +  1)  +  1  ]  (k  +  1)(A  +  2)  (2  k  +  3) 
=  __  6  ~  =  6 
!!!! - WAIT  A  SECOND - !!!! 

Observe  that 

l2  +  22  +  32  +  ---  +  (A’+  l)2  =  (l2  +  22  +  32  +  "-+k2)  +  (k+  l)2 


k(k  +  1)(2  k  +  1) 

6 


+  (k  +  l)2 


k(k+  1)(2*+  1)  +  6(A+  l)2 
6 

(k  +  ])[k(2k+  1)  +  6(A+  1)] 

6 

(k  -f  1)[2A“  I  Ik  -f  6] 

_  6 

(k  +  \){k  +  2)(2k  +  3) 

6 

which  is  just  what  we  wanted.  By  the  Principle  of  Mathematical  Induction,  the 
statement  is  true  for  all  integers  n  >  1.  I 
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PROBLEM  5.  Prove  that  for  any  integer  n  >  1,  2ln  —  1  is  divisible  by  3. 

Solution.  When  n  —  1,  22(1)  -1=  22  —  1=4  —  1=3  is  divisible  by  3. 
Now  suppose  that  k  >  1  and  the  statement  is  true  for  n  =  k;  that  is,  suppose  that 
22*  —  1  is  divisible  by  3.  We  must  prove  that  the  statement  is  true  for  n  =  k  +  1; 
that  is,  we  must  prove  that  22(i+1)  —  1  is  divisible  by  3.  The  key  to  what  follows 
is  the  fact  that  we  must  somehow  involve  the  induction  hypothesis.  Observe  that 
22<*+i)  _  i  =  22*+2  -  1  =  4(22k)  —  1.  This  is  helpful  since  it  introduces  22k.  By 
the  induction  hypothesis,  2~k  —  1  =  3/  for  some  integer  /,  so  2~k  =  3/  +  1.  Now 
it’s  smooth  sailing.  We  have 

22(*+i)  _  i  =  4(22k)  -  1  =  4(3/  +  1)  -  1  =  12/  +  4  -  1  =  12/  +  3  =  3(4/  +  1). 

Thus,  22(*+1)  -  1  is  divisible  by  3,  as  required.  By  the  Principle  of  Mathematical 
Induction,  2?"  —  1  is  divisible  by  3  for  all  integers  n  >  1.  I 


Prove  that  32"  —  1  is  divisible  by  8  for  every  n  >  1 . 

Let  n  be  a  given  positive  integer.  It  is  convenient  to  have  some  notation  for 
the  product  of  all  the  integers  between  1  and  n  since  this  sort  of  product  occurs 
frequently  in  statistical  and  in  counting  problems.  (See  Chapter  7.) 


5.1.2  DEFINITION 


Define  0!  =  1  and,  for  any  integer  n  >  1,  define 

n\  =  n(n  -  1  )(n  -  2)  •  •  •  (3)(2)(1). 


5.1.3  STIRLING'S 
APPROXIMATION 


The  symbol  n!  is  read  “n  factorial.”  The  first  few  factorials  are  0!  =  1,  1!  =  1, 
2!  =  2  •  1  =  2,  3!  =  3(2)(1)  =  6,  4!  =  4(3)(2)(1)  =  24.  It  is  useful  to  notice  that 
4!  =  4(3!),  5!  =  5(4!),  and  so  on.  Thus,  if  we  know  that  8!  =  40,320,  then  it 
is  easy  to  deduce  that  9!  =  9(40,320)  =  362,880.  Factorials  grow  very  quickly. 
James  Stirling  (1730)  provided  an  important  estimate  for  the  size  of  n !  when  n  is 
large. 


lim  ’ - 

V2tt n(n/e)n 


=  1; 


equivalently, 


In  this  formula,  e  —  2.71828 . . .  denotes  the  base  of  the  natural  logarithms. 
Remember  from  our  discussion  of  the  Prime  Number  Theorem  (Theorem  4.3.13) 
that  we  read  “is  asymptotic  to”  at  the  symbol  ~.  Thus,  Stirling’s  formula  says 
that  n\  is  asymptotic  to  «j2nn{n/e)n,  meaning  that  n !  is  approximately  equal 
to  V27 xn{n/e)n,  for  large  n.  For  example,  15!  ^  \/307r(15/e)15  ^  1.3  x  1012, 
which,  by  most  people’s  standards,  is  indeed  a  large  number. 

Our  next  problem  provides  another  indication  of  the  size  of  n!,  albeit  a  rather 
crude  one.  For  example,  it  says  that  15!  >  215  =  32,768  and  30!  >  230  «  109. 


PROBLEM  6.  Prove  that  n !  >  2”  for  all  n  >  4. 

Solution.  In  this  problem,  no  =  4  and  certainly  4!  =  24  >  16  =  24.  Thus,  the 
statement  is  true  for  no-  Now  suppose  that  k  >  4  and  the  statement  is  true  for 
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n  =  k.  Thus,  we  suppose  that  k\  >  2k .  We  must  prove  that  the  statement  is  true 
for  n  =  k  +  1;  that  is,  we  must  prove  that  ( k  +  1)!  >  2k+1.  Now 


(k  +  1)!  =  (*  +  !)*!  >  (k  +  1)2* 


using  the  induction  hypothesis.  Since  k  >  4,  certainly  k  +  1  >  2,  so  (k  +  \)2k  > 
2  ■  2k  =  2*  11 .  We  conclude  that  (k  +  I)!  >  2*+l  as  desired.  By  the  Principle  of 
Mathematical  Induction,  we  conclude  that  n !  >2"  for  all  integers  n  >  4.  I 


What  was  the  induction  hypothesis  in  this  problem? 

Why  did  the  induction  in  this  example  start  at  n  =  4  instead  of  some  smaller 
integer? 


The  Principle  of  Mathematical  Induction  is  one  of  the  most  powerful  tools 
of  mathematics.  With  it,  we  can  prove  many  interesting  things,  but  if  it  is  not 
applied  correctly,  we  can  also  prove  some  interesting  things  which  are  not  true! 


PROBLEM  7.  What  is  the  flaw  in  the  following  argument,  which  purports  to 
show  that 


2  +  4  +  6  +  •  ■  •  +  2n  =  (n  —  1  )(n  +  2) 
for  all  positive  integers  «? 

“Assume  that  2  +  4  +  6  4 - +  2k  =  (k  —  l)(fc  +  2)  for  some  integer 

k.  Then 

2  +  4  +  6  +  •  •  •  +  2  (k  +  l)  =  (2  +  4  +  6  +  --  -  +  2k)  +  2(k  +  1) 

=  (k  —  l)(fc  +  2)  +  2  (k  +  1) 

(by  the  induction  hypothesis) 

=  k2  +  k  -  2  +  2k  +  2 
=  k2  +  3  k 
=  k(k  +  3) 

=  [(*+l)-l)][(*  +  l)  +  2], 

which  is  the  given  statement  for  n  =  k  +  1 .  It  follows,  by  the  Principle 
of  Mathematical  Induction,  that  the  statement  is  true  for  all  positive 
integers  n.” 

Solution.  The  inductive  step,  as  given,  is  correct,  but  we  neglected  to  check  the 
case  n  —  1,  for  which  the  statement  is  most  definitely  false.  I 


There  is  another  form  of  the  Principle  of  Mathematical  Induction,  called  the 
strong  form,  because,  at  first  glance,  it  appears  to  be  more  powerful  than  the 
principle  stated  previously.  The  two  forms  are  completely  equivalent,  however: 
The  collection  of  statements  which  can  be  proven  true  using  one  form  is  exactly 
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5.1.4  PRINCIPLE  OF 
MATHEMATICAL 
INDUCTION  (STRONG 
FORM) 


the  collection  which  can  be  proven  true  using  the  other.  It  just  so  happens  that  in 
certain  problems,  the  strong  form  is  more  convenient  than  the  other. 

Given  a  statement  (P  concerning  the  integer  n,  suppose 

1.  iP  is  true  for  some  integer  no', 

2.  if  &  >  no  is  any  integer  and  7  is  true  for  all  integers  l  in  the  range  no  <  l  <  k, 
then  it  is  true  also  for  k. 


Then  CP  is  true  for  all  integers  n  >  no. 


The  two  forms  of  the  Principle  of  Mathematical  Induction  differ  only  in 
the  statement  of  the  induction  hypothesis  (the  assumption  in  the  second  step). 
Previously,  we  assumed  the  truth  of  the  statement  for  just  one  particular  integer 
and  we  had  to  prove  it  true  for  the  next  largest  integer.  In  the  strong  form  of 
mathematical  induction,  we  assume  the  truth  of  the  statement  for  all  integers  less 
than  some  integer  and  prove  that  the  statement  is  true  for  that  integer.  When  we 
first  encounter  mathematical  induction,  it  seems  to  be  the  weak  form  which  is 
always  used;  problems  requiring  the  strong  form  are  seldom  encountered.  In  the 
analysis  of  finite  structures,  however,  the  strong  form  is  employed  extensively.  We 
want  to  acquire  knowledge  about  structures  of  a  certain  size  from  knowledge  about 
similar  structures  of  smaller  size.  Recall  that  part  of  the  Fundamental  Theorem 
of  Arithmetic  states  that  every  natural  number  greater  than  1  is  the  product  of 
primes.  The  strong  form  of  mathematical  induction  affords  a  very  straightforward 
proof  of  this  result. 


PROBLEM  8.  Use  the  strong  form  of  mathematical  induction  to  prove  that  every 
natural  number  n  >  2  is  either  prime  or  the  product  of  prime  numbers.  (See  4.3.9, 
the  Fundamental  Theorem  of  Arithmetic.) 

Solution.  The  theorem  is  a  statement  about  all  integers  n  >  2.  The  first  such 
integer,  no  =  2,  is  prime,  so  the  assertion  of  the  theorem  is  true.  Now  let  k  >  2 
and  suppose  that  the  assertion  is  true  for  all  positive  integers  i,  2  <  i  <  k;  in 
other  words,  suppose  that  every  integer  i  in  the  interval  2  <  l  <  k  is  either  prime 
or  the  product  of  primes.  We  must  prove  that  k  has  this  same  property.  If  k  is 
prime,  there  is  nothing  more  to  do.  On  the  other  hand,  if  k  is  not  prime,  then  k 
can  be  factored  k  =  ab ,  where  a  and  b  are  integers  satisfying  2  <  a,  b  <  k.  By 
the  induction  hypothesis,  each  of  a  and  b  is  either  prime  or  the  product  of  primes. 
Thus,  k  is  the  product  of  primes,  as  required.  By  the  Principle  of  Mathematical 
Induction,  we  conclude  that  every  n  >  2  is  prime  or  the  product  of  primes.  I 


Our  next  problem  demonstrates  another  common  error  in  “proofs”  by  math¬ 
ematical  induction. 


PROBLEM  9.  Canada  has  a  two-dollar  coin  known  colloquially  as  the  “toonie.” 
What  is  wrong  with  the  following  argument,  which  purports  to  prove  that  any  debt 
of  n  >  1  Canadian  dollars  can  be  repaid  (exactly)  with  only  toonies? 

Here  no  =  2.  We  begin  by  noting  that  any  two-dollar  debt  can  be 
repaid  with  a  single  toonie.  Thus,  the  assertion  is  true  for  n  =  2. 
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Now  let  k  >  2  and  suppose  that  the  assertion  is  true  for  all  l,  2  <  i  <  k. 

We  must  prove  that  the  assertion  is  true  for  n  =  k.  For  this,  we  apply 
the  induction  hypothesis  to  k  —  2  and  see  that  a  (k  —  2)-dollar  debt  can 
be  repaid  with  toonies.  Adding  one  more  toonie  allows  us  to  repay  k 
dollars  with  only  toonies,  as  required.  By  the  Principle  of  Mathematical 
Induction,  any  debt  of  n  >  1  dollars  can  be  repaid  with  toonies. 

Solution.  The  problem  here  is  that  the  latter  part  of  the  argument  does  not  work 
if  k  =  3. 

The  induction  hypothesis — that  the  assertion  is  true  for  all  i,  2  <  l  <  k — 
was  applied  to  t  —  k  —  2.  If  k  =  3,  however,  then  k  —  2=1  and  the  induction 
hypothesis  cannot  be  applied.  I 


We  conclude  with  a  brief  discussion  about  the  equivalence  of  the  two  Princi¬ 
ples  of  Mathematical  Induction  and  the  Well-Ordering  Principle. 

Mathematical  Induction  and  Well  Ordering 

Recall  that  the  Well-Ordering  Principle  (4.1.2)  says  that  any  nonempty  set  of 
natural  numbers  has  a  smallest  element.  This  can  be  proved  using  the  weak  form 
of  the  Principle  of  Mathematical  Induction.  Here  is  the  argument. 

A  set  containing  just  one  element  has  a  smallest  member,  the  element  itself, 
so  the  Well-Ordering  Principle  is  true  for  sets  of  size  no  =  1.  Now  suppose  it 
is  true  for  sets  of  size  k\  that  is,  assume  that  any  set  of  k  natural  numbers  has  a 
smallest  member.  Given  a  set  S  of  k  +  1  numbers,  remove  one  element  a.  The 
remaining  k  numbers  have  a  smallest  element,  say  b,  and  the  smaller  of  a  and  b 
is  the  smallest  element  of  5.  This  proves  that  any  finite  set  of  natural  numbers 
has  a  smallest  element.  We  leave  to  the  reader  (Exercise  17)  the  extension  of  this 
result  to  arbitrary  subsets  of  N. 

Conversely,  we  may  use  the  Well-Ordering  Principle  to  prove  the  Principle  of 
Mathematical  Induction  (weak  form).  For  suppose  that  V  is  a  statement  involving 
the  integer  n  which  we  wish  to  establish  for  all  integers  greater  than  or  equal  to 
some  given  integer  no.  Assume 

1.  V  is  true  for  n  =  no,  and 

2.  if  V  is  true  for  an  integer  k  >  no,  then  it  is  also  true  for  k  +  1 . 

How  does  the  Principle  of  Well-Ordering  show  that  V  is  true  for  all  n  >  no?  For 
convenience  we  assume  that  no  >  1.  (The  case  n o  <  0  can  be  handled  with  a 
slight  variation  of  the  argument  we  present.) 

If  V  is  not  true  for  all  n  >  no,  then  the  set  S  of  natural  numbers  n  >  no  for 
which  V  is  false  is  not  empty.  By  the  Well-Ordering  Principle,  .S’  has  a  smallest 
element  a.  Now  a  /  no  because  we  have  established  that  V  is  true  for  n  =  no- 
Thus,  a  >  no,  so  a  —  1  >  no-  Also,  a  —  1  <  a.  By  minimality  of  a,  V  is  true  for 
k  =  a  —  1.  By  assumption  2,  V  is  true  for  k+ 1  =  a,  a  contradiction.  We  are  forced 
to  conclude  that  our  starting  assumption  is  false:  V  must  be  true  for  all  n  >  no. 

The  preceding  paragraphs  show  that  the  Principles  of  Well-Ordering  and 
Mathematical  Induction  (weak  form)  are  equivalent.  With  minor  variations  in  the 
reasoning,  we  can  prove  that  the  Principles  of  Well  Ordering  and  Mathematical 


5.1  Mathematical  induction  159 


Answers  to  Pauses 


Induction  (strong  form)  are  equivalent.  It  follows,  therefore,  that  the  three  princi¬ 
ples  are  logically  equivalent. 

1.  Eti(2i'  -  1)  =  [2(D  -  1]  +  [2(2)  -  1]  +  [2(3)  -  1]  +  [2(4)  -  1]  =  1  +  3  + 
5  +  7=  16. 

2.  When  n  =  1,  31 2^*^  —  1  =  32  —  1  =  9  —  1  =  8  is  divisible  by  8.  Now  suppose 
that  k  >  1  and  the  statement  is  true  for  n  —  k\  that  is,  suppose  that  32k  —  1 
is  divisible  by  8.  Thus,  32k  —  1  =  8?  for  some  integer  t  and  so  32k  =  8/  +  1. 
We  have 

32»+l)  _  {  _  32k+2  _  j  _  9(32*)  _  j 


=  9(8r  +  1)  -  1  =  72r  +  9  -  1  =  72f  +  8  =  8(9 1  +  1). 

Thus,  32(*+1)  -  1  is  divisible  by  8,  as  required.  By  the  Principle  of  Mathe¬ 
matical  Induction,  32”  —  1  is  divisible  by  8  for  all  integers  n  >  1. 

3.  The  induction  hypothesis  was  that  k\  >  2k  for  some  particular  integer  k  >  4. 

4.  The  statement  n\  >  2"  is  not  true  for  n  <  4;  for  example,  3!  =  6,  whereas 
23 4  =  8. 


EXERCISES 


The  symbol  [ BB  /  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Write  each  of  the  following  sums  without  using  E  and 
evaluate. 

(a)  [BB]  Ef=i  i2 

(b)  [BB]  Etl  2' 

(c)  [BB]  E,=i  sin  7r t 

(d)  l5=o  32+2 

(e)  Tl=-i(^2-k  +  l) 

(f)  Et-o(-l)‘ 

2.  List  the  elements  of  each  of  the  following  sets: 

(a)  {£7=o(-D'  I  «  =  0, 1,2,3} 

(b)  (E7=i  2‘  I  »  e  N,  1  <  n  <  5} 

3.  Prove  that  it  is  possible  to  fill  an  order  for  n  >32  pounds 
of  fish  given  bottomless  wheelbarrows  full  of  5-pound 
and  9-pound  fish. 

4.  Use  mathematical  induction  to  prove  the  truth  of  each 
of  the  following  assertions  for  all  n  >  1 . 

(a)  [BB]  n2  +  n  is  divisible  by  2. 

(b)  nr  +  2n  is  divisible  by  3. 

(c)  [BB]  n3  +  (n  +  l)3  +  (n  +  2)3  is  divisible  by  9. 

(d)  5"  —  1  is  divisible  by  4. 

(e)  8"  -  3"  is  divisible  by  5. 


(f)  52"  —  25"  is  divisible  by  7. 

(g)  10"+1  +  10"  +  1  is  divisible  by  3. 

(h)  an  —  bn  is  divisible  by  a  —  b  for  any  integers  a,  b 
with  a  —  b  /  0. 

5.  (a)  [BB]  Prove  by  mathematical  induction  that 


n{n  +  1) 
2 


for  any  natural  number  n. 

(b)  Prove  by  mathematical  induction  that 


l3  +  23  -I - bn3 


n2(n  +  l)2 

4 


for  any  natural  number  n. 

(c)  Use  the  results  of  (a)  and  (b)  to  establish  that 

(1  +  2  +  3  +  •  •  •  +  n)2  =  l3  +  23  +  •  •  •  +  n3 


for  all  n>  1. 

6.  Use  mathematical  induction  to  establish  the  truth  of  each 
of  the  following  statements  for  all  n  >  1 . 

(a)  [BB]  1  +  2  +  22  +  23  +  •  •  •  +  2”  =  2n+]  -  1 

(b)  [BB]  l2  -  22  +  32  -  42  +  •••  +  (-1  )"-V  = 


2 
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(c)  1 2  +  32  +  52  +  ■  •  •  +  (2n  —  1  )2 

(d)  1-2-3  +  2-  3-  4  +  3-  4-  5  +  - 
n(n  +  l)(n  +  2)0  +  3) 


n(2n  —  1)(2  n  +  1) 

'  3 

■+n(n  +  1  )0  +2)  = 


1  I  1 

(C)  ~i~2  +  2U3  +  3^4  +  ' 


n(n  +  1)  n  +  1 


7.  [BB;  (a)]  Rewrite  each  of  the  sums  in  Exercise  6  using 

notation. 

8.  Use  mathematical  induction  to  establish  each  of  the  fol¬ 
lowing  formulas. 


(a)  [BB]  J2(i  +  1)2'  =  n2" 


+  i 


n  O  +  1 ) 

f rf  (2 i  -  1)(2 i  +1)  =  2(2«  +  1) 


(b)  Y — ~ — 

2—i  m  _  i  w 


(c)  £(2/  -  1)(2»)  = 


;=i 


n  (n  +  1)(4  n  —  1) 
3  " 


9.  Use  mathematical  induction  to  establish  each  of  the  fol¬ 
lowing  inequalities. 

(a)  [BB]  2n  >  n2,  for  n  >  5. 

(b)  (1  +  >  I  +  §,  for  n  e  N. 

(c)  For  any  p  e  R,  p  >  —  1,  (1  +  p)n  >  1  +np  for  all 
n  e  N. 

(d)  For  any  integer  n  >  2,  4-  — L  +  _J_  +  •  •  •  +  ^  > 

12 

24' 

«  1 

(e)  Y.  ~~F  >  \/w  f°r  n  ^  2. 

,  =  l  V» 


10.  Suppose  c,  xi ,  X2,  ■  ■  ■  ,  xn,  y\,  y2, . .  ■  ,yn  are  2«  +  1 
given  numbers.  Prove  each  of  the  following  assertions 
by  mathematical  induction. 

n  n  n 

(a)  [BB]  Yix'  +  W)  =  Y,Xi  +  J2  >'i  for  n  >  1 . 

;=i  ;=i  i=i 

n  n 

(b)  Y_,  cx>  =  c  Y1  x‘  ^or  n  —  *  • 

/  =  !  /  =  1 

n 

(c)  ^(.V/  —  x,_i)  =  xn  —  xi  for  n  >  2. 

i=2 

11.  [BB]  Find  the  fault  in  the  following  “proof’  that  in  any 
group  of  n  people,  everybody  is  the  same  age. 

Suppose  n  =  I.  If  a  group  consists  of  just 
one  person,  everybody  is  the  same  age.  Sup¬ 
pose  that  in  any  group  of  k  people,  every¬ 
one  is  the  same  age.  Let  G  =  {a\ ,  ci2, . . .  , 


cik+ 1)  be  a  group  of  k  +  I  people.  Since 

each  of  the  groups  {a[,  ci2 . a^  \  and  (a?, 

«3, . . .  ,  a/c+ 1 }  consists  of  k  people,  every¬ 
body  in  each  group  has  the  same  age,  by 
the  induction  hypothesis.  Since  02  is  in  each 
group,  it  follows  that  all  k  +  1  people  a\, 
«2,  •  ■  •  ,  rft+i  have  the  same  age. 


12.  [BB]  Find  the  fault  in  the  following  “proof  by  mathe¬ 
matical  induction  that 


(2  n  +  l)2 
8 


for  all  natural  numbers  n. 

{2k  +  l)2 

If  1  +  2  +  3  -I - +  k  =  - ,  then 

8 

1  +  2  +  3-f---  +  (&+  1) 

=  (1  +  2  +  3  +  --  -  + &)  +  (&+  1) 
(2k  +  l)2 


+  (k  +  1) 
4k2  +  4k  +  1  +  8  k  +  8 


4  k2+  \2k  +  9 


(2k  +  3)2  [2(k  +  1)  +  l]2 


and  so  truth  for  k  implies  truth  for  k  + 


13.  What  is  wrong  with  the  following  “proof’  that  any  order 
for  n  >  10  pounds  of  fish  can  be  filled  with  only  5-pound 
fish? 


We  use  the  strong  form  of  mathematical 
induction.  Flere  n 0  =  10.  Since  an  order 
for  10  pounds  of  fish  can  be  filled  with 
two  5-pound  fish,  the  assertion  is  true  for 
n  =  10.  Now  let  A:  >  10  be  an  integer 
and  suppose  that  any  order  for  l  pounds  of 
fish,  10  S  f  <  k,  can  be  filled  with  only  5- 
pound  fish.  We  must  prove  that  an  order  for 
k  pounds  can  be  similarly  filled.  But  by  the 
induction  hypothesis,  we  can  fill  an  order 
for  k  —  5  pounds  of  fish,  so,  adding  one 
more  5-pounder,  we  can  fill  the  order  for  k 
pounds.  By  the  Principle  of  Mathematical 
Induction,  we  conclude  that  the  assertion  is 
true  for  all  n  >  1 0. 
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14.  One  of  several  differences  between  the  Canadian  and 
American  games  of  football  is  that  in  Canada,  a  team 
can  score  a  single  point  without  first  having  scored  a 
touchdown.  So  it  is  clear  that  any  score  is  possible  in 
the  Canadian  game.  Is  this  so  in  the  American  game? 
Indeed  this  seems  to  be  the  case,  even  assuming  (this  is 
not  true!)  that  in  the  United  States,  points  can  be  scored 
only  three  at  a  time  (with  a  field  goal)  or  seven  at  a  time 
(with  a  converted  touchdown).  Here  is  an  argument. 

Assume  that  k  points  can  be  achieved  with 
multiples  of  3  or  7.  Here's  how  to  reach 
k+ 1  points.  If  k  points  are  achieved  with  at 
least  two  field  goals,  subtracting  these  and 
adding  a  touchdown  gives  k  +  1  points.  On 
the  other  hand,  if  the  k  points  are  achieved 
with  at  least  two  touchdowns,  subtracting 
these  and  adding  five  field  goals  also  gives 
k  +  1  points. 

Does  this  argument  show  that  any  score  is  possible  in 
American  football?  Can  it  be  used  to  show  something 
about  the  nature  of  possible  scores? 

15.  [BB]  It  is  tempting  to  think  that  if  a  statement  involving 
the  natural  number  n  is  true  for  many  consecutive  values 
of  n,  it  must  be  true  for  all  n.  In  this  connection,  the 
following  example  due  to  Euler  is  illustrative. 

Let  f(n)  =  n2  +  n  +  41. 

(a)  Convince  yourself  (perhaps  with  a  computer  algebra 
package  like  Maple  or  Mathematical  that  fin)  is 
prime  for  n  =  1, 2,  3, . . .  39  but  that  /( 40)  is  not 
prime. 

(b)  Show  that  for  any  n  of  the  form  n  =  k 2  +  40,  f(n) 
is  not  prime. 

16.  [BB]  Prove  that  a  set  with  n  elements,  n  >  0,  contains 
2"  subsets. 

17.  [BB]  Suppose  that  any  nonempty  finite  set  of  natu¬ 
ral  numbers  has  a  smallest  element.  Prove  that  any 
nonempty  set  of  natural  numbers  has  a  smallest  ele¬ 
ment. 

18.  (a)  Prove  that  for  any  integer  n  >  1,  any  set  of  n  pos¬ 

itive  real  numbers  has  a  smallest  element. 

(b)  Prove  that  the  result  of  (a)  is  not  true  for  infinite 
sets  of  positive  real  numbers  in  general  but  that  it 
is  true  for  some  infinite  sets. 

(c)  What  is  the  name  of  the  principle  which  asserts  that 
any  nonempty  set  of  natural  numbers  has  a  smallest 
element? 


19.  [BB]  Let  n  >  1  and  let  A,  B\,  S2,  •  •  •  .  Bn  be  sets.  Gen¬ 
eralize  the  result  of  Exercise  18,  Section  2.2  by  prov¬ 
ing  that  A  U  (n"=i  Bi)  =  n"=i(rl  u  Bi )  for  all 
n  >  1. 

20.  Prove  that  A  D  (  U?=i  Bi )  =  [J''=1(A  G  fl,)  for  any  sets 
A,  B\,  B2,  ■  ■  ■  ,  Bn. 

21.  In  Section  2.2,  we  defined  the  symmetric  difference  of 
two  sets.  More  generally,  the  symmetric  difference  of 
n  >  3  sets  Aj, . . .  ,  A„  can  be  defined  inductively  as 
follows: 

A  i  ©  •  •  •  ©  A„  =  (A  i  ©••■©  A„_i)  ©  A„. 

Prove  that  for  any  n  >  2,  A  i  ©  A2  ©  •  •  ■  ©  A„  consists  of 
those  elements  in  an  odd  number  of  the  sets  Aj . A„ . 

22.  Prove  the  Chinese  Remainder  Theorem,  4.5.1,  by  math¬ 
ematical  induction. 

23.  [BB]  For  n  >  3,  the  greatest  common  divisor  of 
n  nonzero  integers  a\,  a2,  •••  .  an  can  be  defined 
inductively  by 

gcd(oi, ...  ,a„)  =  gcd(ai  ,  gcd(a2, ...  ,a„)). 

Prove  that  gcd (a\.a2 . an)  is  an  integral  linear 

combination  of  aj,  a2 . an  for  all  n  >  2;  that  is, 

prove  that  there  exist  integers  st,....s„  such  that 
gcd(«] . a„)  =  si«]  +  s2a2  - 1-  s„a„. 

24.  The  definition  of  the  greatest  common  divisor  of  n  >  3 
integers  given  in  Exercise  23  differs  from  that  given 
in  the  exercises  to  Section  4.2.  Suppose  a\, ...  ,an  are 

nonzero  integers.  Show  that  gcd(aj . a„),  as  defined 

in  Exercise  23,  satisfies  the  properties  given  in  Defini¬ 
tion  4.2.14. 

25.  Suppose  n  and  m\,m2,  . . .  ,  m,  are  natural  numbers, 
and  that  the  m ,■  are  pairwise  relatively  prime.  Suppose 
each  m,-  divides  n.  Prove  that  the  product  m\m2  ■  ■  ■  mt 
divides  n.  [Hint:  Induction  on  t  and  Exercise  10  of 
Section  4.2.] 

26.  Define  / :  Z  -►  Z  by 

n  —  2  n  >  1000 

fin)  = 

I /(/(«+ 4))  n  <  1000. 

(a)  Find  the  values  of  /(1000  —  n)  for  n  —  0,  1,2, 
3,4,5. 

(b)  Guess  a  formula  for  /( 1000  —  n)  valid  for  n  >  0 
and  prove  your  answer. 

(c)  Find  /( 5)  and  /( 20). 

(d)  What  is  the  range  of  /? 


162  Chapter  5  induction  and  Recursion 


5.1.5  DEFINITION 


A  set  A  of  integers  is  called  an  ideal  if  and  only  if 

(i)  0  e  A, 

(ii)  if  a  e  A,  then  also  —a  e  A ,  and 

(iii)  if  a ,  b  e  A ,  then  a  +  b  e  A. 

27.  For  any  integer  n  >  0,  recall  that  nZ  =  { kn  \  k  e  Z} 
denotes  the  set  of  multiples  of  n. 

(a)  Prove  that  nZ  is  an  ideal  of  the  integers. 

(b)  Let  A  be  any  ideal  of  Z.  Prove  that  A  =  nZ  for 
some  n  >  0  by  establishing  each  of  the  following 
statements. 

i.  If  A  contains  only  one  element,  then  A  is  of  the 
desired  form. 

Now  assume  that  A  contains  more  than  one  ele¬ 
ment. 

ii.  Show  that  A  contains  a  positive  number. 

iii.  Show  that  A  contains  a  smallest  positive  num¬ 
ber  n. 

iv.  nZ  C  A,  where  n  is  the  integer  found  in  iii. 

v.  A  c  nZ.  [Hint:  The  division  algorithm,  4.1.5.] 

28.  [BB]  Prove  that  for  every  integer  n  >  2,  the  num¬ 
ber  of  lines  obtained  by  joining  n  distinct  points  in 
the  plane,  no  three  of  which  are  collinear,  is 
\n(n  —  1). 

29.  An  n-sided  polygon  (commonly  shortened  to  «-gon)  is  a 
closed  planar  figure  bounded  by  n  straight  sides  no  two 
of  which  intersect  unless  they  are  adjacent,  in  which 
case  they  intersect  just  at  a  vertex.  Thus,  a  3-gon  is 
just  a  triangle,  a  4-gon  is  a  quadrilateral,  a  5-gon  is  a 
pentagon,  and  so  on.  An  n-gon  is  convex  if  the  line  join¬ 
ing  any  pair  of  nonadjacent  vertices  lies  entirely  within 
the  figure.  A  rectangle,  for  example,  is  convex.  Prove 
that  the  sum  of  the  interior  angles  of  a  convex  n-gon  is 
(n  -2)180°  for  all  n  >  3. 

30.  Suppose  a  rectangle  is  subdivided  into  regions  by  means 
of  straight  lines  each  extending  from  one  border  of  the 
rectangle  to  another.  Prove  that  the  regions  of  the  “map” 
so  obtained  can  be  colored  with  just  two  colors  in  such 
a  way  that  bordering  "countries”  have  different  colors. 

31.  (a)  [BB]  Given  an  equal  arm  balance  capable  of  deter¬ 

mining  only  the  relative  weights  of  two  quantities 
and  eight  coins,  all  of  equal  weight  except  possibly 
one  which  is  lighter,  explain  how  to  determine  if 
there  is  a  light  coin  and  how  to  identify  it  in  just 
two  weighings. 


(b)  Given  an  equal  arm  balance  as  in  (a)  and  3"  —  1 
coins,  n  >  1,  all  of  equal  weight  except  possibly 
one  which  is  lighter,  show  how  to  determine  if  there 
is  a  light  coin  and  how  to  identify  it  in  at  most  n 
weighings. 


32.  True  or  false?  In  each  case,  give  a  proof  or  provide  a 
counterexample  which  disproves  the  given  statement. 

(a)  [BB]  5"  +  n  +  1  is  divisible  by  7  for  all  n  >  1 

(b)  Yl"= o(k  +  1)  =  n(n  +  3)/2  for  all  n  >  1 


(c)  If  n  >  2,  gcd( 


(n  +  2)!  (n  +  3)L  (n  +  2) ! 
x  '  o  '  —  A 


(d)  (For  students  of  calculus)  n  5  >  2"  for  all  n  >  1 


33.  Let  n  be  any  integer  greater  than  1 .  Show  that  the  fol¬ 
lowing  argument  is  valid.  (See  Section  1.5.) 


P 1  P2 
P2  -*■  Pi 


Pn-l  -»  Pn 
P\  -+  Pn 

34.  (For  students  who  have  completed  a  course  in  differen¬ 
tial  calculus)  State  and  prove  (by  mathematical  induc¬ 
tion)  a  formula  for  j^x"  which  holds  for  all  n  >  1. 


5.1.6  NOTATION 


The  product  of  n  elements  a\ ,  a2,  . . .  ,  a„  is  denoted  n  ar. 

r=  1 


35.  Let  x  be  a  real  number,  x  ^  ±  1.  Prove  that 


]~J(1  +  x2  )  = 

r=  I 


2,,+ 1 


1  —  X 


2 


for  any  integer  n  >  1 . 

36.  [BB]  (For  students  of  calculus)  The  condition  x2  /  1 
is  necessary  in  Exercise  35  since  otherwise  we  would 
have  a  denominator  of  0  on  the  right.  However. 

1  -  jc2f+'  I  -  x2r+l 

lint  - % —  and  lim  - ■= — 

x—*  1  1  —  X1  x — ► —  1  1  —  X ~ 


both  exist.  Use  the  result  of  Exercise  35  to  find  these 
limits. 


37.  Find  an  expression  for  J~j 


2  r  -  1 


r= 2 


2  r 


valid  for  n  >  2  and 


prove  by  mathematical  induction  that  your  answer  is 
correct. 
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38.  For  any  n  >  0,  let  F„  =  22"  +  1.  (Those  numbers 
F„  which  are  prime  are  called  Fermat  primes.  See 
Section  4.3.) 

n- 1 

(a)  Prove  that  |~ |  F,  =  F„  —  2  for  all  n  >  1 . 

r= 0 

(b)  Prove  that  gcd(F,„ :  F„ )  =  1  for  any  positive 
integers  m  and  n  with  m  n. 

(c)  Use  the  result  of  (b)  to  give  another  proof  (different 
from  Euclid’s)  that  there  are  infinitely  many 
primes. 

39.  For  a  given  natural  number  n,  prove  that  the  set  of  all 
polynomials  of  degree  at  most  n  with  integer  coefficients 
is  countable.  [Flint:  Let  P„  denote  the  set  of  all  polyno¬ 
mials  of  degree  at  most  n  with  integer  coefficients.  The 
result  for  Pi  was  Exercise  25(b)  in  Section  3.3.] 

40.  (a)  Prove  that  the  strong  form  of  the  Principle  of  Math¬ 

ematical  Induction  implies  the  Well-Ordering  Prin¬ 
ciple. 

(b)  Prove  that  the  Well-Ordering  Principle  implies  the 
strong  form  of  the  Principle  of  Mathematical  Induc¬ 
tion.  (Assume  no  >  1.) 

41.  In  this  section,  we  have  studied  two  formulations  of  the 
Principle  of  Mathematical  Induction. 

(a)  Use  either  of  these  to  establish  the  following  (pecu¬ 
liar?)  third  formulation. 


Suppose  7(n)  is  a  statement  about  the 
natural  number  n  such  that 

1.  CP(  1 )  is  true; 

2.  For  any  k  >  1,  7{k)  true  implies 
(P(2 k)  true;  and, 

3.  For  any  k  >  2,  CP(Jt)  true  implies 
7(k-  1 )  true. 


(b)  Prove  that  for  any  two  nonnegative  numbers  x  and 

x  +  y  ^  . — 
u  — —  >  -Jxy- 

(c)  Use  the  Principle  of  Mathematical  Induction  in  the 

form  given  in  part  (a)  to  generalize  the  result  of  part 
(b),  thus  establishing  the  arithmetic  mean-geometric 
mean  inequality:  For  any  n  >  1  and  any  n  nonneg¬ 
ative  real  numbers  a\ ,  az . a„. 


42.  Let  m  and  n  be  relatively  prime  integers  each  greater 
than  1.  Assume  you  have  an  unlimited  supply  of  m-  and 
n-cent  stamps.  Using  only  these  stamps,  show  that 


(a)  it  is  not  possible  to  purchase  a  selection  of  stamps 
worth  precisely  mn  —  m  —  n  cents; 

(b)  for  any  r  >  mn  —m  —  n,  it  is  possible  to  purchase 
a  selection  of  stamps  worth  exactly  r  cents.  [Hint: 
There  exist  integers  a  and  b,  with  0  <  a  <  n  and 
0  <  b  <  m  such  that  bn  =  am  —  1  and  hence  such 
that  ( n  —  a)m  =  (m  —  b)n  —  1 .  See  Exercise  34  of 
Section  4.2.  Now  try  downward  induction!] 


5.2  Recursively  Defined  Sequences 

Suppose  n  is  a  natural  number.  How  should  we  define  2"?  We  could  write 

2”  =  2  •  2  -  2  ■  •  •  2 

i  y  J 

n  2’s 

or 

(4)  21  =2  and,  for  k  >  1,  2k+l  =  2  ■  2k. 

The  latter  statement  is  an  example  of  a  recursive  definition .  It  explicitly  defines 
2"  when  n  =  1  and  then,  assuming  2"  has  been  defined  for  n  —  k,  defines  it  for 
n  =  k  +  1.  By  the  Principle  of  Mathematical  Induction,  we  know  that  2"  has  been 
defined  for  all  integers  n  >  1. 

Another  expression  that  is  most  naturally  defined  recursively  is  n\,  which  was 
introduced  in  Section  5.1.  If  we  write 

0!  =  1  and,  for  k  >  0,  (k  +  1)!  =  (k  +  1  )k\. 

then  it  follows  by  the  Principle  of  Mathematical  Induction  that  n !  has  been  defined 
for  every  n  >  0. 

Sequences  of  numbers  are  often  defined  recursively.  A  sequence  is  a  func¬ 
tion  whose  domain  is  some  infinite  set  of  integers  (often  N)  and  whose  range 
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is  a  set  of  real  numbers.  Since  its  domain  is  countable,  we  can  and  usually  do 
describe  a  sequence  by  simply  listing  its  range.  The  sequence  which  is  the  func¬ 
tion  / :  N  — »  R  defined  by  f(n)  =  n2,  for  instance,  is  generally  described  by  the 
list  1,  4,  9,  16, ... ,  the  idea  being  to  write  down  enough  numbers  from  the  start 
of  the  list  that  the  rest  can  be  inferred.  The  numbers  in  the  list  (the  range  of  the 
function)  are  called  the  terms  of  the  sequence.  Sometimes  we  start  counting  at  0 
(if  the  function  has  domain  N U {0))  so  that  the  terms  are  denoted  ao,a\,a2 . 

The  sequence  2,  4,  8,  16, . . .  can  be  defined  recursively  like  this: 

(5)  a i=2  and,  for  k  >  1,  a^+x  =  2 a*. 

By  this,  we  understand  that  a \  =  2  and  then,  setting  k  =  1  in  the  second  part  of 
the  definition,  that  a2  =  2a\  =  2(2)  =  4.  With  k  =  2,  a2  =  2a2  =  2(4)  =  8;  with 
k  =  3,  a4  =  2a3  —  2(8)  =  16  and  so  on.  Evidently,  (5)  defines  the  sequence  we 
had  in  mind.  Again,  the  definition  is  recursive  because  each  term  in  the  sequence 
beyond  the  first  is  defined  in  terms  of  the  previous  term. 

The  equation  a^+x  =  2 in  (5),  which  defines  one  member  of  the  sequence 
in  terms  of  a  previous  one,  is  called  a  recurrence  relation.  The  equation  a\  =  2 
is  called  an  initial  condition. 

There  are  other  possible  recursive  definitions  which  describe  the  same 


sequence  as  (5).  For  example,  we  could  write 

<3o  =  2  and,  for  k  >  0, 

1  =  2  at. 

or  we  could  say 

a  i=2  and,  for  k  >  2, 

ak  =  2ak-\. 

[Verify  that  these  definitions  give  the  same  sequence,  2,  4.  8,  16 . ] 

Sometimes,  after  computing  a  few  terms  of  a  sequence  which  has  been  defined 
recursively,  we  can  guess  an  explicit  formula  for  a„.  In  (5),  for  instance,  an  =  2". 
We  say  that  an  =  2"  is  the  solution  to  the  recurrence  relation.  Our  goal  in  this 
section  and  the  next  is  to  gain  some  skill  at  solving  recurrence  relations. 


PROBLEM  10.  Write  down  the  first  six  terms  of  the  sequence  defined  by  ci\  =  1, 
a/c+ 1  =  3  a*  +  1  for  k  >  1.  Guess  a  formula  for  an  and  prove  that  your  formula  is 
correct. 

Solution.  The  first  six  terms  are 
a\  =  1 

a2  =  2>a\  +  1  =3(1)+  1  =4 
a3  =  3a2  +  1  =  3(4)  +1  =  13 
«4  =  40 
a5  =  121 
«6  =  364. 

Since  there  is  multiplication  by  3  at  each  step,  we  might  suspect  that  3"  is  involved 
in  the  answer.  After  trial  and  error,  we  guess  that  a„  =  J+3''  —  1)  and  verify  this 
by  mathematical  induction. 

When  n  =  1,  the  formula  gives  ^O1  —  1)  =  1,  which  is  indeed  a i,  the  first 
term  in  the  sequence. 
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Now  assume  that  k  >  1  and  that  ak  =  ^(3*  —  1).  We  wish  to  prove  that 
ak+ 1  =  5(3a+1  —  1).  We  have 

flft+i  =  3a*  +  l  =  3±(3*-l)  +  l 


using  the  induction  hypothesis.  Hence, 

ajt+i  =|3*+1-|  +  l  =  ^(3*+'-l) 

as  required.  By  the  Principle  of  Mathematical  Induction,  our  guess  is  correct.  I 


PROBLEM  11.  A  sequence  is  defined  recursively  by  ao  =  1 ,  a\  =  4  and 
a, i  =  4cin-\  —  4an-2  for  n  >  2.  Find  the  first  six  terms  of  this  sequence.  Guess  a 
formula  for  an  and  establish  the  validity  of  your  guess. 

Solution.  Here  there  are  two  initial  conditions — ao  =  1,  a\  =4.  Also,  the 
recurrence  relation,  an  =  4a„-i  —  4a„_2>  defines  the  general  term  as  a  function 
of  two  previous  terms.  The  first  six  terms  of  the  sequence  are 

a0  =  1 

a\  =  4 

a2  —  4  a\  —  4ao  =  4(4)  —  4(1)  =  12 
a3  =  4a2  -  4a,  =  4(12)  -  4(4)  =  32 
a4  =  4 03  -  4a2  =  4(32)  -  4(12)  =  80 

a5  =  4 a4  -  4 a}  =  4(80)  -  4(32)  =  192. 

Finding  a  general  formula  for  a„  requires  some  ingenuity.  Let  us  carefully  examine 
some  of  the  first  six  terms.  We  note  that  a2  =  32  =  24  +  8  =  3(8)  +  8  =  4(8)  and 
a4  =  80  =  64  +  16  =  4(16)  +  16  =  5(16)  and  05  =  192  =  6(32).  We  guess  that 
an  =  (n  +  1)2".  To  prove  this,  we  use  the  strong  form  of  mathematical  induction 
(with  no  =  0). 

When  n  =  0,  we  have  (0+1)2°  =  1(1)  =  1,  in  agreement  with  the  given 
value  for  ao.  When  n  =  1,  (1  +  1)2]  =  4  =  a\.  Now  that  the  formula  has  been 
verified  for  k  =  0  and  k  =  1,  we  may  assume  that  k  >  1  and  that  an  =  (n  +  1)2" 

for  all  n  in  the  interval  0  <  n  <  k.  We  wish  to  prove  the  formula  is  valid  for 

n  =  k\  that  is,  we  wish  to  prove  that  a^  =  (k  +  1)2*.  Since  k  >  2,  we  know 
that  a/c  =  4a* _  1  —  4a*_2.  Applying  the  induction  hypothesis  to  k  —  I  and  to 
k  —  2  (each  of  which  is  in  the  range  0  <  n  <  k),  we  have  a*_  1  =  k2k~x  and 
a*_ 2  =  (k  -  1)2*-2.  Thus, 

ak  =  4(/c2*~')  -  4 (k  -  l)2k~2  =  2k2k  -  k2k  +  2k  =  k2k  +  2k  =  (k  +  1)2* 

as  required.  By  the  Principle  of  Mathematical  Induction,  the  formula  is  valid  for 
all  n  >  0.  I 


In  effect,  our  method  of  verifying  the  formula  an  =  (n  +  1)2"  in  Problem  1 1 
amounts  simply  to  checking  that  it  satisfies  both  initial  conditions  and  also  the 
given  recurrence  relation.  We  prefer  the  more  formal  approach  of  mathematical 
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induction  since  it  emphasizes  this  important  concept  and  avoids  pitfalls  associated 
with  working  on  both  sides  of  an  equation  at  once. 

As  previously  mentioned,  there  is  nothing  unique  about  a  recursive  definition. 
The  sequence  in  the  last  example  can  also  be  defined  by 

«0  =  1,  a\  =4  and,  for  n  >  1,  an+ \  =  4a, ,  —  4a„_i. 

In  this  case,  we  again  obtain  an  =  (n  +  1)2".  We  could  also  say 

a\  =  1,  a2  =  4  and,  for  n  >  1,  a„+ 2  =  4a,,+i  —  4a„ 

but  then,  labeling  the  first  term  a\  instead  of  ao  would  give  a„  —  Other 

variants  are  also  possible. 

Some  Special  Sequences 

Suppose  you  have  $50  in  an  old  shoe  box  and  acquire  a  paper  route  which  nets 
you  $14  a  week.  Assuming  all  this  money  goes  into  your  shoe  box  on  a  weekly 
basis  (and  you  never  borrow  from  it),  after  your  first  week  delivering  papers, 
your  shoe  box  will  contain  $64;  after  two  weeks,  $78;  after  three  weeks,  $92 
and  so  on.  A  sequence  of  numbers  like  50,  64,  78,  92, ... ,  where  each  term  is 
determined  by  adding  the  same  fixed  number  to  the  previous  one,  is  called  an 
arithmetic  sequence.  The  fixed  number  is  called  the  common  difference  of  the 
sequence  (because  the  difference  of  successive  terms  is  constant  throughout  the 
sequence). 


EXAMPLES  12 


•  50,  64,  78,  92,  . . .  is  an  arithmetic  sequence  with  common  difference  14; 

•  -17,  -12,  -7,  —2,  3,  8, . . .  is  an  arithmetic  sequence  with  common  differ¬ 
ence  5; 

•  103,  99,  95,  91, . . .  is  an  arithmetic  sequence  with  common  difference  — 4.^ 


5.2.1  DEFINITION 


The  arithmetic  sequence  with  first  term  a  and  common  difference  d  is  the  sequence 
defined  by 


a\  =  a  and,  for  k  >  1,  a*+ 1  =  a*  +  d. 


The  general  arithmetic  sequence  thus  takes  the  form 
a,  a  +  d,  a  +  2d,  a  +  3 d,  . . . 


and  it  is  easy  to  see  that,  for  n  >  1,  the  nth  term  of  the  sequence  is 


(6) 


an  —  a  +  (n  —  l)d. 


We  leave  a  formal  proof  to  the  exercises  and  also  a  proof  of  the  fact  that  the  sum 
of  n  terms  of  the  arithmetic  sequence  with  first  term  a  and  common  difference 
d  is 

S  =  -  \2a  +  (n  -  1  )d] . 


(7) 


EXAMPLE  13 


EXAMPLES  14 


5.2.2  DEFINITION 
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The  first  100  terms  of  the  arithmetic  sequence  —17,  —12,  —7,  —2,  3,  . . .  have  the 
sum 

5  =  [2(  — 17)  +  99(5)]  =  50(— 34  +  495)  =  23,050. 

The  100th  term  of  this  sequence  is  a\o o  =  —17  +  99(5)  =  478  by  (6).  The  number 
2038  occurs  as  the  412th  term,  as  we  see  by  solving  — 17  +  (n  —  1)5  =  2038.  ▲ 

Many  people  with  paper  routes  deposit  their  earnings  in  a  bank  account  which 
pays  interest  instead  of  into  a  shoe  box,  which  does  not.  Fifty  dollars  in  a  bank 
account  which  pays  1%  interest  per  month  accumulates  to  50  +  (.01  x  50)  = 
50(1  +  .01)  =  50(1.01)  dollars  after  one  month.  After  another  month,  the  original 
investment  will  have  accumulated  to  what  it  was  at  the  start  of  the  month  plus 
1%  of  this  amount;  that  is, 

50(1.01)  +  .01  (50)  (1.01)  =  50(1.01)(1  +  .01)  =  50(1.01)2. 

After  three  months,  the  accumulation  is  50(1. 01)3  dollars;  after  twelve  months,  it 
is  50(1. 01)12  dollars  («s  $56.34).  A  sequence  of  numbers  such  as 

50,  50(1.01),  50(1 .01)2,  ... 

in  which  each  term  is  determined  by  multiplying  the  previous  term  by  a  fixed 
number  is  called  a  geometric  sequence.  The  fixed  number  is  called  the  common 
ratio. 

•  50,  50(1.01),  50(1. 01)2,  ...  is  a  geometric  sequence  with  common  ratio  1.01; 

•  3,  —6,  12,  —24, ...  is  a  geometric  sequence  with  common  ratio  —2; 

•  9,  3,  1,  j, . . .  is  a  geometric  sequence  with  common  ratio  ▲ 

The  geometric  sequence  with  first  term  a  and  common  ratio  r  is  the  sequence 
defined  by 

a\  =  a  and,  for  k  >  1,  a^+x  =  ra k. 


The  general  geometric  sequence  thus  has  the  form 

2  S  4 

a,  ar.  ar  ,  ar  ,  ar  , . . . 


the  nth  term  being  an  =  arn  1 .  This  is  straightforward  to  prove,  as  is  the  following 
formula  for  the  sum  S  of  n  terms,  provided  r  +  1 . 


(8) 


a( 1  —  rn) 
1  -r 


The  sum  of  29  terms  of  the  geometric  sequence  with  a  =  8 12  and  r  =  —l  is 


_  o  1 2 


5  =  8 


i-(-+) 


=  2 


36  =  2^±2l  ^  i(237  +  28). 


EXAMPLE  15 
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What  is  the  30th  term  of  the  geometric  sequence  just  described? 

Leonardo  Fibonacci,1  also  known  as  Leonardo  of  Pisa,  was  one  of  the  bright¬ 
est  mathematicians  of  the  Middle  Ages.  His  writings  in  arithmetic  and  algebra 
were  standard  authorities  for  centuries  and  are  largely  responsible  for  the  intro¬ 
duction  into  Europe  of  the  Arabic  numerals  0,  1,  •  •  •  ,  9  we  use  today.  Fibonacci 
was  fond  of  problems,  his  most  famous  of  which  is  concerned  with  rabbits! 

Suppose  that  newborn  rabbits  start  producing  offspring  by  the  end  of  their  sec¬ 
ond  month  of  life  and  that  after  this  point,  they  produce  a  pair  a  month  (one  male, 
one  female).  Assuming  just  one  pair  of  rabbits  initially,  how  many  pairs  of  rab¬ 
bits,  Fibonacci  asked,  will  be  alive  after  one  year?  The  sequence  which  gives  the 
number  of  pairs  at  the  end  of  successive  months  is  the  famous  Fibonacci  sequence. 

After  one  month,  there  is  still  only  one  pair  of  rabbits  in  existence,  but  after  a 
further  month,  this  pair  is  joined  by  its  offspring;  thus,  after  two  months,  there  are 
two  pairs  of  rabbits.  At  the  end  of  any  month,  the  number  of  pairs  of  rabbits  is  the 
number  alive  at  the  end  of  the  previous  month  plus  the  number  of  pairs  alive  two 
months  ago,  since  each  pair  alive  two  months  ago  produced  one  pair  of  offspring.  We 
obtain  the  sequence  1 ,  1 , 2,  3,  5,  8,  13,...,  which  is  defined  recursively  as  follows. 

f\  =  L  /2  =  1  and,  for  k  >  2,  fk+\  =  /*.  +  fk- 1 


5.2.3  THE  FIBONACCI 
SEQUENCE 


Think  of  the  Fibonacci  sequence  as  a  function  fib:  N  ->  N.  List  eight  elements 
of  this  function  as  ordered  pairs.  Is  fib  a  one-to-one  function?  Is  it  onto? 

What  is  the  answer  to  Fibonacci's  question? 


Although  we  have  found  an  explicit  formula  for  the  nth  term  of  most  of  the 
sequences  discussed  so  far,  there  are  many  sequences  for  which  such  a  formula  is 
difficult  or  impossible  to  obtain.  (This  is  one  reason  why  recursive  definitions  are 
important.)  Is  there  a  specific  formula  for  the  nth  term  of  the  Fibonacci  sequence? 
As  a  matter  of  fact,  there  is,  though  it  is  certainly  not  one  which  many  people 
would  discover  by  themselves.  We  show  in  the  next  section  (see  Problem  20)  that 
the  nth  term  of  the  Fibonacci  sequence  is  the  closest  integer  to  the  number 


(9) 


1  / 1  +  n/5\ 


The  first  few  values  of  ~^(  l+-/7  are  approximately  0.72361,  1.17082,  1.89443, 
3.06525,  4.95967,  8.02492,  12.98460,  and  21.00952;  the  integers  closest  to  these 
numbers  are  the  first  eight  terms  of  the  Fibonacci  sequence. 

Finally,  note  that  we  must  be  careful  with  sequences  apparently  defined  recur¬ 
sively,  since  some  recursive  definitions  do  not  define  actual  sequences!  Consider, 
for  example. 


a\  =  1  and,  for  k  >  1,  ak 


1  +  a*/2  if  k  is  even 
1  +  as^-i  if  k  is  odd. 


1  Fibonacci  was  born  around  the  year  1 1 80  and  died  in  1 228.  “Fibonacci"  is  a  contraction  of  “Filius 
Bonaccii,"  Latin  for  “son  of  Bonaccio.” 
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What  happens  if  we  try  to  write  down  the  first  few  terms  of  this  sequence? 
a\  =  1 

222  =  1  +  221  =  1  +  1=  2 

<23  =  1+^8  =  !  +  (!  +  224)  =  2  +  a4  =  2  +  (l  +  02)  —  3  +  222  —  5 
<24  =  1  +  £22  —  1  +  2  —  3, 

but  then 


225  —  1  +  22]4  —  1  +  (1  +  227)  —  2  +  227 

=  2  +  (1  +  2220)  =  3  +  2220  =  3  +  (l  +  £2io) 
=  4  +  22]0  =  4  +  (1  +  225)  =  5  +  225 


Answers  to  Pauses 


and,  to  our  dismay,  we  have  reached  the  absurdity  5  =  0.  Obviously,  no  sequence 
has  been  defined. 


Compute  the  first  six  terms  of  the  sequence  “defined”  as  follows: 

22i  =  1  and,  for  k  >  1,  a*  =  I  I  ! 

*  1  1  +  223/fe+l  1 

5.  Here  22  =  30,  22  =  812  =  (23)12  =  236  and  r  =  -(2-1),  so 


if  k  is  even 
if  k  is  odd. 


2230  =  236[-(2“')]zy  =  — 2J02~zy  =  —2'  =  -128. 


2-1  m29 


36->-29 


6.  The  most  obvious  eight  pairs  are  (1,  1),  (2,  1),  (3,  2),  (4,  3),  (5,  5),  (6,  8), 
(7,  13),  and  (8,  21).  The  Fibonacci  function  is  not  one-to-one  because  (1,  1)  e 
fib  and  (2,  1)  €  fib  but  1  ^  2.  It’s  not  onto  since,  for  example,  4  is  not  in 
the  range. 

7.  The  number  of  rabbits  in  existence  after  twelve  months  is  the  thirteenth  term 
of  the  Fibonacci  sequence,  233. 

8.  22|  =  1 ;  222  =  1  +  22]  =  1  +  1  =  2;  223  =  1  +  22  ]0  =  1  +  1  +  225  =  2  +  1  +  2216  = 
3  +  22|6-  Now  2216  =  1  +  228  =  1  +  1  +  224  =  1  +  1  +  1  +  222  =  3  +  2  =  5,  SO 
that  223  =  3  +  5  =  8.  Continuing,  224  =  1  +222  =  3;  225  =  1  +2216  =  1  +  5  =  6; 
226  =  1  +  223  =  1+8  =  9.  The  first  six  terms  are  1 , 2,  8,  3.  6,  9. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Give  recursive  definitions  of  each  of  the  following 
sequences: 

(a)  [BB]  1,5,52,53,54, ... 

(b)  5,3,  1,-1, -3,... 

(c)  4,  1,3,  -2,5,  -7,  12,  -19,  31, ... 

(d)  1,2,0,  3,  -1,4,  -2, ... 

2.  (a)  [BB]  Find  the  first  seven  terms  of  the  sequence  {«„} 

defined  by 


22 1  =16,  and  for  k  >  1, 


222+1 


1  if  ak  =  I 

\ak  if  ak  is  even 

j(22/t  —  1)  if  22a-  /  1  is  odd. 


(b)  Repeat  part  (a)  with  <21  =  17. 

(c)  Repeat  part  (a)  with  <21  =  18. 

(d)  Repeat  part  (a)  with  a  \  =  100. 
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3.  Let  A],  02,03, . . .  be  the  sequence  defined  by  a\  =  1, 
cik+i  =  ~iak  for  k  >  I.  Prove  that  a„  =  3'1_l  for  all 
rt  >  1. 

4.  [BB]  Suppose  a\,  02,  <73, . . .  is  a  sequence  of  integers 
such  that  a]  =  0  and,  for  n  >  1,  a„  =  n?  +an-\.  Prove 
that  an  =  ("-1>(«+2h„2+»+2)  for  every  integer  n  >  1. 

5.  Define  the  sequence  «i,  «2>  <23, . . .  by  ai  =  0,  02  =  \ 
and  ak+2  =  \(ak  +  «/t+i )  for  k  >  1.  Find  the  first  seven 
terms  of  this  sequence.  Prove  that  an  =  ^(l  —  ( —  5)”—I) 
for  every  n  >  1 . 

6.  [BB]  Let  a \ ,  02,  a-}, . . .  be  the  sequence  defined  by  a\  = 
1  and,  for  n  >  1,  </„  =  2a, ,  \  +  1.  Write  down  the  first 
six  terms  of  this  sequence.  Guess  a  formula  for  a„  and 
prove  that  your  guess  is  correct. 

7.  Let  a\,  <22,  <23, . . .  be  the  sequence  defined  by  a\  =  ^ 
and  a„  —  5 a„_i  —  1  for  n  >  2.  Write  down  the  first 
six  terms  of  this  sequence.  Guess  a  formula  for  an  and 
prove  that  your  guess  is  correct. 

8.  Suppose  ao,  a\ ,  fl2.  •  •  •  is  a  sequence  such  that  ao  = 
a\  =  1  and  for  n  >  1,  a„+ 1  =  n(a„  +  a„- 1). 

(a)  Find  az ,  <23,  «4  and  05. 

(b)  Guess  a  formula  for  an,  valid  for  n  >  0,  and  use 
mathematical  induction  to  prove  that  your  guess  is 
correct. 

9.  [BB]  Consider  the  sequence  defined  by  a\  =  I,  an+\  = 
(n  +  l)2  —  a„  for  n  >  1.  Find  the  first  six  terms.  Guess 
a  general  formula  for  an  and  prove  that  your  answer  is 
correct. 

10.  Let  a\,  a2,  03, . . .  be  the  sequence  defined  by  eq  =  1, 
a/c+\  =  (k  +  \)dk  for  k  >  1.  Find  a  formula  for  a„  and 
prove  that  your  formula  is  correct. 

11.  [BB]  Suppose  a  1,02,03, ...  is  a  sequence  of  integers 
such  that  <21  =  0,  az  =  1  and  for  n  >  2,  a„  =  4a„_2- 
Guess  a  formula  for  a„  and  prove  that  your  guess  is 
correct. 

12.  A  sequence  is  defined  recursively  by  ao  =  2,  a\  =3 
and  a„  =  3a„_i  —  2a„-2  for  n  >  2. 

(a)  Find  the  first  five  terms  of  this  sequence. 

(b)  Guess  a  formula  for  an . 

(c)  Verify  that  your  guess  in  (b)  is  correct. 

(d)  Find  a  formula  for  an  which  involves  only  one  pre¬ 
ceding  term. 

13.  Let  a \.  az.  «3, ...  be  the  sequence  defined  by  a  1  =  1, 
C12  =  0  and  for  n  >  2,  a„  =  4a„_i  —  4fl„_2.  Prove  that 
a„  =  2"(1  -  f)  for  all  n  >  1. 


14.  Let  a\,az,as, . . .  be  the  sequence  defined  by 

a i  =  l,  and  for  k  >  1,  at+\  =  k2ak. 

Find  the  first  six  terms  of  this  sequence.  Guess  a  general 
formula  for  an  and  prove  your  answer  by  mathematical 
induction. 

15.  Consider  the  arithmetic  sequence  with  first  term  2  and 
common  difference  3. 

(a)  [BB]  Find  the  first  ten  terms  and  the  123rd  term  of 
this  sequence. 

(b)  [BB]  Does  752  belong  to  this  sequence?  If  so,  what 
is  the  number  of  the  term  where  it  appears? 

(c)  Repeat  (b)  for  1023  and  4127. 

(d)  [BB]  Find  the  sum  of  the  first  75  terms  of  this 
sequence. 

16.  [BB]  Consider  the  arithmetic  sequence  with  first  term  7 
and  common  difference  —5. 

(a)  Find  the  17th  and  92nd  terms. 

(b)  Find  the  sum  of  the  first  38  terms. 

17.  An  arithmetic  sequence  begins  1 16,  109,  102. 

(a)  Find  the  300th  term  of  this  sequence. 

(b)  Determine  whether  or  not  —480  belongs  to  this 
sequence.  If  it  does,  what  is  its  term  number? 

(c)  Find  the  sum  of  the  first  300  terms  of  the  sequence. 

18.  The  arithmetic  sequence  4,  15,  26,  37, . . .  begins  with  a 
perfect  square,  that  is,  an  integer  of  the  form  k2,  where 
k  is  also  an  integer.  Find  the  next  three  perfect  squares 
in  this  sequence. 

19.  Establish  formulas  (6)  [BB]  and  (7)  for  the  nth  term  and 
the  sum  of  the  first  n  terms  of  the  arithmetic  sequence 
with  first  term  a  and  common  difference  d. 

20.  [BB]  Consider  the  geometric  sequence  with  first  term 
59,049  and  common  ratio  — 

(a)  Find  the  first  ten  terms  and  the  33rd  term  of  this 
sequence. 

(b)  Find  the  sum  of  the  first  12  terms. 

21.  Consider  the  geometric  sequence  which  begins 
-3072,  1536,  -768. 

(a)  Find  the  13th  and  20th  terms  of  this  sequence. 

(b)  Find  the  sum  of  the  first  nine  terms. 

22.  If  the  first  term  of  a  geometric  sequence  is  48  and  the 
sixth  term  is  —  find  the  sum  of  the  first  ten  terms. 

23.  (a)  [BB]  Find,  to  four  decimal  places,  the  129th 

term  of  the  geometric  sequence  which  begins 
-0.00001240,  0.00001364. 

(b)  [BB]  Find  the  approximate  sum  of  the  first  129 
terms  of  the  sequence  in  (a). 
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24.  [BB]  Verify  formula  (8)  for  the  sum  of  n  terms  of  a 
geometric  sequence  with  first  term  a  and  common  ratio 
r#  1. 

25.  Consider  the  sequence  defined  recursively  by  a\  =  1 
and  for  n  >  l,  a„  =  Yl'i=i  °i •  Write  down  the  first  six 
terms  of  this  sequence,  guess  a  formula  for  an  valid  for 
n  >  2,  and  prove  your  answer. 

26.  (a)  Find  the  sum  of  1 8  terms  of  the  geometric  sequence 

with  first  term  7/1024  and  common  ratio  8. 

(b)  [BB]  Suppose  |r|  <  1.  Explain  why  the  sum  of  the 
first  n  terms  of  the  geometric  sequence  with  first 
term  a  and  common  ratio  r  is  approximately 

too  ^ 

(c)  [BB]  Approximate  ^  -y. 

*=o  2 

(d)  Find  the  approximate  sum  of  the  first  1  million  terms 
of  the  geometric  sequence  which  begins  144,  48,  16. 

27.  (a)  Find  the  19th  and  100th  terms  of  the  geometric 

sequence  which  has  first  term  98,415  and  common 
ratio  j. 

(b)  Find  the  sum  of  the  first  15  terms  of  the  sequence 
in  (a). 

(c)  Find  the  approximate  sum  of  the  first  10,000  terms 
of  the  sequence  in  (a). 

28.  Given  that  each  sum  below  is  the  sum  of  part  of  an 
arithmetic  or  geometric  sequence,  find  each  sum. 

(a)  [BB]  75  +  71  +  67  +  63  +  ■  •  ■  +  (-61) 

(b)  75+15  +  3+  !  +  --  '  +  ^7- 

(c)  —52  —  41  —  30  —  19  + - (-  949 

(d)  '-3  +  3-  5  + - F  jHt 

(e)  2  +  6+ 18  +  54  + •••+’354,294 

29.  [BB]  Is  it  possible  for  an  arithmetic  sequence  to  be  also 
a  geometric  sequence?  Explain  your  answer. 

30.  A  bank  account  pays  interest  at  the  rate  of  1001%  a  year. 
Assume  an  initial  balance  of  P,  which  accumulates  to 
sn  after  n  years. 

(a)  Find  a  recursive  definition  for  sn. 

(b)  Find  a  formula  for  s„. 

31.  [BB]  Maurice  borrows  $1000  at  an  interest  rate  of  15%, 
compounded  annually. 

(a)  Flow  much  does  Maurice  owe  after  two  years? 

(b)  In  how  many  years  will  the  debt  grow  to  $2000? 

32.  On  January  1,  2001,  you  have  $50  in  a  savings  account, 
which  pays  interest  at  the  rate  of  1%  per  month.  At  the 
end  of  January  and  at  the  end  of  each  month  there¬ 
after,  you  deposit  $56  to  this  account.  Assuming  no 


withdrawals,  what  will  be  your  balance  on  January  1, 

2002? 

33.  On  June  1,  you  win  $1  million  in  a  lottery  and  imme¬ 
diately  acquire  numerous  “friends,”  one  of  whom  offers 
you  the  deal  of  a  lifetime.  In  return  for  the  million,  she’ll 
pay  you  a  cent  today,  two  cents  tomorrow,  four  cents 
the  next  day,  eight  cents  the  next,  and  so  on,  stopping 
with  the  last  payment  on  June  21. 


(a)  Assuming  you  take  this  deal,  how  much  money  will 
you  receive  on  June  21? 

(b)  Should  you  take  the  deal?  Explain. 

(c)  Would  you  take  the  deal  if  payments  continued  for 
the  entire  month  of  June? 

34.  Define  a  sequence  [a„ }  recursively  as  follows: 

ao  =  0,  and  for  n  >  0,  an  =  a|_n/5j  +  a|>/5j  +  n. 

Prove  that  a„  <  20 n  for  all  n  >  0.  (Recall  that 
Lx]  denotes  the  floor  of  the  real  number  x.  See  para¬ 
graph  3.1.6.) 

I  +  \/5 

35.  The  number  d>  =  -  %  1.618  is  known  as  the 

2 

golden  mean.  It  has  many  remarkable  properties.  For 
instance,  the  geometric  sequence  1 , 4>,  <t>2, . . .  satisfies 
the  Fibonacci  recurrence  relation  an+ 1  =  an  +  «„_]. 
Establish  this  fact. 

36.  [BB]  Suppose  we  think  of  the  Fibonacci  sequence  as 

going  backward  as  well  as  forward.  What  seven  terms 
precede  1 ,  1 . 2,  3,  5,  8, . . .  ?  How  is  related 

to  /„? 

37.  [BB]  Let  {/„}  denote  the  Fibonacci  sequence.  Prove  that 
fn+ifn  =  E"=  1  f}  for  all  n  >  1. 

38.  Represent  the  Fibonacci  sequence  by  /[  =  fa  =  1, 
f„  =  /„_  1  +  fn-2  for  n  >  2. 

(a)  Verify  the  formula  /1+/2  +  /34 - Vf„  =  fn+ 2  - ' 

for  n  =  4,  5,  6. 

(b)  Prove  that  the  formula  in  (a)  is  valid  for  all  n  >  1. 

39.  Show  that,  for  n  >  2,  the  nth  term  of  the  Fibonacci 
sequence  is  less  than  (7/4)n~*.  [Use  the  definition  of 
the  Fibonacci  sequence,  not  the  approximation  to  /„ 
given  in  equation  (9).] 

40.  [BB]  What  is  wrong  with  the  following  argument,  which 
purports  to  prove  that  all  the  Fibonacci  numbers  after  the 
first  two  are  even? 


Let  /„  denote  the  nth  term  of  the  Fibonacci 
sequence.  We  prove  that  /„  is  even  for  all 
n  >  3  using  the  strong  form  of  the  Principle 
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of  Mathematical  Induction.  The  Fibonacci 
sequence  begins  1 ,  1,2.  Certainly  fo  =  2  is 
even  and  so  the  assertion  is  true  for  no  =  3. 

Now  let  k  >  3  be  an  integer  and  assume  that 
the  assertion  is  true  for  all  n,  3  <  n  <  k\ 
that  is,  assume  that  /„  is  even  for  all  n  < 
k.  We  wish  to  show  that  the  assertion  is 
true  for  n  =  k;  we  wish  to  show  that  fk  is 
even.  But  fk  =  fk  ,  +  fk  2.  Applying  the 
induction  hypothesis  to  k  —  1  and  to  k  —  2, 
we  conclude  that  each  of  fk~\  and  fk-2  is 
even,  hence,  so  is  the  sum.  By  the  Principle 
of  Mathematical  Induction,  /„  is  even  for 
all  n  >  3. 

41.  Let  /i  =  f2  =  1,  fk  =  fk- 1  +  fk- 2  for  k  >  2  be  the 
Fibonacci  sequence.  Which  terms  of  this  sequence  are 
even?  Prove  your  answer. 

42.  For  n  >  1 ,  let  a„  denote  the  number  of  ways  to  express  n 
as  the  sum  of  natural  numbers,  taking  order  into  account. 
For  example,  3  =  3  =  14-1  +  1=  2+1  =  1+2,  so 
a3  =  4. 

(a)  [BB]  Find  the  first  five  terms  of  the  sequence  {a,,}. 

(b)  Guess  and  then  establish  a  formula  for  an. 

43.  For  n  >  1 ,  let  bn  denote  the  number  of  ways  to  express 
n  as  the  sum  of  l’s  and  2’s,  taking  order  into  account. 
Thus,  b4  =  5  because  4  =  l  +  l  +  l  +  l  =  2  +  2  = 
2+1  +  1  =  1  +  1+  2=  1+2+1. 

(a)  Find  the  first  five  terms  of  the  sequence  { bn }. 

(b)  Find  a  recursive  definition  for  b„  and  identify  this 
sequence. 

44.  (a)  [BB]  Let  an  be  the  number  of  ways  of  form¬ 

ing  a  line  of  n  people  distinguished  only  by  sex. 
For  example,  there  are  four  possible  lines  of  two 
people — MM,  MF,  FM,  FF — so  ai  —  4.  Find  a 
recurrence  relation  satisfied  by  a„  and  identify  the 
sequence  a\ ,  ai,  . . .  . 

(b)  Let  an  be  the  number  of  ways  in  which  a  line  of  n 
people  can  be  formed  such  that  no  two  males  are 
standing  beside  each  other.  For  example,  ak  =  5 
because  there  are  five  ways  to  form  lines  of  three 
people  with  no  two  males  beside  each  other;  namely, 
FFF,  MFF,  FMF,  FFM,  MFM.  Find  a  recurrence 
relation  satisfied  by  a„  and  identify  the  sequence 

a i ,  02 . «3 . 


45.  Define  the  Fibonacci  sequence  by  f\  =  /2  =  1,  /n+i  = 
fn  +  fn—i  for  n  >  2. 

(a)  Prove  that  gcd(/„+i,  f„)  =  1  for  all  n  >  1. 

(b)  Prove  that  f„  =  fn-m+ifm  +  fn-mfm- 1  for  any 
positive  integers  n  and  m  with  n  >  m  >  1. 

(c)  Prove  that  for  any  positive  integers  n  and  m,  the 
greatest  common  divisor  of  /„  and  /„,  is  /gcd(n,m)- 

46.  Suppose  u„  and  v„  are  sequences  defined  recursively  by 

«i  =  0,  tq  =  1,  and,  for  n  >  1, 

Un+ 1  =  2  if  0/  +  Ufl),  Vn+ 1  =  4(^11  +  3l>„). 


(a)  Prove  that  v„  —  u„  =  for  n  >  1 . 

(b)  Prove  that  un  is  an  increasing  sequence;  that  is, 
un+ 1  >  un  for  all  n  >  1. 

(c)  Prove  that  vn  is  a  decreasing  sequence;  that  is, 
uH+i  <  v„  for  all  n  >  1. 

(d)  Prove  that  un  =  |  —  g  •  for  all  n  >  1 . 

(This  problem  is  taken  from  a  Portuguese  examination 
designed  to  test  the  level  of  mathematical  knowledge 
of  graduating  high  school  students.  It  was  reprinted  in 
Focus ,  the  newsletter  of  the  Mathematical  Association 
of  America  13,  no.  3,  June  1993,  p.  13.) 


5.2.4  DEFINITION 


The  powers  of  a  function  f:  A  —>■  A  are  defined  recur¬ 
sively  by 


/'  =/ 

47.  Suppose  /;  N 

f(m)  = 


and,  for  n  >  1 ,  /" 

-»■  N  is  defined  by 


if  m  =  0 


4m—  1 


4m  +  l 


if  m  = 


if  m  =  2 


/  o  r 


(mod  3) 
(mod  3) 
(mod  3). 


(a)  Prove  that  /  is  one-to-one  and  onto. 

(b)  [BB]  Find  the  first  ten  terms  of  the  sequence  /"( 1). 

(c)  Find  the  sequence  /"  (2),  n  >  1. 

(d)  Find  the  sequence  /"( 4),  n  >  1. 

(e)  Find  the  first  ten  terms  of  the  sequence  /"( 8), 


2It  is  unknown  whether  or  not  the  terms  of  this  sequence  ever  repeat. 
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48.  Define  g:  N  ->  N  by  g(m),  g2(m),  g3(m), ...  eventually  terminates  with  1. 

Verify  this  assertion  for  each  of  the  five  integers  m  = 
341,  96,  104,  336,  and  133.3 

49.  (For  students  who  have  had  a  course  in  linear  algebra) 
This  function  is  known  as  the  “3 m  +  1”  function.  It  is  Give  a  recursive  definition  of  the  determinant  of  an  n  x  n 

suspected  that  for  any  starting  number  m,  the  sequence  matrix,  for  n  >  1. 

5.3  Solving  Recurrence  Relations;  The  Characteristic  Polynomial 

Recursively  defined  sequences  were  introduced  in  the  previous  section.  Given  a 
particular  recurrence  relation  and  certain  initial  conditions,  you  were  encouraged 
to  guess  a  formula  for  the  nth  term  and  prove  that  a  guess  was  correct.  Guessing 
is  an  important  tool  in  mathematics  and  a  skill  which  can  be  sharpened  through 
practice,  but  we  now  confess  that  there  is  a  definite  procedure  for  solving  most 
of  the  recurrence  relations  we  have  encountered  so  far. 

In  this  section,  we  describe  a  procedure  for  solving  recurrence  relations  of 
the  form 

(10)  a„  =  ra„-\  +  sa„- 2  +  fin ) 

where  r  and  5  are  constants  and  f(n)  is  some  function  of  n.  Such  a  recurrence 
relation  is  called  a  second  order  linear  recurrence  relation  with  constant  coeffi¬ 
cients.  If  fin )  =  0,  the  relation  is  called  homogeneous.  Second  order  refers  to  the 
fact  that  the  recurrence  relation  (10)  defines  an  as  a  function  of  the  two  terms  pre¬ 
ceding  it,  linear  to  the  fact  that  the  terms  of  the  sequence  appear  by  themselves, 
to  the  first  power,  and  with  just  constant  coefficients.  You  should  consult  more 
specialized  books  in  combinatorics  for  a  general  treatment  of  constant  coefficient 
recurrence  relations  where  an  is  a  function  of  any  number  of  terms  of  the  form 
ca„-j,  c  e  R.4 

EXAMPLES  16  Here  are  some  second  order  linear  recurrence  relations  with  constant  coefficients. 

•  an  =  a„_i  +a„_2,  the  recurrence  relation  which  appears  in  the  definition  of  the 
Fibonacci  sequence.  This  is  homogeneous  with  r  =  s  =  1 .  Notice  that  we  have 
modified  slightly  the  definition  an+\  =  an  +an- 1  given  in  paragraph  5.2.3  so 
that  it  is  readily  seen  to  be  of  the  type  we  are  considering  here. 

•  an  =  5a„_i  —  6an-2  +  n.  Here  r  =  5,  s  =  —6,  fin)  =  n. 

•  a„  =  3a„_|.  This  is  homogeneous  with  r  =  3,  s  =  0.  A 

EXAMPLES  17  Consider  the  following  two  recurrence  relations. 

•  an  =  5  an—\  3nn_3 

•  (Xfy  —  Q.yi—\Qfi — 2  d-  tt 


g(m)  = 


2 

3m+l 


if  m  is  even 
if  m  is  odd. 


3While  this  conjecture  has  been  established  for  all  integers  m  <  240  ~  1013,  it  is  unknown  whether  it 
holds  for  all  integers!  This  problem  has  attracted  the  interest  of  many  people,  some  of  whom  have  offered  a 
sizeable  monetary  reward  for  its  solution!  We  refer  the  interested  reader  to  the  excellent  article,  “The  3x  +  1 
Problem  and  Its  Generalizations,"  by  Jeffrey  C.  Lagarias,  American  Mathematical  Monthly  92  ( 1 985 )  no.  I , 
1-23. 

4See,  for  example,  Alan  Tucker,  Applied  Combinatorics  Wiley,  New  York:  (1980). 
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5.3.1  THEOREM 


Neither  is  of  interest  to  us  in  this  section.  The  first  is  not  second  order  while  the 
second  is  not  linear.  A 


With  the  homogeneous  recurrence  relation  an  =  ran-\  +  san  2,  which  can 
be  rewritten  in  the  form 

an  ~  ran-\  -  san-2  =  0, 
we  associate  the  quadratic  polynomial 

x2  —  rx  —  s, 

which  is  called  the  characteristic  polynomial  of  the  recurrence  relation.  Its  roots 
are  called  the  characteristic  roots  of  the  recurrence  relation.  For  example,  the 
recurrence  relation  an  =  5a„_i  —  6a„_2  has  characteristic  polynomial  x2  —  5x  +  6 
and  characteristic  roots  2  and  3. 

The  following  theorem,  whose  proof  is  left  to  the  exercises,  shows  how  to 
solve  any  second  order  linear  homogeneous  recurrence  relation  with  constant 
coefficients. 


Let  x\  and  x2  be  the  roots  of  the  polynomial  x2  —  rx  —  s.  Then  the  solution  of 
the  recurrence  relation  a„  =  ran- \  +  san~ 2,  n  >  2,  is 


Ic\x1  +  c2x'i  if  X]  4-  x-) 

c\x  +  c2nx  if  xi  =  x2  =  x. 

In  each  case,  c\  and  c2  are  constants  determined  by  initial  conditions. 


PROBLEM  18.  Solve  the  recurrence  relation  a„  =  5an-\  —6an-2,  n  >  2,  given 
ay  =  1,  a\  =  4. 

Solution.  The  characteristic  polynomial,  x1  —  5x  -\-  6,  has  distinct  roots  x\  = 
2,  *2  =  3.  Theorem  5.3.1  tells  us  that  the  solution  is  a„  =  ci(2")  +  c2(3n). 
Since  ao  —  1,  we  must  have  ci(2°)  +  C2(3°)  =  1  and,  since  a\  =  4,  we  have 
c  1  (2 1 )  +  C2(3!)  =  4.  Therefore, 

c\  T  c2  =1 
2ci  +  3c2  =  4. 

Solving,  we  have  c\  —  —  1,  C2  =  2,  so  the  solution  is  an  =  —2"  +  2(3”).  (You 
are  encouraged  to  verify  that  this  formula  is  correct.)  I 


PROBLEM  19.  Solve  the  recurrence  an  —  4a„_i  —  4a„_2,  n  >  2,  with  initial 
conditions  ao  =  1,  a\  =4.  (We  solved  this  by  guesswork  and  ingenuity  in 
Problem  11.) 

Solution.  The  characteristic  polynomial,  x2  —  4x+4,  has  the  repeated  root  x  =  2. 
Hence,  the  solution  is  an  =  cj(2”)  +  C2/i(2").  The  initial  conditions  yield  c\  =  1, 
2ci  +  2c2  =  4,  so  C2  =  1.  Thus,  an  =  2"  +  n{ 2")  =  (n  +  1)2".  I 


PROBLEM  20.  Find  a  formula  for  the  nth  term  of  the  Fibonacci  sequence. 
Solution.  To  simplify  the  algebra  which  follows,  we  take  for  initial  conditions 
ao  =  a  1  =  1  rather  than  a\  =  a2  =  1.  Hence,  the  recurrence  relation  to  be  solved 
is  an  =  a„  1  +  an-2,  n  >  2,  and  we  must  remember  that  the  nth  term  will  be 
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an  .  i .  The  characteristic  polynomial,  x2  —  x  -  1,  has  distinct  roots  1±2V^.  Hence, 
the  solution  to  our  recurrence  relation  is 

=  Cl  +^{^2^)  • 

The  initial  conditions  give 

Cl  +  C2  =  1 


ci 


(HV3)+C2(J^/5)  =  1 


yielding  ci  =  and  c2  =  —  Thus,  the  solution  i 


/z  — f—  2 


The  nth  term  of  the  Fibonacci  sequence  is 

In  Exercise  18,  we  ask  you  to  use  this  result  to  obtain  the  simpler  formula  (9)  on 
p.  168.  I 


Why  is  the  real  number 


an  integer  for  any  n  > 


1? 


We  now  turn  our  attention  to  the  general  second  order  recurrence  relation 
an  =  ran^\  +  2  +  fin)  and  show  that  the  solution  is  closely  related  to  the 

corresponding  homogeneous  recurrence  relation. 

Suppose  we  could  find  one  specific  solution  pn  to  the  given  recurrence  rela¬ 
tion.  (Such  a  function  pn  is  called  a  particular  solution.)  Thus,  pn  —  rpn- 1  + 
spn—2  +  fin).  Suppose  tn  were  another  solution.  Then  we  would  also  have 
tn  =  rt„-\  +  stn-\  +  fin)  and,  subtracting, 


5.3.2  THEOREM 


tn  -  Pn  =  rit„- 1  -  P„- 1)  +  s(tn-2  -  p„- 2). 

This  equation  shows  that  t„  -  p„  satisfies  the  homogeneous  recurrence  relation 
a„  =  ra„-\  +  sa„- 2.  Setting  tn  —  pn  =  qn,  we  have  tn  =  pn  +  qn  where  pn  is  a 
particular  solution  to  the  given  recurrence  relation  and  qn  satisfies  the  associated 
homogeneous  recurrence  relation.  This  is  the  content  of  our  next  theorem. 

Let  pn  be  any  particular  solution  to  the  recurrence  relation  an  =  ra„-\  -\-san- 2  + 
fin)  ignoring  initial  conditions.  Let  qn  be  the  solution  to  the  homogeneous 
recurrence  an  —  ran~\  +  san- 2  given  by  Theorem  5.3.1,  again  ignoring  ini¬ 
tial  conditions.  Then  pn  +  q„  is  the  solution  to  the  recurrence  relation  an  = 
ran- 1  +  sa„  2  +  fin).  The  initial  conditions  determine  the  constants  in  qn. 


As  we  have  seen,  the  result  of  this  theorem  seems  plausible.  In  the  exercises, 
you  will  be  invited  to  supply  a  proof.  The  main  point  of  the  theorem  is  that  once 
some  particular  solution  to  the  recurrence  has  been  found,  the  problem  is  reduced 
to  the  homogeneous  case,  which  we  have  already  considered.  Finding  a  particular 
solution  can  be  difficult.  As  we  shall  see,  a  useful  trick  is  to  try  a  formula  for  pn 
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which  is  of  the  same  type  as  /(«).  For  example,  if  f(n)  is  a  linear  function,  try 
a  linear  function  for  pn. 


PROBLEM  21.  Solve  the  recurrence  relation  a„  =  —3an-\  +n,  n  >  1,  where 
«0  =  1  • 

Solution.  Since  f(n )  =  n  is  linear,  we  try  a  linear  function  for  pn;  that  is,  we 
set  pn  =  a  +  bn  and  attempt  to  determine  a  and  b.  Putting  this  expression  for  p„ 
in  the  given  recurrence  relation,  we  obtain 

a  +  bn  =  —  3[a  +  b(n  —  1)]  +  n  =  —3  a  +  3b  +  (1  —  3  b)n. 

This  equation  will  hold  if  a  =  —3 a  +  3 b  and  b  =  l  —  3b,  which  is  the  same  as 
a  =  jz,  b  =  We  conclude  that  pn  =  +  |h  is  a  particular  solution  to  the 

recurrence,  ignoring  initial  conditions. 

The  corresponding  homogeneous  recurrence  relation  in  this  case  is  an  = 
— 3a„_i,  whose  characteristic  polynomial  is  x2  +  3x.  The  characteristic  roots  are 
—3  and  0,  so  the  solution  to  the  homogeneous  recurrence  relation  is 

t7„=c,(-3)"  +  C2(0")  =  c,(-3)n. 

Thus, 

Pn  +  q„  =  fs  +  \n  +  c\ (—3)”. 

Since  ao  =  1,  +  ^(0)  +  C|  (—3)°  =  1;  that  is,  +  ct  =  1.  Thus,  c\  =  ||  and 

the  solution  is  a„  =  +  \n  +  ||(— 3)'!.  I 


PROBLEM  22.  Solve  a„  =  2«„._i  +3a,?_2  +  5",  n  >  2,  given  ao  =  -2,  a\  =  1. 
Solution.  In  looking  for  a  particular  solution,  it  seems  reasonable  to  try  p„  = 
a( 5").  Substituting  into  the  recurrence  relation,  we  get 

a(  5")  =  2a(5n-')  +  3a(5n~2)  +  5n 
25 a  =  10 a  4-  3c/  +  25 
12a  =  25, 

so  a  —  y?  and  pn  =  ^(5")  is  a  particular  solution. 

Next  we  solve  the  homogeneous  recurrence  relation  a„  —  2a„_i  +3a„_2-  The 
characteristic  polynomial,  x2  —  2x  —  3,  has  distinct  roots  —  1  and  3,  so  the  solution 
is  qn  =  C|(—  1)"  +£’2(3”).  By  Theorem  5.3.2.  the  given  recurrence  relation  has 
the  solution 

Pn  +  qn  =  f§(5")  +  c\  (—1)”  +  c2(3”). 

The  initial  conditions  give 

ao  =  —2  =  +  C]  +  C2 

ai  =  1  =  y|(5)  -  C]  +  3c2, 

hence,  c\  =  — C2  =  —  Our  solution  is  a„  =  j|(5”)  -  jj(— 1)”  - 

f(3”).  “  ■ 


Answers  to  Pauses 


9.  All  the  terms  of  the  Fibonacci  sequence  are  integers! 
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EXERCISES 


Back  of  the  Book. 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the 

1.  [BB]  Solve  the  recurrence  relation  an  =  an  \  +  6an-2, 
n  >  2,  given  «o  =  1,  fli  =3. 

2.  Solve  the  recurrence  relation  an  =  —6a„  ,  +  7a„_2, 
n  >  2,  given  ao  =  32,  a\  =  —17. 

3.  [BB]  Solve  the  recurrence  relation  an  =  6an  \  —  9a„_2, 
n  >  2,  given  ao  =  —5,  aj  =  3. 

4.  Solve  the  recurrence  relation  a„+ 1  =  lan  —  10a„_i, 
n  >  2,  given  ai  =  10,  «2  =  29. 

5.  [BB]  Solve  the  recurrence  relation  a„  =  — 8a„_i  — 2. 
77  >  2,  given  ao  =  0,  <21  =  1. 

6.  Solve  the  recurrence  relation  a„  =  —  5a„_i  +  6tf„_2> 
7j  >  2,  given  ao  =  5,  a\  =  19. 

7.  [BB]  Solve  the  recurrence  relation  an+\  =  2 a„  +  3a„„i, 
n  >  1,  given  uq  =  0,  ai  =  8. 

8.  Solve  the  recurrence  relation  a„  =  2a„_i  —a„- 2,  n  >  2, 
given  «o  =  40,  ai  =  37. 

9.  Solve  the  recurrence  relation  9a„  =  6a„_i  —  a„_ 2, 7(  > 
2,  given  «o  =  3,  fl|  =  —  1. 

10.  (a)  [BB]  Solve  the  recurrence  relation  an  =  —  2«„.  1  + 

15a„_2,  77  >  2,  given  ao  =  1,  «i  =  —1. 

(b)  [BB]  Solve  the  recurrence  relation  a„  =  —2an-  \  + 
15a, ,_2  +  24,  77  >  2,  given  ao  =  1,  a\  —  —  1. 

11.  (a)  Solve  the  recurrence  relation  a„+\  =  — 8a„  — 

16a, ,_i,  77  >  1,  given  ao  =  5,  a\  =  17. 

(b)  Solve  the  recurrence  relation  a„+ 1  =  —  8a, ,  — 
16a„_i  +5,  77  >  1,  given  ao  =  2,  a\  =  —  1. 

12.  Solve  the  recurrence  relation  a„  =  4a„  [  —  4a„  2  +  77, 
77  >  2,  given  ao  =  5,  a\  =  9. 

13.  (a)  [BB]  Solve  the  recurrence  relation  a„  =  4a„  -1, 77  > 

1 ,  given  ao  =  1 . 

(b)  [BB]  Solve  the  recurrence  relation  a„  =  4a„_i  +8", 
77  >  1 ,  given  ao  =  1 . 

(c)  [BB]  Verify  that  your  answer  to  (b)  is  correct. 

14.  (a)  Solve  the  recurrence  relation  a„  =  5a„_i  —  6a„_2, 

77  >  2,  given  ao  =  2,  a\  =  11. 

(b)  Solve  the  recurrence  relation  a„  =  5a„_i  —  6a„_2  + 
377,  77  >  2,  given  ao  =  2,  a\  =  14. 

(c)  Verify  that  your  answer  to  (b)  is  correct. 

15.  (a)  Solve  the  recurrence  relation  a„  =  — 6a„_i  —  9a„_2, 

77  >  2,  given  ao  =  1,  a\  =  —4. 


(b)  Solve  the  recurrence  relation  a„  =  — 6a„_i  — 
9an-2+7J2+377, 77  >  2,  given  ao  =  ai  = 

(c)  Verify  that  your  answer  to  (b)  is  correct. 

16.  (a)  [BB]  Solve  the  recurrence  relation  a„  =  4a„_i  —9, 

77  >  1,  given  ao  =  4. 

(b)  [BB]  Solve  the  recurrence  relation  a„  =  4a„_i  + 
3n2".  77  >  1,  given  ao  =  4. 

17.  Solve  the  recurrence  relation  a„  —  5a„_]  — 2a„_2  +  3772, 
77  >  2,  given  ao  =  0,  a\  =3. 

18.  [BB]  Using  the  result  of  Problem  20,  show  that  the  77th 
term  of  the  Fibonacci  sequence  is  the  integer  closest  to 
-^(  l+y^ )”.  This  result  was  stated  without  proof  in  the 

text.  [Hint:  Show  that  1 ( 1  ) |  <  5.] 

19.  Let  a„  denote  the  number  of  77-digit  numbers,  each  of 
whose  digits  is  1,2,  3,  or  4  and  in  which  the  number  of 
1  ’s  is  even. 

(a)  Find  a  recurrence  relation  for  a„. 

(b)  Find  an  explicit  formula  for  a„. 

20.  The  Towers  of  Hanoi  is  a  popular  puzzle.  It  consists  of 
three  pegs  and  a  number  of  discs  of  differing  diameters, 
each  with  a  hole  in  the  center.  The  discs  initially  sit  on 
one  of  the  pegs  in  order  of  decreasing  diameter  (smallest 
at  top,  largest  at  bottom,  as  in  Fig  5.5),  thus  forming  a 
triangular  tower.  The  object  is  to  move  the  tower  to  one 
of  the  other  pegs  by  transferring  the  discs  to  any  peg 
one  at  a  time  in  such  a  way  that  no  disc  is  ever  placed 
upon  a  smaller  one. 


Figure  5.5  The  Towers  of  Hanoi. 

(a)  Solve  the  puzzle  when  there  are  n  =  2  discs  and 
show  your  moves  by  completing  a  little  table  like 
that  below.  [The  pegs  are  labeled  A ,  B,  C,  and  we 
have  used  an  asterisk  (*)  to  denote  an  empty  peg. 
The  disks  are  numbered  in  order  of  increasing  size, 
thus  disk  1  is  the  smallest.] 
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A 

B 

C 

Initial  position 

1,2 

* 

* 

Move  1 

?? 

?? 

?? 

Move  2 

etc. 

?? 

?? 

?? 

Also  solve  the  puzzle,  with  a  similar  table,  when 
n  —  3.  How  many  moves  are  required  in  each  case? 

(b)  Give  a  recurrence  relation  for  an,  the  number  of 
moves  required  to  transfer  n  discs  from  one  peg  to 
another. 

(c)  Find  an  explicit  formula  for  an . 

(d)  Suppose  we  can  move  a  disc  a  second.  Estimate  the 
time  required  to  transfer  the  discs  if  n  —  8,  n  =  16, 
n  =  32,  and  n  =  64. 

21.  Suppose  we  modify  the  traditional  rules  for  the  Tow¬ 
ers  of  Hanoi  as  described  in  the  preceding  question  by 
requiring  that  one  move  discs  only  to  an  adjacent  peg. 
Answer  all  four  parts  of  the  previous  question  for  this 
new  version  of  the  puzzle. 

22.  [BB]  Let  an  =  rcin~\  +sa„_ 2,  n  >  2,  be  a  second  order 
homogeneous  recurrence  relation  with  constant  coeffi¬ 
cients. 

(a)  If  jc  is  a  root  of  the  characteristic  polynomial  and 
c  is  any  constant,  show  that  a„  =  cxn  satisfies  the 
given  recurrence  relation  for  n  >  2. 


(b)  If  pn  and  q„  both  satisfy  the  given  recurrence  for 
n  >  2,  show  that  a„  =  pn  +  qn  also  satisfies  the 
recurrence  for  n  >2. 

(c)  Using  (a)  and  (b),  prove  Theorem  5.3.1  for  the 
case  where  the  characteristic  polynomial  has  distinct 
roots. 

23.  Let  an  =  ran_\  +  san-i,  n  >  2,  be  a  second  order 
homogeneous  recurrence  relation  with  constant  coeffi¬ 
cients  and  assume  that  its  characteristic  polynomial  has 
just  one  (repeated)  nonzero  root. 

(a)  [BB]  If  x  is  the  characteristic  root,  show  that  r  =2x 
and  s  =  —x2. 

(b)  Conclude  from  (a)  that  if  c  is  any  constant,  then 
a„  =  cnxn  is  a  solution  of  the  recurrence  relation. 

(c)  Use  (b)  and  the  previous  exercise  to  prove  Theo¬ 
rem  5.3.1  for  this  case. 

24.  Let  a„  =  ran_\  +  sa„  2  +  fin),  n  >  2,  be  a  second 
order  recurrence  relation  with  constant  coefficients. 

(a)  Show  that  if  p„  satisfies  this  recurrence  relation 
for  n  >  2  and  qn  satisfies  the  associated  homo¬ 
geneous  recurrence  relation  a„  =  ra„-\  +san- 2  for 
n  >  2,  then  p„  +  q„  satisfies  the  given  relation  for 
n  >  2. 

(b)  Complete  the  proof  of  Theorem  5.3.2. 


5.4  Solving  Recurrence  Relations;  Generating  Functions 

The  brief  introduction  to  generating  functions  which  we  give  in  this  section  belies 
the  importance  of  this  concept  in  combinatorial  mathematics.  Our  purpose  is  just 
to  give  some  indication  as  to  how  generating  functions  can  be  used  to  solve 
recurrence  relations. 

Roughly  speaking,  a  generating  function  is  a  polynomial  that  “goes  on  for¬ 
ever,’’  that  is,  an  expression  of  the  form 

f{x)  =  ao  +  a\x  +  CI2X2  +  a^x^  +  •  ■  •  +  anxn  +  •  •  •  . 

Unlike  the  usual  polynomial,  in  which  the  coefficients  a,  are  all  zero  after  a 
certain  point,  a  generating  function  usually  has  infinitely  many  nonzero  terms. 
There  is  an  obvious  correspondence  between  generating  functions  and  sequences 
a 0,  a\ ,  a.2, . . . ;  namely, 

2  ^ 

ao  +  a\x  +  a2X  +  a 3X  +  •  •  •  < — >  ao,  a\ ,  02,  aj, . . .  . 


5.4.1  DEFINITION 


The  generating  function  of  a  sequence  ao,  a\,  02,  ■  ■  ■  is  the  expression  f(x)  = 
ao  +  ai.r  +  a2X2  +  •  ■  ■ . 
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The  generating  function  of  the  sequence  1,  2,  3, . . .  of  natural  numbers  is  fix)  = 
1  +  2x  +  3x2  +  •  •  • ,  while  the  generating  function  of  the  arithmetic  sequence 
1,  4,  7,  10, . . .  is  f(x)  =  1  +  4x  +  lx 2  4-  IOjc3  +  •  •  • .  A 

Two  generating  functions  can  be  added  and  multiplied  term  by  term  just  like 

polynomials.  If  fix)  =  ao  +  a\x  +  aix2  +  •  •  •  and  g(x)  —  bo4-bix-\-b2x2  4 - , 

then 

fix)  +  g(x)  =  (ao  +  b0)  +  (a  i  +  b\)x  +  {a2  +  b2)x2  4 - 

f(x)g(x)  =  (aobo)  +  (fliho  +  aob\)x  +  iaob2  +  a\b\  +  a2bo)x 2  4 - . 

Note  that  while  generating  functions  have  infinitely  many  terms,  the  definitions  of 
addition  and  multiplication  involve  no  infinite  sums;  for  example,  the  coefficient 
of  xn  in  the  product  / (x)g(x)  is  the  finite  sum 

r*0 bn  +  a\bn-\  +  o2bn-2  +  •  •  •  +  anbo  ■ 


PROBLEM  24.  If 

fix)  =  1  +  X  +  X2  +  •  ■  •  +  xn  +  •  •  • 
and 

g(x)  =  1  -x  +x2  -  X3  H - f  {-\)nxn  H - , 

find  fix)  +  g(*)  and  fix)gix). 

Solution,  fix)  +  g(x )  =  ( 1  4-  x  4-  x2  +  ■■■+  xn  - ) 

+  (1  -*  +JC2  -  Jt3  +  •••  +  (-1)"*"  +  •  ••) 

=  (l  +  l)  +  (l-l)x  +  (l  +  l)x2 

+  ■  ■  ■  +  i\  +  i-\)n)Xn  +  ■  ■  ■ 

=  2  4-  2x2  +  2.r4  +  •  •  ■ 

fix)gix)  =  (1  4- X  4-  X2  4 - hr"  4 - ) 

•(I  —  x  4-  x2  —  x2  4-  ■  ■  ■  4-  (—l)nxn  4-  •  ■  ■) 

=  1  +[1(-1)+  l(l)]x  +  [l(l)  +  1(-1)  +  1(1)]jc2  +  •  •  - 
=  1  +  x2  -j-  x^  +  x ^  -f-  ■  ■  ■  I 


Students  who  have  studied  calculus  for  more  than  one  year  should  notice 
the  obvious  similarity  between  generating  functions  and  power  series  and  will  be 
comfortable  with  the  fact  that  generating  functions  often  can  be  expressed  as  the 
quotient  of  polynomials.  An  important  example  is 

(11)  - - =  1  +  x  +  x2  +  x2  +  •  •  ■  , 

1  —  X 


which  shows  that 


is  the  generating  function  of  the  sequence  1,  1,  1 . 
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Suppose  a  is  a  real  number.  Show  that 
certain  geometric  sequence. 


1 

1  —  ax 


is  the  generating  function  for  a 


In  combinatorics,  the  fundamental  distinction  between  power  series  and  gen¬ 
erating  functions  is  that,  whereas  power  series  in  calculus  are  functions  R  — >  R 
with  radii  of  convergence,  generating  functions  are  purely  formal  objects  which 
will  never  be  “evaluated”  at  a  specific  real  number  x.  Thus,  we  do  not  worry 
about  what  the  infinite  sum  ao  +  a\x  +  aix2  +  ■  ■  ■  means.  Whereas  it  is  a  topic  of 
interest  in  calculus  to  prove  formula  (11),  for  instance,  and  to  discover  for  which 
real  numbers  this  formula  is  valid,  for  us  the  proof  is  a  routine  application  of  the 
definition  of  multiplication  of  generating  functions. 


(1—  x)(l  x  x~  x"  +  •••) 

=  1  +  [1(1)  —  l(l)]x  +  [l(l)-  l(l)]x2  +  -..  +  [l(l)-  1(1)]jc"  H - 

=  1  +  Ox  +  Ox2  H - 1-  Ox"  H - 

=  1 

Another  very  useful  formula  for  us  is 

(12)  *  =  1  +  2x  +  3x2  +  4  x3  +  •••  +  («  +  l)x"  +  •  •  •  , 

(1  -  x)1 

which  says  that  - »  is  the  generating  function  of  the  sequence  of  natural 

(1  -x)1 

numbers. 

Prove  formula  (12). 

Suppose  f(x)  is  the  generating  function  of  the  sequence  0,  1. 2,  3, . . .  ;  that 
is,  /(x)  =  0  +  lx  +  2x2  +  3x3  +  4x4  +  •  •  • .  Then 

f  (x)  —  x  ~T  2x“  -f  3x3  T  4x4  T  •  •  • 

=  x(l  +  2x  +  3x2  +  4x3  H - )  =  x— — - — -x 

(1  -  xy 

X 

by  (12),  so  /(x)  takes  the  simpler  form  f(x)  = - 

(1  -xY 

We  now  present  a  few  examples  which  show  how  generating  functions  can 
be  used  to  solve  recurrence  relations. 


PROBLEM  25.  Solve  the  recurrence  relation  an  —  3a„_i,  n  >  1,  given  ao  =  1. 

Solution.  Consider  the  generating  function  /(x)  =  ao  +  a\x  +  a2X2  +  •••-)- 
a„x"  +  ■  ■  ■  of  the  sequence  ao,  a\,  ai,  ■  ■  ■  ■  Multiplying  by  3x  and  writing  the 
product  3 x/(x)  below  /(x)  so  that  terms  involving  x"  match,  we  obtain 

fix)  —  ao  +  a\x  +  aix-  +  +  anxn  +  ••• 

3  x/(x)  =  3flo-*  +  3aix2  +  •  ■  •  +  3<7,;_ix''  +  ■  •  •  . 
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Subtracting  gives 
fix)  -  3  xfix) 

=  ao  +  (a\  —  3ao)x  +  (02  -  3ai).v2  H - F  (a„  -  3a„_i).v'!  H - . 

Since  ao  =  1,  tzi  =  3ao  and,  in  general,  a„  =  3an-\,  this  says  that  (1  —  3x)  fix)  = 

1.  Thus,  fix)  =  - - —  and,  using  (11), 

1  —  3x 

fix)  —  1  -F  3.v  +  (3x)‘  T-  •  •  •  T  (3x)n  +  •  •  • 

=  1  +  3x  +  9x2  + - F  3nxn  +  ■■■  . 

We  conclude  that  an,  which  is  the  coefficient  of  xn  in  fix),  must  equal  3".  So 
we  have  an  =  3”  as  the  solution  to  our  recurrence  relation.  I 


PROBLEM  26.  Solve  the  recurrence  relation  a„  —  2a,;_i  —  an- 2,  n  >  2,  given 
ao  =  3,  a\  =  —2. 

Solution.  Letting  fix)  be  the  generating  function  of  the  sequence  in  question, 
we  have 

fix)=ao+  a\x  +  a2X2  +  ■■■  +  anxn  + 

2  xfix)  =  2aox  +  2a\x2  +  ■  ■  ■  +  2an-\xn  +  •  •  • 

x2fix)  =  oqx 2  +  •  •  •  +  an-2x”  +  •  •  •  . 

Therefore, 

fix)  -  2  xfix)  +  x2  fix)  =  ao  +  ia\  -  2ao)-*  +  O2  -  2fl|  +ao)x 2  H - 

+  —  2o„_i  +  an-2)xn  +  •  ■  • 

=  3  -  8jc 

since  ao  =  3,  «i  =  —2  and  an  —  2a„_i  +  a„_ 2  =  0  for  a  >  2. 

So  (1  —  2x  +  x2) f (x)  —  3  —  8a%  (1  —  x)2 fix)  =  3  —  8jc  and 

fix)  = - x  (3  —  8.r) 

J  il-x)2 

=  (1  +  2jc  +  3jc2  +  •  •  •  +  in  +  \)xn  H - )(3  -  8.r)  by  (12) 

=  3  -  2x  -  lx2  -  12jc3  +  •  •  •  +  [3(n  +  1)  -  8 n]xn  +  ••• 

=  3  -  2x  -  lx2  -  12x3  +  •  •  •  +  (— 5n  +  3)xn  +  •  •  • 

and  an  =  3  —  5n  is  the  desired  solution.  [Verify  that  this  recurrence  can  also  be 
solved  by  Theorem  5.3.1  of  Section  5.3.]  I 


PROBLEM  27.  Solve  the  recurrence  an  =  —  3a„..  1  +  10a„_2,  n  >  2,  given 
ao  =  1,  a\  =4. 
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Solution.  Letting  fix)  be  the  generating  function  of  the  sequence  in  question, 
we  have 

fix)  =  ao  +  a\x  +  aix2  +  •  •  •  +  anxn  +  •  •  • 

3xf(x)=  3aox  +  3a\x2  +  •••  +  3an-\xn  +  ••• 

\Qx2  fix)  =  \OaQX2  +  ■  •  •  +  \0a„-2Xn  +  •  •  •  . 

Therefore, 

fix)  +  3xf(x)  —  10x2  f{x)  =  ao  +  (a  i  +  3ao)x  +  («2  +  3a\  —  10ao)x2 

+  •  •  •  +  ian  +  3a„-i  —  I0an-2)x"  +  •  •  • 


=  1  +  lx 

since  ao  =  L  a\  =  4  and  a„  +  3a„_i  —  10a„_2  =  0  for  n  >  2.  So 


and 


f(x)  = 


(l  +  3x  -  10 xz)f{x)  =\  +  lx 
1  T  lx  1  lx 


1+ 3a: -10a:2  (1 +5jc)(1 -2x)  ' 

At  this  point,  it  is  useful  to  recall  the  method  of  partial  fractions.  We  set 
1  A  B  AH -2x)  +  Bil +5x) 

(1  +5x)(l  —  2x)  ~  1  +5*  +  ~2x  ~  (1  +  5jc)(1  —  2jc)  ‘ 

Equating  numerators,  1  =  (A  + B)  +  (—2A+5B)x,  so  A  +  B  =  1,  -2A  +  5B  =  0. 
Solving  for  A  and  B,  we  get  A  =  B  =  Therefore, 


1 


and  so  fix)  = 


(1  +  5x)(l  —  2x) 
\+lx 

(1  A-  5jc)(1  -2x) 


1 


1  +5* 


+ 


I 


7  V 1  +  5x  ) 


(1  +  lx)  +  — 


1  -2x 


1  \  1  -  2x 


(1  +  lx) 


—  y(l  +  (— 5x)  +  (—5a:)2  +  ■  •  •  )(1  +  lx) 

+  ^(1  +  2x  +  {2xY  +  •  •  •  )(1  +  lx) 

=  f  (1  -  5x  +  25a:2  +  •  •  •  +  (-5)nxn  +  •••)(!+  lx) 

+  f  (1  +  2x  +  4x2  +  ■  ■  •  +  2nxn  +  ■••)(!+  lx) 


=  f  (1  +  2a:  -  10a:2  +  •  •  •  +  [(-5)"  +  l(-5)n~l]xn  +  •  •  • ) 
+  f  (1  +9x  +  18a:2  +  •  •  ■  +  [2n  +  li2n-l)]xn  +  ■■■) 
=  f  (1  +  2x  -  10a:2  +  •  ■  •  +  2(-5)"-!a:"  +  •  •  ■ ) 

+  lj  (1  +  9a;  +  18a:2  +  •  •  •  +  9(2n~l)xn  +  •  •  • ) 

=  1  +  4x  -  2x2  +  ■■■  +  (-f  (-5)"  +  j(2n))xn  +  ■■■ 
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and  hence  an  =  —^(—5)”  +  |( 2 ")  is  the  desired  solution.  (Again,  we  suggest  that 
you  verify  that  this  recurrence  can  also  be  solved  by  the  methods  of  Section  5.3.) 

I 


PROBLEM  28.  Solve  the  recurrence  relation  a„  =  —  an-\  +  2n  —  3,  n  >  1, 
given  «o  =  1. 

Solution.  Let  fix)  be  the  generating  function  of  the  sequence  ao,  a\,  02,  ...  . 
Then 

fix)  =  ao  +  a\x  +  a2X2  +  +  anxn  +  ••• 

xf(x )  =  aox  +  a\x2  +  •  •  •  +  \xn  +  •  •  ■  . 

Therefore, 

f(x)  +  xf(x)  ~ao  +  (a I  +ao)x  +  iai  +  a\)x2  3 - h  (an  +  a„-i)xn  H - . 

We  are  given  that  ao  =  1  and  an  +  an-\  =  2 n  —  3.  Thus  a  \  +ao  =  2(1)  —  3  =  — 1, 
ai+a\  =  2(2)  —3=1,  and  so  on.  We  obtain  (1  +x)f(x)  =  1  —  a:  +  x2  +  •  •  •  + 
(2 n  —  3)xn  +  •  •  •  and  so 

1  , 

fix)  =  - - (1  -  x  +  x2  H - h  (2n  -  3)xn  H - ) 

1  +  x 

=  (1  +  (-jc)  +  i-x)2  +  •  ■  ■  +  (-x)n  +  ■  •  • ) 

■  (l  -  x  +  X2  +  ■  ■  ■  +  (2n  -  3)xn  +  ■  ■  ■) 

=  (1  -x  +  x2  -X3  +  ---  +  (-x)n  +  ■■■) 

•  (1  —  x  +  x2  +  •  •  •  +  (2n  —  3)xn  +  •  •  ■ ) 

=  1  —  2x  ~F  3.r“  T  *  * ' 


+  [(2/;  -  3)  -  (2u  -  5)  +  •  •  •  +  (— l)"_l(— 1)  +  (-1)"]*"  + 
Now  a„  is  the  coefficient  of  x",  the  term  in  the  square  brackets: 
f»-> 

=  E(_1)A[  ln-{2k  +  3)]\ 

1 1= o 


+  (-D" 


n— 1  n- 1 

=  (2n-3)^(-l)‘  -2^(-l)^  +(-1)”. 

*=0  k=0 

We  leave  it  to  you  to  verify  that 

—  |  if  n  is  even 
if  n  is  odd. 

So,  if  n  is  even,  a„  =  0  —  2(— |)  +  1  =  n  +  1,  while  if  n  is  odd,  a„  =  (2 n  —  3)  — 
2 i'2-^-)  ~  1  =  2n  —  3  —  n  +  1  —  1  =  n  —  3.  The  solution  is 

{n  +  1  if  n  is  even 
n  —  3  if  n  is  odd. 


B- 

k= 0 


1  )kk 
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Note  that  this  solution  could  also  be  written  as  a„  =  2(— 1)”  +  n  —  1,  which  is 
the  answer  one  would  obtain  by  the  methods  of  Section  5.3.  I 


Answers  to  Pauses 


10.  Replacing  *  by  ax  in  (11),  we  see  that 

- -  =  1  +  (ax)  +  (ax)2  +  (ax)3  +  •  -  -  =  1  +  ax  +  a2x2  +  a3* 3  +  •  •  •  . 

1  —  ax 


From  this,  we  see  that  -  is  the  generating  function  for  the  sequence 

1  —  ax 

1,  a,  a2,  a 3, . . . ,  which  is  the  geometric  sequence  with  first  term  1  and  com¬ 
mon  ratio  a. 

11.  (1  -  x)2(  1  +  2x  +  3x2  +  ■  ■  ■  +  (n  +  1  )x"  +  ■  •  • ) 

=  ( 1  —  2*  +  *2)(1  t  2.x  +  3x2  —F  *  ■  *  — (/?  -f-  \.)xft  +  •  •  • ) 

=  1  +  [1(2)  -  2(1)]*  +  [1(3)  -  2(2)  +  1(1)]*2  +  ■  •  • 


+[l(n  +  1)  -  2 (n)  +  1(«  -  1)]*"  +  •••  =  ! 


since  n  +  1  —  2/j  +  n  —  1  =  0. 


EXERCISES 


The  symbol  [BBj  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1. 


What  sequence  is  associated  with  each  of  the  following 
generating  functions? 

(a)  [BB]  (2  -  3*) 2 


(c)  [BB] 


1 

(1+3*) 


2 


(d) 

(e) 


1 

(1  -x3)2 

x2  I 

(1  +  x)2  t  +  5* 


2.  Express  the  generating  function  of  each  of  the  following 
sequences  as  a  polynomial  or  as  the  quotient  of  polyno¬ 
mials. 


(a)  [BB]  1,2,  5,0,  0, 

(b)  0,  1,4,  1,0,0, ... 

(c)  1,2,4,  8,  16,  ... 

(d)  I, -I, 1,-1,... 

(e)  3,3,3,... 

(f)  [BB]  1,0,  1,0,  .. 

(g)  1.-2,  3, -4,... 


3.  [BB]  Using  the  method  of  generating  functions,  solve 
the  recurrence  relation  an  =  2a „ _  i ,  n  >  1,  given  ay  =  1. 
Compare  this  solution  to  the  sequence  defined  by  (5), 
p.  164. 

4.  Use  the  method  of  generating  functions  to  solve  the 
recurrence  relation  a„  =  3a„-i  + 1,  n  >  1 ,  given  ao  =  1 . 
Compare  your  solution  with  the  sequence  given  in  Prob¬ 
lem  10,  p.  164. 

5.  [BB]  Use  the  method  of  generating  functions  to  solve  the 
recurrence  relation  «„  =  5 an-i  —  6a„~2,  given  ao  =  1, 
a  i=4.  Verify  your  answer  by  comparing  with  Prob¬ 
lem  18. 

6.  Use  the  method  of  generating  functions  to  solve  the 
recurrence  relation  a„  —  4a„_i  —  4fl„_2,  n  >  2,  given 
ao  =  1,  a\  =  4.  Compare  your  solution  with  the 
sequence  given  in  Problem  19,  p.  174. 

7.  [BB]  Use  generating  functions  to  find  a  formula  for  a„ 
given  ao  =  5  and  a„  =  an-\  +  2"  for  n  >  1. 

8.  (a)  Use  the  method  of  the  characteristic  polynomial  (as 

in  Section  5.3)  to  solve  the  recurrence  relation  a„  = 

4a„-i  —  4«„_2+4",  n  >  2,  with  the  initial  conditions 

«o  =  2,  a i  =  8. 
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(b)  Solve  the  recurrence  in  (a)  by  means  of  generating 
functions. 

(c)  Which  of  the  preceding  methods  do  you  prefer  in 
this  case? 

9.  Solve  each  of  the  following  using  generating  functions. 
In  each  case,  use  the  methods  of  Section  5.3  to  verify 
your  answer. 

(a)  [BB]  an  =  —  5a„_i,  n  >  1,  given  ao  =  2. 

(b)  an  =  — 5n„_i  +  3,  n  >  1,  given  ao  =  2. 

10.  Solve  each  of  the  following  using  generating  functions. 
Verify  your  answer  by  the  method  of  Section  5.3. 

(a)  a„  =  4«„_i  -  3a„_2,  n  >  2,  given  a0  =  2,  a\  =  5. 

(b)  an  =  —  10a„_i  —  25a, ,_2,  n  >  2,  given  ao  —  1, 
a\  =  25. 

11.  [BB]  Using  generating  functions,  solve  the  recurrence 
relation  an  —  2a„_i  +  a„_ 2  —  2a„_3,  n  >  3,  given 
ao  =  1,  a i  =  3,  02  =  6-  (Note  that  this  recurrence  is  not 
second  order  and  so  cannot  be  solved  by  the  methods 
of  Section  5.3.) 

12.  [BB]  Use  the  method  of  generating  functions  to  solve 
the  recurrence  relation  an  =  an-\  +an-2  —  «H-3,  n  >  3, 
given  ao  =  2,  ai  =  —  1,  «2  =  3. 

_ I  _  Ax +  B  C 

(ax  +  b)2(cx  +  d)  ( ax  +  b )2  cx  +  d 

13.  The  Pell  sequence  is  delined  by  po  =  1,  p\  =  2  and 
Pn  =  2pn_i  +  P„- 2  for  n  >  2. 


(a)  Use  the  characteristic  polynomial  to  solve  this  recur¬ 
rence  relation. 

(b)  Show  that  p„  is  the  integer  closest  to  ( ' '?  ' )  ( 1  + 

•Jif. 

(c)  Find  the  generating  function  of  the  Pell  sequence, 
finding  explicitly  its  first  four  terms. 


Remarkably,  there  exist  closed  form  solutions  for  pn. 
For  example, 


Pn= 

i.j,k>0 


( i  +  j  +  k) ! 
KW 


i+j+2k=n 


See  the  American  Mathematical  Monthly  107  (2000), 
no.  4,  p.  370  for  three  verifications  of  this  formula. 

14.  This  question  concerns  the  Fibonacci  sequence  defined 
by  the  recurrence  relation  an  =  a„  \  +  a„- 2,  where 
ao  =  a\  =  1 . 

(a)  Suppose  f(x)  is  the  generating  function  of  the 

Fibonacci  sequence.  Show  that  f(x)  —  — — — - 3-. 

1  —  x  —  xL 

(b)  Finda  and  such  that  J—x—x2  =  (1—  ax)(\—  fix). 

(c)  Find  A  and  B ,  in  terms  of  a  and  p,  such  that 

1  _  A  B 

1  —  x  —  x2  1  —  ax  \  —  fix' 

(d)  Use  the  results  of  the  previous  parts  to  obtain  a 
formula  for  a„  and  compare  your  answer  with  that 
found  in  Problem  20,  p.  174.  Which  method  for 
finding  a„  do  you  prefer? 


REVIEW  EXERCISES  FOR  CHAPTER  5 


1.  Using  mathematical  induction,  show  that  ^  3' 
3"  -  1 


;=i 


for  all  n  >  1 . 


2.  Using  mathematical  induction,  show  that  ^1  —  -  j  > 

1  —  —  for  all  n  >  1 . 

2 

3.  Use  mathematical  induction  to  prove  that  10,,+2+10”+ 1 
is  divisible  by  3  for  all  n  >  1 . 

4.  Give  a  recursive  definition  of  each  of  the  following 
sequences: 


(a)  1,5,29,  173,  1037,... 

(b)  3,5,  13,85.3613,... 


(c)  1,9,36,  100,  225, ... 

5.  Let  an  be  defined  recursively  by  a\  =  0,  «2  =  5  and, 
for  k  >  1,  ak+ 2  -  \(ak+ak+\).  Prove  that  an  =  |(1  - 
(— j)”-1)  for  all  integers  n  >  1. 

6.  Consider  the  arithmetic  sequence  which  begins  5,  9,  13. 

(a)  Find  the  32nd  and  100th  terms  of  this  sequence. 

(b)  Does  125  belong  to  the  sequence?  If  so,  where  does 
it  occur? 

(c)  Repeat  (b)  for  the  numbers  429  and  1 000. 

(d)  Find  the  sum  of  the  first  18  terms. 

7.  The  first  two  terms  of  a  sequence  are  6  and  2. 

(a)  If  the  sequence  is  arithmetic,  find  the  27th  term  and 
the  sum  of  the  first  30  terms. 
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(b)  If  the  sequence  is  geometric,  find  an  expression  for 
the  27th  term  and  the  sum  of  the  first  30  terms. 

8.  (a)  Define  the  Fibonacci  sequence. 

(b)  Is  it  possible  for  three  successive  terms  of  the 
Fibonacci  sequence  to  be  odd? 

(c)  Is  it  possible  for  two  successive  terms  in  the 
Fibonacci  sequence  to  be  even? 

Justify  your  answers. 

9.  Solve  the  recurrence  relation  a„  =  5an-\  —  4an  2.  n  > 
2,  given  that  ao  =  —3  and  a  1  =  6.  Use  the  characteristic 


polynomial  as  described  in  Section  5.3. 

10.  Solve  Exercise  9  using  the  method  of  generating  func¬ 
tions  described  in  Section  5.4. 

11.  Solve  an  =  4cr„_i  +  5a„_i  +  3",  n  >  2,  given  ao  =  4, 
a\  =  — 1. 

12.  (For  students  of  calculus)  Let  /1./2./3,  denote 
the  Fibonacci  sequence  as  defined  in  5.2.3.  Evaluate 
^2,  fk 

y  exactly.  Then  approximate  this  sum  to  19  dec¬ 

imal  places  and  admire  its  beauty.  What  do  you  notice? 


*  m- 
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Principles  of  Counting 

. 


6.1  The  Principle  of  Inclusion-Exclusion 

Glenys  is  thinking  about  registering  for  a  word  processing  course.  Of  the  100 
people  who  have  registered  so  far,  she  discovers  that  three  quarters  are  men  and 
80%  own  personal  digital  assistants  (PDAs). 

(a)  Find  a  formula  for  the  number  of  women  in  the  course  who  do  not  have 
PDAs.  How  large  might  this  number  be?  How  small? 

(b)  How  many  of  the  men  registered  in  this  course  could  conceivably  own 
PDAs? 

The  object  of  this  chapter  is  to  illustrate  some  basic  principles  of  counting.  We 
consider  first  the  number  of  elements  in  various  combinations  of  finite  sets  such 
as  union,  intersection,  and  difference.  As  in  Section  3.3,  the  number  of  elements 
in  a  finite  set  S  will  be  denoted  |S|. 

To  analyze  Glenys’ s  questions,  we  introduce  the  sets  U  of  all  registrants,  M 
of  male  registrants,  and  P  of  those  registrants  who  own  a  PDA.  We  are  given 
that  |t/[  =  100,  |M|  =  75,  and  IP!  =  80.  The  set  of  women  who  do  not  have  a 
PDA  is  Mc  D  Pc  and  it  is  the  size  of  this  set  with  which  part  (a)  is  concerned. 
By  one  of  the  laws  of  De  Morgan, 

Mc  r  Pc  =  (M  U  P)c 

and  so 

| Mc  n  Pc\  =  I (M  U  Pf  \  =  100  -  \M  U  P |. 

How  big  is  Ml)  PI  Adding  the  number  of  men  and  the  number  of  registrants 
who  have  PDAs  counts  twice  the  men  with  PDAs  (see  Fig  6.1),  so 

\MU  P\  =  |M|  +  |  PI  —  \M  n  P\  =  75  +  80  -  \M  fl  P\  =  155  -  \M  D  P|. 

Therefore, 

\MC  n  Pc'|  =  100  -  (155  -  \M  G  P|)  =  \M  n  P|  -  55, 
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Figure  6.1  \M  U  P\  =  | M\  +  |Fj  —  \M  n  P\  because  Region  x  gets  counted  twice  in  the  sum  |M|  +  \P\. 

which  is  a  formula  for  the  number  of  women  without  PDAs.  Since  M  D  P  is 
a  subset  of  M,  a  set  of  75  people,  \M  D  P\  <  \M\  =  75  and  so  Mc  n  Pc  < 
75  —  55  =  20.  Conceivably,  \MC  fl  Pc\  could  be  0  (if  \M  n  P\  =  55).  So  the 
number  of  women  without  PDAs  is  between  0  and  20  (inclusive). 

Part  (b)  asks  about  M  n  P.  Observing  that  \M  H  P\  —  55  >  0  (because  it  is 
the  number  of  women  without  PDAs),  we  see  that  \M  0  P|  >55;  that  is,  at  least 
55  men  must  own  PDAs.  As  noted  previously,  the  upper  bound  for  \M  fl  P\  is 
75  and  it  is  possible  for  this  number  to  be  realized. 

Suppose,  in  fact,  that  \M  O  P\  =72.  Then 

\M  U  P|  =  jM|  +  |P|  -  \M  (T  P|  =  75  +  80-72  =  83. 

There  are  83  people  in  the  class  who  are  either  men  or  own  a  PDA. 

How  many  men  registered  in  the  class  would  not  have  PDAs?  Surely,  75  - 
72  =  3: 


\M  \  P\  =  \M\  —  \M  O  P\  =  75  —  72  =  3. 

How  many  of  the  owners  of  PDAs  would  be  women?  Surely,  80  —  72  =  8; 

\P  \  M\  =  |P|  -  |PHM|  =  80-  72  =  8. 

How  many  of  those  registered  are  either  men  without  PDAs  or  women  with 
PDAs?  This  question,  which  asks  for  \M  ©  P|,  can  be  answered  in  several  ways. 
It  is  the  number  of  men  without  PDAs  plus  the  number  of  women  with  a  PDA; 
that  is, 


|M©  P\  =  \M  \  P\  +  \P\M\  =  3  +  8  =  11. 

It  is  also  the  number  of  people  who  are  men  or  owners  of  a  PDA  but  not  both: 

|M©  P\  =  |MU  P\  -  \MC  P\  =  83  -72  =  11. 

We  could  also  legitimately  argue  that  this  number  is 

|M®  P|  =  \M\  +  |P|  —  2|M  n  P|  =  75  +  80-  2(72)  =  155  -  144  =  11 

because  adding  the  number  of  men  and  the  number  of  people  with  PDAs  counts 
twice  the  “unwanted”  individuals,  the  men  with  PDAs. 

We  summarize  the  basic  principles  of  counting,  most  of  which  we  have  just 
illustrated. 


6.1.1  PROPOSITION 


Proof 
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Let  A  and  B  be  subsets  of  a  finite  universal  set  U.  Then 

(a)  |A  U  fi[  =  \A\  +  \B\  —  \  A  C\  B\ 

(b)  \A  n  B\  <  minf|A|,  |B|},  the  minimum  of  |A|  and  |fi| 

(c)  |A  \  B\  =  | A |  —  \A  n  B\  >  | A |  —  |fi| 

(d)  \AC\  =  \U\  —  |Aj 

(e)  |A©fl|  =  |AUB|-|AnB|  =  |A|  +  \B\  -  2|A  n  B\  =  |A  \  B\  +  \B  \  A| 

(f)  |A  x  B|  =  |A|  x  |fi| 

(a)  If  A  =  0,  then  A  ft  B  =  0,  and  |  A  |  =  |  A  n  B  \  —  0  so  the  result  holds  because 
each  side  of  (a)  equals  \B\.  Similarly,  the  result  holds  if  B  =  0,  so  we  assume 
henceforth  that  neither  A  nor  B  is  empty. 

Suppose  A  n  B  =  0.  Let  A  =  [a\ ,  a2,  •  •  •  ,  ar]  and  B  =  {b\ ,  bj,  . . .  ,  bs}. 
Then 

A  U  B  —  {a\ ,  a2, . . .  ,  ar,  b\ .  bj,  ■  ■ .  ,  bs) 

and,  since  there  is  no  repetition  among  the  elements  listed  here, 

\AU  B\  =  r+s  =  \A\  +  \B\  =  |A|  +  |B|  -  |A  n  B\ 

and  the  desired  formula  is  true. 

On  the  other  hand,  if  A  fT  B  ^  0  and  we  let 

A  \  B  -  [a\,  o2,  •  •  •  ,  a,  } 

B  \  A  =  {b\,b2,  ■  ■  ■  ,bs] 

A  fi  B  =  {x\ ,  x2. . . .  ,  x,} 

then 

A  =  {«| ,  a2,  ■  ■  ■  ,  ar,  x\ ,  x2, . . .  ,  xt }  and 

B  =  {b\,b2 - -  bs,x  i,X2,  ■■■  ,  xt], 

so 

AU5  =  {fl|,.,.  ,ar,b  i, . . .  ,bs,x\, ...  ,  xt} 

with  no  repetition  among  the  elements  listed  in  any  of  these  sets.  Thus, 

|A|  +  |6|  —  |Anfi|  =  (r  +  /)  +  ($ +  r)  —  t  =  r  +  i  +  t  =  |AUfi| 

and  the  formula  holds  in  this  case  as  well. 

We  omit  the  proofs  of  (b)-(e)  since,  as  with  the  proof  of  part  (a),  these  follow 
in  a  straightforward  way  from  the  definitions  of  U,  fl,  \ ,  ©,  and  set  complement. 
The  proof  of  (f)  is  left  to  the  exercises.  I 

The  formula  for  the  number  of  elements  in  the  union  of  two  sets — part  (a) 
of  Proposition  6.1.1 — is  the  simplest  case  of  the  Principle  of  Inclusion-Exclusion, 
which  gives  the  general  formula  for  the  number  of  elements  in  the  union  of  any 
finite  collection  of  finite  sets. 
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Suppose,  for  instance,  we  have  three  finite  sets  A,  B,  and  C.  Then 


i 

\A  U  B  U  C\  =  \  A  U  (B  U  C)| 

=  |A|  +  |BUC|-|An(BUC)| 

=  \A)  +  \B  U  C\  -  I  (A  nB)U(An  C)| 

=  \A\  +  [|5|  +  \C\  -  nc|] 

-[I a  n  b\  +  \a  n  c\  -  |(A  n  B)  n  (A  n  C)|] 

=  \  a\  +  |B|  +  \c\  -  |snc|  -  |AnB|  -  | a  nc|  +  \A  n  snc|, 

where,  at  the  spots  marked  with  single  arrows,  we  used  the  fact  that  |X  U  Y\  = 
| X |  +  |T|  —  |X  fl  F|  and,  at  the  last  step,  that  (A  n  B)  n  (A  Cl  C)  =  A  Pi  B  D  C. 

What  happened  at  the  double  arrow? 

Here  is  the  formula  for  the  cardinality  of  the  union  of  four  finite  sets: 


\A  U  B  U  C  U  D\ 


=  \A\  +  \B\  +  \C\  +  \D\ 

— |Anfl|  - \a  nci  -  |AnD|  -  |gnc|  -  |sdd|  -  |cnz)| 
+1  a  n5nc|  +  |An«nD|  +  |AncnD|  +  |isncnD| 

-\ADBC\C(AD\. 


6.1.2  PRINCIPLE  OF 
INCLUSION- 
EXCLUSION 


The  general  pattern  should  be  evident.  Add  the  cardinalities  of  each  of  the 
sets,  subtract  the  cardinalities  of  the  intersections  of  all  pairs  of  sets,  add  the 
cardinalities  of  all  intersections  of  the  sets  taken  three  at  a  time,  subtract  the 
cardinalities  of  all  intersections  of  the  sets  taken  four  at  a  time,  and  so  on. 

Given  a  finite  number  of  finite  sets,  A\,  A 2,  ...  ,  A„,  the  number  of  elements  in 
the  union  A\  U  A2  U  •  •  •  U  A„  is 


|A1UA2U-..UAB|  =  ^|Ai|-J]|A/nAJ-|+  |A<  n  Aj  n  Ak\ 

i  i<j  i<j<k 

- +  (—  i)n+1 1  a  1  n  a2  n  •  •  •  n  a„\, 

where  the  first  sum  is  over  all  i,  the  second  sum  is  over  all  pairs  i,  j  with  i  <  j, 
the  third  sum  is  over  all  triples  i,  j,  k  with  /  <  j  <  k,  and  so  forth. 


PROBLEM  1.  Of  30  personal  computers  (PCs)  owned  by  faculty  members  in  a 
certain  university  department,  20  run  Windows,  eight  have  21  inch  monitors,  25 
have  CD-ROM  drives,  20  have  at  least  two  of  these  features,  and  six  have  all  three. 

(a)  How  many  PCs  have  at  least  one  of  these  features? 

(b)  How  many  have  none  of  these  features? 

(c)  How  many  have  exactly  one  feature? 
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Solution.  Let  W  be  the  set  of  PCs  running  under  Windows,  M  the  set  of  PCs 
with  21  inch  monitors,  and  C  the  set  with  CD-ROM  drives.  We  are  given  that 
\W\  =20,  \M\  =  8,  \C\  =25, 

I (W  n  M)  u  (w  n  C)  u  (M  n  C)\  =  20, 

and  |  W  fl  M  fT  C|  =  6.  By  the  Principle  of  Inclusion-Exclusion, 

20  =  j(W  n  M)  u  (W  n  c)  u  (M  n  C)\ 

=  \  w  n  M|  +  \w  nc|  +  \m  nc|  -  |(wnM)  n  (W  nc)| 

-  \(W  n  M)  n  (M  n  C)|  -  \(w  nc>n(Mn  C)| 

+  \(W  n  M)  n  (w  n  C)  n  (Mr  C) |. 

Since  each  of  the  last  four  terms  here  is  fl  M  n  Cj,  we  obtain 

20  =  |  w  n  m\  + 1  w  n  c\  +  \m  n  c\  -  2\  w  n  m  n  c\-, 

therefore, 

\W  C\C\  +  \W  C\M\  +  \M  C\C\  =  20  +  2(6)  =  32. 

(a)  The  number  of  PCs  with  at  least  one  feature  is 

IWUMUCI  =  |W|  +  |M|  +  |C| 

-|wnM|-|wnc|-|Mnc|  +  |wnMnc| 

=  20  +  8  +  25  -  (|Wn  A/|  +  |WnC|  +  |MOC|)  +6 
=  59  -  32  =  27. 

(b)  It  follows  that  30  —  27  =  3  PCs  have  none  of  the  specified  features. 

(c)  Since  the  number  of  computers  with  exactly  one  feature  is  the  number  with 

at  least  one  feature  less  the  number  with  at  least  two,  the  number  with 
exactly  one  is  27  —  20  =  7.  I 


Continuing  the  preceding  problem,  how  many  faculty  have  computers  with  exactly 
two  of  the  three  features  described? 

Some  readers  may  notice  that  the  problems  discussed  so  far  could  also  have 
been  solved  by  Venn  diagrams,  which  are  an  acceptable  approach  wherever  pos¬ 
sible.  When  more  than  three  sets  are  involved,  however,  Venn  diagrams  can  no 
longer  be  used.  Thus,  our  solutions,  which  avoid  Venn  diagrams,  illustrate  tech¬ 
niques  which  can  be  applied  in  general. 

PROBLEM  2.  Suppose  18  of  the  30  personal  computers  in  Problem  1  have 
Pentium  III  processors,  including  ten  of  those  running  Windows,  all  of  those  with 
21  inch  monitors,  and  15  of  those  with  CD-ROM  drives.  Suppose  also  that  every 
computer  has  at  least  one  of  the  four  features  now  specified.  How  many  have  at 
least  three  features? 
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Solution.  Let  P  be  the  set  of  PCs  with  Pentium  III  processors.  Then,  with  W, 
M ,  and  C  as  before,  the  question  asks  for  the  number 

n  =  \(W  n  m  nc)  u  (W  nMnP)  u  (w  ncnp)  u  (M  ncnP)\ 

=  \w  dm  nc\  +  \w  nM  n  p\  +  \wnc  n  p\  +  \m  nc  n  p\ 

-6\w  n  m  nc  n  p\+4\w  n  m  nc  n  p\  -  \  w  nM  nc  n  p\ 

=  \  w  n  m  n  c\  +  |W  n  m  n  p\  +  \w  n  c  n  p\  +  \m  n  c  n  p\ 

-3|  w  nM  nc  n  p\. 

We  are  given  that 

30=  |WUMUCUP| 

=  \W\  +  \M\  +  \C\  +  \P\ 

-  \  w  n  m\  —  \  w  n  c\  —  \  w  n  p\  —  \m  n  c\  —  \m  n  p\  —  \c  n  p\ 

+  |VknMnc|  +  |wnMnP|  +  |wncnP|  +  |MncnP| 

-  \w  n  m  nc  n  p\ 

=  \W\  +  \M\  +  \C\  +  \P\ 

-  \  w  n  m\  -  \  w  n  c\  -  \  w  n  p\  -  \m  n  c\  -  \m  n  p\  -  \c  n  p\ 

+  n +  2\w nM nc n  p\ 

because 

\wnMnc\  +  \wnMnp\  +  \wncnp\  +  \Mncnp\ 

=  n  +  3\w  n  m  nc  n  p\. 

In  Problem  1,  we  learned  that  |WnC|  +  |WnM|  +  |MnC|  =32  and  were  told 
that  | VkflMnC|  =  6.  Here,  since  M  c  p,  we  have  WnMncnP  =  VknMfiC 
and  so|WnMnCnPj=6.  Therefore, 

30  =  20  +  8  +  25  +  18  -  32  -  10  -  8  -  15  +  n  +  2(6), 

so  n  =30-71  +65  -  12=  12.  I 


PROBLEM  3.  How  many  integers  between  1  and  300  (inclusive)  are 

(a)  divisible  by  at  least  one  of  3,  5,  7? 

(b)  divisible  by  3  and  by  5  but  not  by  7? 

(c)  divisible  by  5  but  by  neither  3  nor  7? 

Solution.  Let  A,  B ,  and  C  be  the  sets  of  those  integers  between  1  and  300 
which  are  divisible  by  3,  by  5,  and  by  7,  respectively;  thus, 
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A  =  {n  |  1  <  n  <  300,  3  |  n) 

B  —  {n  |  1  <  n  <  300,  5  |  n } 

C  =  {n  |  1  <  n  <  300,  l\n). 

(a)  To  be  divisible  by  either  3  or  5  or  7  is  to  be  in  at  least  one  of  the  sets  A,  B, 

or  C.  Thus,  part  (a)  asks  us  to  find  |A  U  B  U  Cj.  To  determine  this  number,  we 
need  |A|,  |5|,  |C|,  |A  fl  S|,  |A  H  Cj,  \B  Pi  Cj  and  \A  fl  B  n  Cj.  The  elements  of 
A  are  3,  6,  9,  12, . . .  ,  300,  so  |A|  =  100.  Note  that  this  number  is  where, 

as  in  paragraph  3.1.6,  [jcJ  denotes  the  floor  of  the  real  number  a;. 

In  general,  for  natural  numbers  a  and  b,  the  number  of  positive  integers  less 
than  or  equal  to  a  and  divisible  by  b  is  [§  J.  (See  Exercise  18.)  Thus,  we  also 
have 

.  300  .  ,  300  . 

|B|  =  l_j=60  and  |C|  =  L— J=42. 


Next  we  have  to  find  A  D  B,  the  set  of  integers  between  1  and  300  which  are 
divisible  by  both  3  and  5.  Since  3  and  5  are  relatively  prime  numbers,  any  number 
divisible  by  each  of  them  must  be  divisible  by  their  product.  (See  Exercise  10  of 
Section  4.2.)  Therefore,  A  D  B  is  just  the  set  of  integers  between  1  and  300  which 
are  divisible  by  15  and,  similarly,  A  fl  C,  B  n  C,  and  A  fl  B  n  C  are  the  sets  of 
integers  between  1  and  300  which  are  divisible  by  21,  35,  and  105,  respectively. 
So  we  have 


.  300  .  .  300  . 

|An,8|  =  [  — J  =20.  |AHC|  =  L^fJ  =  14’ 

ienci  =  =8,  |Ansnc|  =  L— J  =2 

and  hence,  |  A  U  B  U  Cj  =  100  +  60  +  42  —  20  —  14  —  8  +  2  =  162. 


(b)  The  numbers  divisible  by  3  and  by  5,  but  not  by  7,  are  precisely  those  numbers 
in  (A(lfi)\C,a  set  of  cardinality  |A  fl  B\  —  |A  IT  B  fl  C|  =  20  —  2  =  18. 

(c)  The  numbers  divisible  by  5  but  by  neither  3  nor  7  are  those  in  B  \  (A  U  C),  a 
set  of  cardinality  |B|  —  \B  D  (A  U  C)|.  Since 


Bn(AUC)  =  (fiflA)U(Bfl  C), 


the  Principle  of  Inclusion-Exclusion  gives 

I b  n  (A  u  C) |  =  \b  n  A|  +  \b  n  Cj  -  |  (B  n  A)  n  (B  nc)  | 

=  |snA|  +  |Bnc|-|snAnc| 

because  (finA)D(BflC)  =  fiDAnC.  Therefore,  |fin(AUC)|  =  20+8  —  2  =  26 
and  the  number  we  seek  is  |5|  —  26  =  60  —  26  =  34.  I 


In  Problem  3,  we  characterized  the  set  of  integers  divisible  by  both  3  and  5  as 
those  divisible  by  15.  In  general,  for  any  natural  numbers  a  and  b,  how  can  we 
characterize  the  set  of  natural  numbers  divisible  by  both  a  and  b? 
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Answers  to  Pauses 


Suppose  the  Sieve  of  Eratosthenes  (4.3.5)  is  used  to  enumerate  all  primes  between 
1  and  100.  How  many  integers  will  remain  after  the  first  five  steps  of  the  proce¬ 
dure? 

1.  See  equation  (3)  on  p.  44. 

2.  The  number  of  computers  with  exactly  two  of  the  features  is  the  number  with 
at  least  two  less  the  number  with  exactly  three;  that  is,  20  —  6  =  14. 

3.  This  set  is  just  the  set  of  integers  divisible  by  the  least  common  multiple  of 
a  and  b  since  a  | n,  b  | n  if  and  only  if  lcm(a,  b)\n.  (See  Section  4.2.) 

4.  The  integers  which  have  been  crossed  out  after  five  steps  are  those  which  are 
divisible  by,  but  not  equal  to,  at  least  one  of  2,  3,  5,  7,  11.  Letting  A,  B,  C, 
D,  and  E  be  the  sets  of  integers  between  2  and  100  which  are  divisible  by 
2,  by  3,  by  5,  by  7,  and  by  11,  respectively,  the  set  of  numbers  divisible  by 
at  least  one  of  these  has  cardinality 

|  A  U  B  U  C  U  D  U  E\  =  (50  +  33  +  20  +  14  +  9) 


+  (3  +  2+  1  +  1)  =  79. 

(There  are  no  terms  arising  from  intersections  of  four  or  five  sets,  since 
these  intersections  are  empty.)  These  79  numbers,  with  the  exception  of  2, 
3,  5,  7,  11,  are  those  which  have  been  crossed  out  after  five  steps.  So  25 
numbers  remain.  (Remember  that  the  list  of  numbers  used  with  the  Sieve  of 
Eratosthenes  starts  with  2.) 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  In  a  group  of  15  pizza  experts,  ten  like  Canadian 
bacon,  seven  like  anchovies,  and  six  like  both. 

(a)  How  many  people  like  at  least  one  of  these  top¬ 
pings? 

(b)  How  many  like  Canadian  bacon  but  not  anchovies? 

(c)  How  many  like  exactly  one  of  the  two  toppings? 

(d)  How  many  like  neither? 

2.  Multiple  personality  disorder  (MPD)  is  a  condition  in 
which  different  personalities  exist  within  one  person 
and  at  various  times  control  that  person’s  behavior.  In 
a  recent  survey  of  people  with  MPD,  it  was  reported 
that  "98%  had  been  emotionally  abused,  89%  had  been 
physically  abused,  and  most  had  experienced  both  types 
of  abuse.”  Make  this  statement  more  precise. 

3.  Among  the  30  students  registered  for  a  course  in  discrete 
mathematics,  15  people  know  the  JAVA  programming 
language,  12  know  HTML,  and  five  know  both  of  these 
languages. 


(a)  How  many  students  know  at  least  one  of  JAVA  or 
HTML? 

(b)  How  many  students  know  only  JAVA? 

(c)  How  many  know  only  HTML? 

(d)  How  many  know  exactly  one  of  the  languages, 
JAVA  and  HTML? 

(e)  How  many  students  know  neither  JAVA  nor  HTML? 

4.  [BB]  In  a  recent  survey  of  college  graduates,  it  was 
found  that  200  had  undergraduate  degrees  in  arts,  95 
had  undergraduate  degrees  in  science,  and  120  had  grad¬ 
uate  degrees.  Fifty-five  of  those  with  undergraduate  arts 
degrees  had  also  a  graduate  degree,  40  of  those  with 
science  degrees  had  a  graduate  degree,  25  people  had 
undergraduate  degrees  in  both  arts  and  science,  and  five 
people  had  undergraduate  degrees  in  arts  and  science 
and  also  a  graduate  degree. 

(a)  How  many  people  had  at  least  one  of  the  types  of 
degrees  mentioned? 
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(b)  How  many  people  had  an  undergraduate  degree  in 
science  but  no  other  degree? 

5.  The  owner  of  a  corner  store  stocks  popsicles,  gum,  and 
candy  bars.  After  school  one  day,  he  is  swamped  by  an 
influx  of  15  school  children.  They  are  in  and  out  of  his 
store  in  minutes.  Later,  the  clerk  reports  that  ten  children 
purchased  popsicles,  seven  purchased  gum,  twelve  pur¬ 
chased  candy  bars,  five  purchased  popsicles  and  gum, 
six  purchased  popsicles  and  candy  bars,  and  two  pur¬ 
chased  gum  and  candy  bars.  The  owner  is  very  upset. 
Why? 

6.  (a)  In  a  group  of  82  students,  59  are  taking  English,  46 

are  taking  mathematics,  and  12  are  taking  neither  of 
these  subjects.  How  many  are  taking  both  English 
and  math? 

(b)  In  a  group  of  97  students,  the  number  taking  English 
is  twice  the  number  taking  math.  Fifty-three  students 
take  exactly  one  of  these  subjects  and  15  are  taking 
neither  course.  How  many  students  are  taking  math? 
How  many  are  taking  English? 

7.  [BB]  Of  the  2300  delegates  at  a  political  convention, 
1542  voted  in  favor  of  a  motion  to  decrease  the  deficit, 
569  voted  in  favor  of  a  motion  dealing  with  environ¬ 
mental  issues,  and  1197  voted  in  favor  of  a  motion 
not  to  increase  taxes.  Of  those  voting  in  favor  of  the 
motion  concerning  environmental  issues,  327  also  voted 
to  decrease  the  deficit,  and  92  voted  not  to  increase 
taxes.  Eight  hundred  and  thirty-nine  people  voted  to 
decrease  the  deficit  while  also  voting  against  increas¬ 
ing  taxes,  but  of  these  839,  only  50  voted  also  in  favor 
of  the  motion  dealing  with  the  environment. 

(a)  How  many  delegates  did  not  vote  in  favor  of  any  of 
the  three  motions? 

(b)  How  many  of  those  who  voted  against  increasing 
taxes  voted  in  favor  of  neither  of  the  other  two 
motions? 

8.  Seven  members  of  a  group  of  nineteen  people  dislike 
the  New  Democratic  Party  (NDP),  ten  dislike  the  Lib¬ 
erals,  eleven  dislike  the  Conservatives,  and  six  dislike 
the  Canadian  Alliance.  Five  of  the  group  dislike  both  the 
Liberals  and  the  New  Democratic  Party,  five  dislike  both 
the  NDP  and  the  Conservatives,  six  dislike  the  Liberals 
and  Conservatives,  three  dislike  the  New  Democratic 
and  Canadian  Alliance  parties,  four  dislike  the  Liber¬ 
als  and  the  Alliance,  and  five  dislike  the  Conservatives 
and  the  Alliance.  Three  people  dislike  the  Conservatives, 
Liberals,  and  the  NDP,  while  two  dislike  the  Liberals, 
NDP,  and  Alliance;  three  dislike  the  Conservatives,  New 


Democrats,  and  Alliance;  and  four  dislike  the  Conser¬ 
vatives,  Liberals,  and  Alliance.  Two  people  dislike  all 
four  parties.  How  many  members  of  the  group  like  all 
four  parties? 

9.  The  owner  of  a  convenience  store  reports  that  of  890 
people  who  bought  bottled  fruit  juice  in  a  recent 
week, 

•  750  bought  orange  juice 

•  400  bought  apple  juice 

•  100  bought  grapefruit  juice 

•  50  bought  citrus  punch 

•  328  bought  orange  juice  and  apple  juice 

•  25  bought  orange  juice  and  grapefruit  juice 

•  12  bought  orange  juice  and  citrus  punch 

•  35  bought  apple  juice  and  grapefruit  juice 

•  8  bought  apple  juice  and  citrus  punch 

•  33  bought  grapefruit  juice  and  citrus  punch 

.  4  bought  orange  juice,  apple  juice,  and  citrus  punch 

•  17  bought  orange  juice,  apple  juice,  and  grapefruit 
juice 

•  2  bought  citrus  punch,  apple  juice,  and  grapefruit  juice 

•  9  bought  orange  juice,  grapefruit  juice  and  citrus 
punch. 

Determine  the  numbers  of  people  who  bought 

(a)  [BB]  all  four  kinds  of  juice 

(b)  grapefruit  juice,  but  nothing  else 

(c)  exactly  two  kinds  of  juice 

(d)  more  than  two  kinds  of  juice 

10.  Suppose  U  is  a  set  containing  75  elements  and  A i,  A2, 
A3,  A4  are  subsets  of  U  with  the  following  properties: 

•  each  subset  contains  28  elements; 

•  the  intersection  of  any  two  of  the  subsets  contains  1 2 
elements; 

•  the  intersection  of  any  three  of  the  subsets  contains  5 
elements; 

•  the  intersection  of  all  four  subsets  contains  1  element. 

(a)  [BB]  How  many  elements  belong  to  none  of  the 
four  subsets? 

(b)  How  many  elements  belong  to  exactly  two  of  the 
four  subsets? 

11.  [BB]  How  many  integers  between  1  and  500  are 

(a)  divisible  by  3  or  5? 

(b)  divisible  by  3  but  not  by  5  or  6? 

12.  (a)  How  many  integers  less  than  500  are  relatively 

prime  to  500? 

(b)  How  many  integers  less  than  9975  are  relatively 
prime  to  9975? 
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13.  [BB]  How  many  integers  between  1  and  250  are  divisi¬ 
ble  by  at  least  one  of  the  three  integers  4,  6, 
and  15? 

14.  (a)  [BB]  How  many  integers  between  1  and  1000  (inclu¬ 

sive)  are  not  divisible  by  2,  3,  5,  or  7? 

(b)  How  many  integers  between  1  and  1000  (exclusive) 
are  not  divisible  by  2,  3,  5,  or  7? 

15.  Find  the  number  of  integers  between  1  and  10,000  inclu¬ 
sive  which  are 

(a)  divisible  by  at  least  one  of  3,  5,  7,  1 1; 

(b)  divisible  by  3  and  5,  but  not  by  either  7  or  1 1; 

(c)  divisible  by  exactly  three  of  3,  5,  7,  11; 

(d)  divisible  by  at  most  three  of  3,5,7,  11. 

16.  How  many  integers  between  1  and  106  (inclusive)  are 
neither  perfect  squares  nor  perfect  cubes?  (See  Exer¬ 
cise  30,  Section  4.3.) 

17.  [BB]  How  many  primes  are  less  than  200?  Explain  your 
answer. 


18.  [BB]  Let  a  and  b  be  natural  numbers.  Show  that  the 
number  of  positive  integers  less  than  or  equal  to  a  and 
divisible  by  b  is  ]_fj- 

19.  Suppose  A  and  B  are  finite  sets  with  |AUB|  <  |A|  +  |£|. 
Show  that  A  and  B  have  an  element  in  common. 

20.  Suppose  A  and  B  are  finite  sets.  Prove  that  \A  x  B\  = 

Ml  x  1*1. 

21.  Suppose  A  and  B  are  subsets  of  a  universal  set  U .  Find 
a  formula  for  |A  U  Bc\  and  prove  that  it  is  correct. 

22.  Let  A,  B  and  C  be  sets.  Prove  that 

(a)  [BB]  | {A  ©  B)  n  C|  =  \A  D  C|  +  \B  fl  C| 
-2|AnBnc| 

(b)  |A0fi©C|  =  |A|  +  |B|  +  |C|-2|An£|-2|An 
c |  -2|Bnc|  +4|Ansnc| 

23.  Prove  the  Principle  of  Inclusion-Exclusion  by  mathe¬ 
matical  induction.  (For  this,  the  result  of  Exercise  20, 
Section  5.1  will  be  helpful.) 


6.2  The  Addition  and  Multiplication  Rules 


6.2.1  PRINCIPLE  OF 
INCLUSION-EXCLUSION 
(DISJOINT  SETS) 


The  Principle  of  Inclusion-Exclusion  gives  a  formula  for  the  number  of  elements 
in  the  union  of  a  finite  number  of  finite  sets.  If  the  sets  are  pairwise  disjoint 
(that  is,  if  the  intersection  of  any  pair  of  sets  is  empty),  then  the  formula  takes  a 
particularly  simple  form. 

Given  n  pairwise  disjoint  finite  sets,  A  \ ,  A 2,  . . .  ,  An,  then 

n 

|Ai  U  A2  U  •  ■  •  U  A„|  =  ]T|A,|. 

!  =  1 


6.2.2  ADDITION 
RULE 


In  this  section,  we  use  the  general  term  event  to  mean  the  outcome  of  any 
process  or  experiment,  for  example,  the  course  a  student  selects  to  complete  his 
or  her  degree.  Events  are  mutually  exclusive  if  no  two  of  them  can  occur  together; 
for  example,  if  a  student  selects  one  course  to  complete  his  or  her  degree,  then 
Mathematics  2320  and  Statistics  2500  are  mutually  exclusive. 

Notice  that  if  n  sets  A  \ ,  A 2, ...  ,  A„  correspond  to  events  with  the  elements 
of  A;  representing  the  ways  in  which  the  corresponding  event  can  occur,  then  the 
events  are  mutually  exclusive  if  and  only  if  the  A,  are  pairwise  disjoint.  Thus, 
the  Principle  of  Inclusion-Exclusion  for  disjoint  sets  translates  into  the  following 
basic  principle  of  counting. 

The  number  of  ways  in  which  precisely  one  of  a  collection  of  mutually  exclusive 
events  can  occur  is  the  sum  of  the  numbers  of  ways  in  which  each  event  can 


occur. 


6.2.3  MULTIPLICATION 
RULE 
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For  example,  a  student  who  needs  one  course  to  complete  her  degree  decides 
to  take  either  computer  science  or  statistics  and  makes  lists  of  the  courses  for 
which  she  is  eligible.  The  computer  science  courses  might  comprise  the  set 
A\  ={CS2602,  CS2700,  CS2721,  CS2800,  CS2500}  and  the  statistics  courses 
the  set  A2  ={S2510,  S2511,  S3500).  There  are  5  +  3  =  8  ways  in  which  this 
student  can  register  for  a  course  because  the  two  sets  here  are  disjoint:  The  events 
“register  for  CS”  and  “register  for  Statistics”  are  mutually  exclusive. 

It  is  Friday  night  and  Ursula  has  been  invited  to  two  parties  but  feels  more 
inclined  to  go  to  a  movie;  there  are  six  new  movies  in  town.  Assuming  she  is  not 
pennitted  two  engagements  in  the  same  evening  (that  is,  assuming  that  the  events 
“party”  and  “movie”  are  mutually  exclusive),  there  are  2  +  6  =  8  possible  ways 
to  spend  her  evening. 

In  the  two  examples  just  given,  the  individual  events  A,  were  clearly  specified. 
By  contrast,  the  addition  rule  is  probably  most  often  applied  in  situations  where 
this  is  not  the  case.  A  problem  asking  for  the  number  of  ways  in  which  a  certain 
event  can  occur  can  sometimes  be  analyzed  by  partitioning  the  event  into  mutually 
exclusive  subevents  (cases),  precisely  one  of  which  must  occur,  and  then  applying 
the  addition  rule.  It  is  for  the  problem  solver  to  find  a  convenient  set  of  subevents. 

PROBLEM  4.  In  how  many  ways  can  you  get  a  total  of  six  when  rolling  two 
dice? 

Solution.  The  event  “get  a  six"  is  the  union  of  the  mutually  exclusive  subevents. 

•  A| :  “two  3’s” 

.  A2:  “a  2  and  a  4” 

.  A3:  “a  1  and  a  5” 

Event  A 1  can  occur  in  one  way,  A2  can  occur  in  two  ways  (depending  on  which 
die  lands  4),  and  A3  can  occur  in  two  ways,  so  the  number  of  ways  to  get  a  six 
is  1  +  2  +  2  =  5.  I 

In  Section  6.1,  we  observed  that  if  A  and  B  are  finite  sets,  then  the  Cartesian 
product  A  x  B  contains  |A|  x  |5|  elements.  More  generally,  if  A],  A2,  •  ■  ■  ,  A„ 
are  finite  sets, 

n 

1 A 1  X  A2  x  •  •  •  X  An |  =  Yi  lA'I- 

I  —  1 

As  before,  thinking  of  A,  as  the  set  of  ways  a  certain  event  can  occur,  we  are  led 
to  another  basic  principle  of  counting. 

The  number  of  ways  in  which  a  sequence  of  events  can  occur  is  the  product  of 
the  numbers  of  ways  in  which  each  individual  event  can  occur. 

If  Ursula’s  parents  had  a  change  of  mind  and  permitted  her  to  go  to  a  party 
after  the  show,  there  would  be  6x2  =  12  ways  in  which  she  could  spend  her  Friday 
night,  as  shown  in  Fig  6.2.  The  figure  also  illustrates  why  this  number  is  the  car¬ 
dinality  of  the  Cartestian  product  Movie  x  Party,  where  Movie  =  { 1 , 2,  3,  4,  5,  6} 
and  Party  =  {1,2}. 
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Movie  Party 
'<2 
2<2 

6<2 

Figure  6.2  There  are  6  x  2  =  12  ways  to  choose  one  of  six  movies  and  then  one  of  two  parties. 


Suppose  that  there  are  five  computer  science  courses  and  three  statistics 
courses  in  which  a  student  is  eligible  to  enrol  and  that  this  student  wishes  to 
take  a  course  in  each  subject.  There  are  5  x  3  =  15  ways  in  which  the  choice  can 
be  made. 

If  Jack,  Maurice,  and  Tom  each  roll  two  dice,  then  the  number  of  ways  in 
which  Jack  can  get  a  total  of  six,  Maurice  a  total  of  four,  and  Tom  a  total  of  three 
is  5  x  3  x  2  =  30  since  there  are  five  ways  to  get  a  total  of  six,  as  shown  before, 
three  ways  to  get  a  total  of  four  (two  2’s,  a  1  and  a  3,  or  a  3  and  a  1),  and  two 
ways  to  get  a  total  of  three. 


PROBLEM  5.  How  many  numbers  in  the  range  1000-9999  do  not  have  any 
repeated  digits? 

Solution.  Imagine  enumerating  all  numbers  of  the  desired  type  in  the  spirit  of 
Fig  6.2.  There  are  nine  choices  for  the  first  digit  (any  of  1-9).  Once  this  has  been 
chosen,  there  remain  still  nine  choices  for  the  second  (the  chosen  first  digit  cannot 
be  repeated  but  0  can  now  be  used).  There  are  now  eight  choices  for  the  third 
digit  and  seven  for  the  fourth.  Altogether,  there  are  9x9x8x7  =  4536  possible 
numbers.  I 


PROBLEM  6.  License  plates  in  the  Canadian  province  of  Ontario  consist  of 
four  letters  followed  by  three  of  the  digits  0-9  (not  necessarily  distinct).  How 
many  different  license  plates  can  be  made  in  Ontario? 

Solution.  There  are  26  ways  in  which  the  first  letter  can  be  chosen,  26  ways  in 
which  the  second  can  be  chosen,  and  similarly  for  the  third  and  fourth.  By  the 
multiplication  rule,  the  number  of  ways  in  which  the  three  letters  can  be  chosen 
is  26  x  26  x  26  x  26  =  264.  By  the  same  reasoning  there  are  103  ways  in  which 
the  final  three  digits  of  an  Ontario  license  plate  can  be  selected  and,  all  in  all, 
264  x  103  =  456,976,000  different  license  plates  which  can  be  manufactured  by 
the  government  of  Ontario  under  its  current  system.  I 


How  many  different  license  plates  can  be  made  by  a  government  which  permits 
four  letters  followed  by  three  digits  or  three  digits  followed  by  four  letters? 
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PROBLEM  7.  Darlene  wishes  to  fly  from  Portland,  Maine,  to  Portland,  Oregon. 
There  are  two  possible  routes  she  can  take,  namely,  Portland-Detroit-Portland  and 
Portland-Chicago-Portland.  Two  flights  from  Portland  to  Detroit  each  connect 
with  three  flights  from  Detroit  to  Portland,  while  two  flights  from  Portland  to 
Chicago  each  connect  with  four  flights  from  Chicago  to  Portland. 

(a)  Find  the  total  number  of  different  flight  sequences  from  Portland,  Maine,  to 
Portland,  Oregon,  via  Detroit. 

(b)  Repeat  (a)  if  “via  Detroit”  is  omitted. 

Solution. 

(a)  The  multiplication  rule  applies.  There  are  2  x  3  =  6  possibilities. 

(b)  By  (a),  there  are  six  routes  via  Detroit.  Also,  there  are  2x4  =  8  routes 

via  Chicago.  Using  the  addition  rule,  the  number  of  routes  via  Detroit  or 
Chicago  is  6  +  8  =  1 4.  I 


To  determine  the  number  of  ways  in  which  some  event  can  occur,  it  is  often 
helpful  to  break  the  event  into  mutually  exclusive  subevents  or  cases,  one  of  which 
must  occur.  By  counting  the  number  of  ways  in  which  each  case  can  occur  and 
adding  these  numbers,  the  addition  rule  gives  the  total  number  of  ways  in  which 
the  entire  event  can  occur.  Such  a  subtle  use  of  the  addition  rule  is  probably  its 
most  important  virtue. 

PROBLEM  8.  How  many  even  numbers  in  the  range  1 00-999  have  no  repeated 
digits? 

Solution.  The  question  is  equivalent  to  asking  for  the  number  of  ways  in  which 
one  can  write  down  an  even  number  in  the  range  100-999  without  repeating  digits. 
This  event  can  be  partitioned  into  two  mutually  exclusive  cases. 

Case  1:  The  number  ends  in  0. 

In  this  case,  there  are  nine  choices  for  the  first  digit  (1-9)  and  then  eight 
for  the  second  (since  0  and  the  first  digit  must  be  excluded).  So  there  are 
9  x  8  =  72  numbers  of  this  type. 

Case  2:  The  number  does  not  end  in  0. 

Now  there  are  four  choices  for  the  final  digit  (2,  4,  6,  and  8),  then  eight  choices 
for  the  first  digit  (0  and  the  last  digit  are  excluded),  and  eight  choices  for  the 
second  digit  (the  first  and  last  digits  are  excluded).  There  are  4  x  8  x  8  =  256 
numbers  of  this  type. 

By  the  addition  rule,  there  are  72  +  256  =  328  even  numbers  in  the  range 
100-999  with  no  repeated  digits.  I 

In  Case  2  of  Problem  8,  suppose  we  choose  the  last  digit,  then  the  middle,  and 
finally  the  first  digit.  By  considering  two  subcases, 

2a:  the  middle  digit  is  0 
2b:  the  middle  digit  is  not  0 
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show  again  that  there  are  256  even  numbers  in  the  range  100-999  with  no  repeated 
digits  and  last  digit  different  from  0. 

■BB  Answer  Problem  8  again  by  considering  the  four  cases: 

1.  first  two  digits  are  even, 

2.  first  two  digits  are  odd, 

3.  first  digit  even,  second  digit  odd, 

4.  first  digit  odd,  second  digit  even. 

These  last  two  Pauses  illustrate  an  important  point.  There  are  usually  several 
ways  to  approach  a  combinatorial  problem! 


PROBLEM  9.  A  typesetter  (long  ago)  has  before  him  26  trays,  one  for  each 
letter  of  the  alphabet.  Each  tray  contains  ten  copies  of  the  same  letter.  In  how 
many  ways  can  he  form  a  three  letter  “word”  which  requires  at  most  two  different 
letters?  By  "word,”  we  mean  any  sequence  of  three  letters— xpt,  for  example — not 
necessarily  a  real  word  from  the  dictionary.  Two  “ways”  are  different  unless  they 
use  the  identical  pieces  of  type. 

Solution.  The  event  “at  most  two  different  letters”  is  comprised  of  two  mutually 
exclusive  cases: 

Case  1:  The  first  two  letters  are  the  same. 

Here,  the  third  letter  can  be  arbitrary;  that  is,  any  of  the  258  letters  which 
remain  after  the  first  two  are  set  can  be  used.  So  the  number  of  ways  in  which 
this  case  can  occur  is  260  x  9  x  (260  —  2)  =  603,720. 

Case  2:  The  first  two  letters  are  different. 

In  this  case,  the  third  letter  must  match  one  of  the  first  two,  so  it  must  be  one 
of  the  1 8  letters  remaining  in  the  two  trays  used  for  the  first  two  letters.  The 
number  of  ways  in  which  this  case  occurs  is  260  x  250  x  18  =  1,170,000. 

By  the  addition  rule,  the  number  of  ways  to  form  a  word  using  at  most  two 
different  letters  is  603,720+  1,170,000  =  1,773,720.  I 


With  many  counting  problems,  there  are  several  different  ways  of  arriving  at 
the  same  answer.  Another  approach  to  the  previous  problem  is  to  count  the  total 
number  of  ways  of  forming  words  and  to  subtract  from  this  the  number  of  cases  in 
which  all  three  letters  are  different.  (Recall  that  |AC|  =  \U\  —  \A\.)  There  are  260  x 
259  x  258  =  17,373,720  possible  words  of  which  260  x  250  x  240  =  15,600,000 
consist  of  three  different  letters.  So  the  number  of  ways  of  forming  words  in  which 
at  most  two  different  letters  are  used  is  17,373,720  —  15,600,000  =  1,773,720 
as  before. 


PROBLEM  10.  Continuing  Problem  9,  determine  the  number  of  ways  of  form¬ 
ing  words  which  use  exactly  two  different  letters. 

Solution.  We  subtract  from  the  1,773,720  cases  requiring  at  most  two  different 
letters  the  number  of  ways  of  forming  words  which  use  just  one  letter.  So  the 
answer  is  1,773,720  -  (260  x  9  x  8)  =  1,755.000.  I 
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It  is  instructive  to  observe  that  this  number  could  also  be  obtained  by  con¬ 
sideration  of  three  mutually  exclusive  cases: 

Case  1:  The  first  two  letters  are  the  same  but  the  third  is  different. 

Case  2:  The  first  and  third  letters  are  the  same  but  the  second  is  different. 

Case  3:  The  second  and  third  letters  are  the  same  but  different  from  the  first. 

By  this  method,  we  obtain  (260  x  9  x  250)  +  (260  x  250  x  9)  +  (250  x  260  x9)  = 
3(260  x  250  x  9)  =  1,755,000,  as  before. 

In  Problems  9  and  10,  we  counted  the  number  of  ways  of  forming  certain  three 
letter  words,  not  the  number  of  different  words  which  can  be  formed.  How  many 
different  “words”  can  be  formed  in  each  of  these  problems? 


6.2.4  PROPOSITION 


A  set  of  cardinality  n  contains  2”  subsets  (including  the  empty  set  and  the  entire 
set  itself). 


Proof  There  are  several  ways  to  prove  this  fundamental  result.  We  present  one  here 
which  uses  the  ideas  of  this  section. 

Given  n  objects  a\,ci2 . a„,  each  subset  corresponds  to  a  sequence  of 

choices.  Is  ci\  in  the  subset?  Is  «2  in  the  subset?  Finally,  is  an  in  the  subset? 
There  are  two  answers  to  the  first  question,  two  for  the  second,  and  so  on.  In  all, 
there  are 


2  x  2  x  •  •  •  x  2  =  2" 

' - V - ' 

n  factors 


ways  in  which  all  n  choices  can  be  made.  Thus,  there  are  2"  subsets.  I 


Answers  to  Pauses 


5.  As  in  Problem  6,  the  number  of  different  plates  with  the  letters  first  is  264  x 
103  =  456,976,000.  Similarly,  the  number  of  different  plates  with  the  digits 
first  is  also  103  x  264  =  456,976,000.  By  the  addition  rule,  the  number  of 
plates  altogether  is  456,976,000  +  456,976.000  =  913,952,000. 

6.  Case  2a:  The  middle  digit  is  0. 

Here,  there  are  four  choices  for  the  last  digit,  one  choice  for  the  middle  digit, 
and  eight  choices  for  the  first  (which  can  be  any  of  1-9  except  that  chosen 
for  the  last  digit).  There  are  32  such  numbers. 

Case  2b:  The  middle  digit  is  not  0. 

In  this  case,  there  are  again  four  choices  for  the  last  digit,  there  are  eight 
choices  for  the  middle  (which  is  not  0  and  not  the  last  digit),  and  just  seven 
choices  for  the  first  digit  since  neither  of  the  digits  used  for  the  last  or  middle 
position  is  eligible.  There  are  4  x  8  x  7  =  224  such  numbers. 

As  before,  there  are  32  +  224  =  256  even  numbers  in  Case  2. 

7.  Case  1:  The  first  digit  can  be  chosen  in  four  ways  (2,  4,  6,  or  8),  then  the 
second  digit  in  four  ways  (avoid  repeating  the  first),  and  the  third  in  three. 
There  are  4  x  4  x  3  =  48  ways  for  Case  1  to  occur. 

Case  2:  The  first  digit  can  be  chosen  in  five  ways,  the  second  in  four,  and 
the  last  in  five.  There  are  5  x  4  x  5  =  100  ways  for  Case  2  to  occur. 
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Case  3:  There  are  four  choices  for  the  first  digit,  five  for  the  second,  and  four 
for  the  third  (just  avoid  repeating  the  first).  There  are  4  x  5  x  4  =  80  ways 
for  Case  3  to  occur. 

Case  4:  There  are  five  choices  for  the  first  digit,  five  choices  for  the  second, 
and  four  for  the  third  (avoid  repeating  the  second).  There  are  5  x  5  x  4  =  100 
ways  for  Case  4  to  occur. 

By  the  addition  rule,  the  number  we  seek  is  48  +  100  +  80  +  100  =  328,  as 
before. 

8.  In  Problem  9,  there  are  262  different  words  which  have  the  same  first  two 
letters  and  (26) (25)(2)  which  have  the  first  two  letters  different.  So  the  answer 
is  262  +  (26)(25)(2)  =  1976. 

In  Problem  10,  there  are  1976  —  26  =  1950  words  which  use  exactly  two 
different  letters. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  Roberta  needs  two  courses  to  complete  her 

university  degree.  There  are  three  sections  of  the 
first  course  which  meet  in  the  morning  and  four 
afternoon  sections.  In  how  many  different  ways  can 
she  select  the  two  courses  she  needs? 

(b)  [BB]  Suppose  Roberta  needs  just  one  course  to  com¬ 
plete  her  degree.  In  how  many  ways  can  she  select 
this  course? 

2.  A  building  supplies  store  carries  metal,  wood,  and  plas¬ 
tic  moldings.  Metal  and  wood  molding  comes  in  two 
different  colors.  Plastic  molding  comes  in  six  different 
colors. 

(a)  How  many  choices  of  molding  does  this  store 
offer? 

(b)  If  each  kind  and  each  color  of  molding  comes  in  four 
different  lengths,  how  many  choices  does  the  con¬ 
sumer  have  in  the  purchase  of  one  piece  of 
molding? 

3.  [BB]  In  how  many  of  the  three-digit  numbers  000-999 
are  all  the  digits  different? 

4.  (a)  How  many  numbers  in  the  range  100-999  have  no 

repeated  digits? 

(b)  Without  using  the  result  of  Problem  8.  determine 
how  many  odd  numbers  in  the  range  100-999  have 
no  repeated  digits. 

(c)  Answer  (b)  again,  this  time  making  use  of  (a)  and 
Problem  8. 


5.  (a)  [BB]  Some  license  plates  in  California  consist 

of  one  of  the  digits  1-9,  followed  by  three  (not 
necessarily  distinct)  letters  and  then  three  of  the  dig¬ 
its  0-9  (not  necessarily  distinct).  How  many  possi¬ 
ble  license  plates  can  be  produced  by  this  method? 

(b)  [BB]  Other  California  license  plates  consist  of  one 
of  the  digits  1-9  followed  by  three  letters  and  then 
three  of  the  digits  0-9.  (The  same  digit  or  letter  can 
be  used  more  than  once.)  How  many  license  plates 
of  this  type  can  be  made? 

(c)  [BB]  What  is  the  maximum  number  of  license  plates 
in  California  that  can  be  made  assuming  plates  have 
one  of  the  two  types  described  in  (a)  and  (b)? 

6.  In  Mark  Salas,  the  1991  Detroit  Tigers  had  probably 
the  only  palindromic  player  in  major  league  baseball 
(certainly,  the  only  palindromic  catcher).  A  palindrome 
is  a  word  that  reads  the  same  forward  and  backward, 
like  SALAS. 

(a)  How  many  five-letter  palindromes  (not  necessarily 
real  words)  can  be  made  from  the  letters  of  the 
English  alphabet? 

(b)  How  many  eight-letter  palindromes  are  possible? 

(c)  How  many  “words”  not  exceeding  eight  letters  in 
length  are  palindromes? 

(d)  One  of  the  most  famous  palindromes  of  all  time  is 
one  which  might  have  been  uttered  by  Napoleon 
(had  his  native  tongue  been  English):  ABLE  WAS 
I  ERE  I  SAW  ELBA.  How  many  palindromes  (not 
necessarily  of  real  words)  are  this  of  this  length? 
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7.  [BB]  From  a  group  of  13  men,  6  women,  2  boys,  and  4 
girls, 

(a)  In  how  many  ways  can  a  man,  a  woman,  a  boy,  and 
a  girl  be  selected? 

(b)  In  how  many  ways  can  a  man  or  a  girl  be  selected? 

(c)  In  how  many  ways  can  one  person  be  selected? 

8.  In  how  many  ways  can  one  draw  from  a  standard  deck 
of  52  playing  cards 

(a)  a  heart  or  a  spade? 

(b)  an  ace  or  a  king? 

(c)  a  card  numbered  2  through  1 0? 

(d)  a  card  numbered  2  through  10  or  a  king? 

9.  [BB]  Using  only  the  digits  1,  3,  4,  and  7, 

(a)  how  many  two-digit  numbers  can  be  formed? 

(b)  how  many  three-digit  numbers  can  be  formed? 

(c)  how  many  two-  or  three-digit  numbers  can  be 
formed? 

10.  How  many  possible  telephone  numbers  consist  of  seven 
digits,  the  first  two  in  the  range  2-9  (inclusive),  the  third 
in  the  range  1-9  (inclusive),  and  each  of  the  last  four  in 
the  range  0-9  (inclusive)? 

11.  [BB]  A  company  produces  combination  locks,  the  com¬ 
binations  consisting  of  three  different  numbers  in  the 
range  0-59  (inclusive)  which  must  be  dialed 
in  order.  How  many  different  combinations  are  possi¬ 
ble? 

12.  In  how  many  ways  can  two  adjacent  squares  be  selected 
from  an  8  x  8  chess  board? 

13.  [BB]  New  parents  wish  to  give  their  baby  one,  two, 
or  three  different  names.  In  how  many  ways  can  the 
baby  be  named  if  the  parents  will  choose  from  a  book 
containing  500  names? 

14.  There  are  three  different  roads  from  Cupids  to  Harbour 
Grace  and  five  different  roads  from  Harbour  Grace  to 
Heart's  Desire. 

(a)  How  many  different  routes  are  there  from  Cupids  to 
Heart’s  Desire  via  Harbour  Grace? 

(b)  How  many  different  round  trips  are  there  from  Cu¬ 
pids  to  Heart's  Desire  and  back,  passing  through 
Harbour  Grace  each  way? 

(c)  Repeat  (b)  if  you  don’t  want  to  drive  on  any  road 
more  than  once. 

15.  How  many  three-digit  numbers  contain  the  digits  2  and 
5  but  none  of  the  digits  0,  3,  7? 


16.  You  are  dealt  four  cards  from  a  standard  deck  of  52 
playing  cards.  In  how  many  ways  can  you  get 

(a)  [BB]  four  of  a  kind  (four  2's,  or  four  kings,  etc.)? 

(b)  two  (different)  pairs? 

(c)  four  of  a  kind  or  two  (different)  pairs? 

(d)  three,  but  not  four,  of  a  kind? 

(e)  at  least  one  pair? 

17.  Two  dice  are  rolled. 

(a)  In  how  many  ways  can  a  total  of  eight  arise? 

(b)  In  how  many  ways  can  a  total  of  seven  arise? 

(c)  In  how  many  ways  can  a  total  of  eight  or  seven 
arise? 

(d)  In  how  many  ways  can  one  get  “doubles"  (the  dice 
land  with  the  same  side  up)? 

18.  [BB]  Make  a  table  which  shows  all  the  totals  which  are 
possible  when  two  dice  are  rolled  and  the  number  of 
ways  in  which  each  total  can  occur. 

19.  (a)  In  how  many  ways  can  two  dice  land? 

(b)  In  how  many  ways  can  five  dice? 

(c)  In  how  many  ways  can  n  dice  land? 

(d)  If  n  dice  are  rolled,  in  how  many  ways  can  they 
land  not  all  showing  the  same  number? 

20.  A  coin  is  tossed  four  times. 

(a)  [BB]  Make  a  list  of  all  the  possible  outcomes.  For 
example,  the  sequence  HHTH,  representing  head, 
head,  tail,  head,  is  one  possibility.  How  many  pos¬ 
sibilities  are  there  altogether? 

(b)  In  how  many  ways  can  you  get  exactly  one  head? 

(c)  In  how  many  ways  can  you  get  exactly  two  heads? 

(d)  In  how  many  ways  can  you  get  exactly  three  heads? 

(e)  In  how  many  ways  can  you  get  at  least  one  head? 

21.  (a)  How  many  five-digit  numbers  can  be  formed  using 

the  digits  0-9  inclusive  if  repetitions  are  allowed? 
(Leading  0’s  are  not  allowed:  07392,  for  example, 
should  not  be  considered  a  five-digit  number.) 

(b)  How  many  five-digit  numbers  can  be  formed  if  rep¬ 
etition  is  not  allowed? 

(c)  How  many  five-digit  numbers  have  one  or  more 
repeated  digits? 

22.  How  many  possible  license  plates  can  be  manufactured 
if  a  license  plate  consists  of  three  letters  followed  by 
three  digits  and 

(a)  [BB]  the  digits  must  be  distinct;  the  letters  can  be 
arbitrary? 

(b)  the  letters  must  be  distinct;  the  digits  can  be  arbi¬ 
trary? 

(c)  the  digits  and  the  letters  must  be  distinct? 
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23.  The  complete  menu  from  a  local  gourmet  restaurant  is 
shown  in  Table  6.1. 


Table  6.1 

Appetizers 

Beverages 

cod  au  gratin 

house  wine 

squid  ribs 

imported  wine 

snail’s  tails 

soft  drink 

smoked  seaweed 

coffee 

tea 

milk 

Entree 

vegetables 

flipper  pie 

spinach 

cod  tongues 

fiddleheads 

caplin  fillets 

cabbage 

turnip 

potatoes 


(a)  In  how  many  ways  can  John  plan  a  four-course 
meal,  one  course  from  each  group? 

(b)  Suppose  Mary  wants  two  appetizers  and  one  course 
from  each  other  group.  How  many  meals  are  possi¬ 
ble? 

(c)  Mike  wants  a  two-course  meal.  How  many  choices 
does  he  have? 

(d)  Suppose  Rod  wants  at  most  one  course  from  each 
group  and  at  least  one  course  from  some  group.  How 
many  meals  are  possible? 

24.  Suppose  n  —  p“'  /?“2  •  •  •  p?r  is  the  decomposition  of  n 
into  the  product  of  distinct  primes  p\,  p2,  ...  ,  />,-■  How 
many  (unordered)  pairs  [s,t]  of  positive  integers  satisfy 
both  st  =  n  and  gcdfs,  f)  =  1? 

25.  [BB]  Let  A  =  [a\,a2, . . .  ,  an }  be  a  set  of  n  elements 
and  B  =  {0,  1}. 

(a)  Show  that  there  are  2"  functions  from  A  to  B. 

(b)  Show  that  there  are  2n  —  2  onto  functions  from  A 
to  B. 

26.  Let  A  =  {ai,  a2 . a„]  be  a  set  of  n  elements,  n  >  1, 

and  B  =  {1,2,3). 

(a)  Prove  that  there  are  3"  functions  A  —>■  B. 

(b)  How  many  functions  A  — ►  B  are  not  onto? 

(c)  How  many  onto  functions  are  there  A  — >  B2  Justify 
your  answer. 


6.3  The  Pigeon-Hole  Principle 


6.3.1  THE  PIGEON-HOLE 
PRINCIPLE 


This  section  deals  with  a  deceptively  simple  restatement  of  a  certain  fact  about 
functions  between  finite  sets,  namely,  that  if  A  and  B  are  finite  sets  with  |  A\  >  \B\, 
no  function  / :  A  — ►  B  can  be  one-to-one.  For  example,  no  function  {1,  2,  3}  — > 
{.v,  y)  can  be  one-to-one. 

Suppose  that  B  is  a  set  of  bird  houses,  A  is  a  set  of  pigeons,  and  /  is  a 
function  which  assigns  a  bird  house  to  each  pigeon.  The  statement  that  /  is  not 
one-to-one  is  just  the  observation  that  at  least  two  pigeons  are  confined  to  the 
same  house.  Thus,  we  obtain  the  so-called  Pigeon-Hole  Principle. 

If  n  objects  are  put  into  m  boxes  and  n  >  m,  then  at  least  one  box  contains  two 
or  more  of  the  objects. 


We  said  that  this  principle  is  “deceptively”  simple.  The  deception  often  lies 
in  recognizing  that  the  principle  can  be  applied.  For  example,  within  any  group  of 
13  people,  there  must  be  two  who  have  their  birthdays  in  the  same  month.  (The 
people  are  the  objects  and  the  months  of  the  year  are  the  boxes.) 
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PROBLEM  11.  Given  five  points  inside  a  square  whose  sides  have  length  2, 
prove  that  two  are  within  sfl  of  each  other. 


Solution.  Subdivide  the  square  into  four  squares  with 
sides  of  length  1,  as  shown.  By  the  Pigeon-Hole  Princi¬ 
ple,  at  least  two  of  the  five  chosen  points  must  lie  in,  or  on 
the  boundary  of,  the  same  smaller  square.  But  these  points 
are  at  most  \fl  apart  (the  length  of  the  diagonal  of  a  smaller 
square).  I 


Jl/ 

1  1 


We  continue  with  some  subtler  applications  of  the  Pigeon-Hole  Principle. 


PROBLEM  12.  Show  that  among  n  +  1  arbitrarily  chosen  integers,  there  must 
exist  two  whose  difference  is  divisible  by  n. 

Solution.  Denote  the  integers  a\ ,  ai . an+\ .  As  we  know,  the  n  congruence 

classes  0,1,...  ,  n  —  1  of  integers  mod  n  partition  the  integers  into  disjoint  sets 
whose  union  is  Z.  (See  Proposition  4.4.5.)  Put  each  integer  a ,•  into  its  own  con¬ 
gruence  class.  By  the  Pigeon-Hole  Principle,  two  integers,  a\  and  c ij  will  go  to 
the  same  class  k.  Since  a,-  e  k,aj  =  k  (Proposition  4.4.3)  and,  similarly,  a]  =  k. 
Thus,  ai  =  a],  so  ax  =  cij  (mod  n)\  that  is,  n  |  (a,-  —  a j),  as  desired.  I 

The  solution  to  this  problem  involves  a  property  of  the  integers  which  is  often 
used  in  Pigeon-Hole  problems:  If  two  numbers  lie  in  the  same  congruence  class 
of  integers  mod  «,  then  their  difference  is  divisible  by  n. 

PROBLEM  13.  Prove  that  in  any  list  of  ten  natural  numbers,  a\,  «2,  . . .  ,  a io, 
there  is  a  string  of  consecutive  items  of  the  list,  at,  ae+\ ,  fl^+2,  •  •  • ,  whose  sum  is 
divisible  by  10.  (We  include  the  possibility  that  the  “string”  consists  of  just  one 
number.) 

Solution.  Consider  the  ten  numbers 

a\,  a i  +  02.  •  •  •  >  a\  +  «2  H - f  oio- 

If  any  of  these  is  divisible  by  10,  we  have  the  desired  conclusion;  otherwise, 

each  number  lies  in  one  of  the  nine  congruence  classes  1,2 . 9  of  integers 

mod  10.  By  the  Pigeon-Hole  Principle,  two  of  them  must  lie  in  the  same  class  and 
hence  have  a  difference  divisible  by  10.  Again  we  reach  the  desired  conclusion 
because,  if  s  >  t,  (t?i  +  ai  +  ■  ■  ■  +  as)  —  (a i  +  ci2  +  •  ■  •  +  a,)  =  at+  \  +  a,+ 2 
+  •  ■  ■  +  as  ■  1 


PROBLEM  14.  Martina  has  three  weeks  to  prepare  for  a  tennis  tournament.  She 
decides  to  play  at  least  one  set  every  day  but  not  more  than  36  sets  in  all.  Show  that 
there  is  a  period  of  consecutive  days  during  which  she  will  play  exactly  21  sets. 

Solution.  Suppose  Martina  plays  a\  sets  on  day  1,  at  sets  011  day  2,  and  so 
on,  <321  sets  on  the  last  day  of  her  preparation  period.  Consider  the  21  natural 
numbers  a\ ,  a\  +02,  ...  ,  a  1  +  a2  +  •  •  ■  +  «2i  •  Since  each  ax  >  1  and  since  the 
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6.3.2  PIGEON-HOLE 
PRINCIPLE  (STRONG 
FORM) 


EXAMPLE  16 


sum  a\  +  aj  +  ■  ■  ■  +  «2i  is  the  total  number  of  sets  Martina  will  play,  we  have 
1  <  a i  <  a\  +  «2  <  •••  <  a\  +  a2  + ■  •  •  +  a2\  <  36. 

Now  the  only  natural  number  between  1  and  36  which  is  divisible  by  21  is  21 
itself.  Therefore,  each  of  these  sums  is  either  21  or  it  belongs  to  one  of  the 

20  nonzero  congruence  classes  of  integers  mod  21.  In  the  first  case,  we  have 

a\  +  «2  +  •  •  •  +  =  21  for  some  t;  so  Martina  would  play  precisely  21  sets  on 

days  1  through  i  giving  the  desired  result.  In  the  second  case,  by  the  Pigeon-Hole 
Principle,  two  sums  lie  in  the  same  congruence  class  and  hence  have  difference 
divisible  by  21.  As  in  Problem  13,  we  obtain  a,+ \  +  al+ 2  +  ■  ■  ■  +  as  divisible  by 

21  for  some  s  and  t.  Since  this  is  a  number  between  1  and  36,  it  must  equal  21 

and  we  conclude  that  Martina  plays  21  games  on  days  t  +  1  through  s.  I 


PROBLEM  15.  Suppose  Martina  has  11  weeks  to  prepare  for  her  tournament 
and  that  she  intends  to  play  at  least  one  set  a  day  and  at  most  132  practice  sets 
in  all.  Draw  again  the  conclusion  that  during  some  period  of  consecutive  days, 
Martina  will  play  precisely  21  sets. 

Solution.  The  approach  used  before  doesn’t  work  this  time  (see  Exercise  9),  so 
we  find  a  slightly  different  solution.  We  let  bj  be  the  number  of  sets  Martina 
plays  on  days  1  through  i  inclusive  (bj  =  a\  +  a 2  +  ■■■  +  +  in  the  notation  of 
Problem  14)  and  consider  the  154  numbers 

b\ ,  Z>2,  •  •  •  ,  b~p ,  b\  +  21,  b2  +  21, . . .  ,  bj-j  +  21 . 

The  largest  number  here,  ^>77  +  21,  is  at  most  132  +  21  =  153.  By  the  Pigeon-Hole 
Principle,  we  conclude  that  two  are  the  same.  Since  bj  >  bj  if  i  >  j,  the  only  way 
for  two  to  be  equal  is  for  b,  —  bj  -\-  21  for  some  i  and  j  with  i  >  j.  Therefore, 
bi  —  bj  =  21  and  Martina  plays  21  sets  on  days  j  +  1  through  i.  I 

We  noted  earlier  that  in  any  group  of  13  people,  at  least  two  must  have 
birthdays  in  the  same  month.  The  same  must  hold  in  any  larger  group,  but  surely 
something  stronger  is  also  true.  If  at  most  two  people  in  a  group  of  30  had 
birthdays  in  the  same  month,  we  would  account  for  at  most  24  people.  Thus,  in 
any  group  of  30,  there  must  be  at  least  three  with  birthdays  in  the  same  month. 

Recall  the  definition  of  the  ceiling  function  given  in  3.1.6.  For  any  real  number 
x,  1+1  means  the  least  integer  which  is  greater  than  or  equal  to  x.  For  example, 
[3.5]  =  4,  r0.24]  =  1,  F-2.91  =  -2  and  =  3- 

If  n  objects  are  put  into  in  boxes  and  n  >  in,  then  some  box  must  contain  at  least 

111  ob.iects- 


If  there  are  44  chairs  positioned  around  five  tables  in  a  room,  some  table  must 
have  at  least  fyl  =  9  chairs  around  it.  A 

To  prove  the  strong  form  of  the  Pigeon-Hole  Principle,  we  establish  the  truth 

of  its  contrapositive.  Note  that 

n  n  r  n 

—  < - 1-1  and  hence  - 

m  m  m 


1-1< 


m 
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because,  for  any  real  number  x, 

X  <  [x]  <  X  +  l. 

Thus,  if  a  box  contains  fewer  than  [■£■]  objects,  then  it  contains  at  most  —  1 
and  so  fewer  than  ^  objects.  If  all  m  boxes  are  like  this,  we  account  for  fewer 
than  m  x  A  —n  objects. 


PROBLEM  17.  In  any  group  of  six  people,  at  least  three  must  be  mutual  friends 

or  at  least  three  must  be  mutual  strangers. 

Solution.  Pretend  you  are  one  of  the  six  people  and  put  the  other  n  =  5  people 
into  m  =  2  “boxes”  labeled  “my  friends”  and  “strangers  to  me.”  Since  [ n/m\  = 
[5/2]  =  3,  by  the  strong  form  of  the  Pigeon-Hole  Principle,  at  least  one  of  these 
boxes  must  contain  three  people.  Suppose  three  people  are  your  friends.  If  any  two 
of  these  three  are  friends,  then  together  with  you,  we  have  a  set  of  three  mutual 
friends.  If  no  two  of  these  three  are  friends,  then  these  three  people  are  mutual 
strangers.  In  either  case,  we  have  one  of  the  desired  conclusions.  The  remaining 
possibility,  that  three  of  the  five  people  are  strangers  to  you,  leads  again  to  the 
desired  conclusion  with  an  argument,  which  we  omit,  similar  to  the  one  already 
presented.  I 

Provide  the  details  of  this  omitted  argument. 


Answers  to  Pauses 


9.  Either  the  three  people  in  this  group  are  mutual  friends  or  there  are  two  who 
are  strangers.  Since  each  of  these  two  is  a  stranger  to  you,  we  have  a  set  of 
three  mutual  strangers. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Show  that  in  any  group  of  eight  people,  at  least 
two  have  birthdays  which  fall  on  the  same  day  of  the 
week  in  any  given  year. 

2.  Write  down  any  six  natural  numbers.  Verify  that  there 
is  a  string  of  consecutive  numbers  in  your  list  (possibly 
a  “string”  of  just  one  number)  whose  sum  is  divisible 
by  6.  Prove  that  this  must  always  be  the  case. 

3.  In  any  list  of  n  natural  numbers,  prove  that  there  must 
always  exist  a  string  of  consecutive  numbers  (possibly 
just  one  number  in  the  string)  whose  sum  is  divisible 
by  n. 

4.  [BB]  In  a  group  of  100  people,  several  will  have  their 
birthdays  in  the  same  month.  At  least  how  many  must 
have  birthdays  in  the  same  month?  Why? 

5.  A  standard  deck  of  playing  cards  contains  52  cards 
divided  into  four  suits  (club,  diamond,  heart,  spade) 


of  13  denominations  (Ace,  2,  3,  ...  ,  10,  Jack,  Queen, 
King).  How  many  cards  of  a  single  suit  must  be  present 
in  any  set  of  n  cards?  How  many  cards  of  the  same 
denomination?  Explain. 

6.  (a)  [BB]  If  20  processors  are  interconnected  and  every 

processor  is  connected  to  at  least  one  other,  show 
that  at  least  two  processors  are  directly  connected 
to  the  same  number  of  processors. 

(b)  [BB]  Is  the  result  of  (a)  still  true  without  the  assump¬ 
tion  that  every  processor  is  connected  to  at  least  one 
other?  Explain. 

7.  Thirty  buses  are  to  be  used  to  transport  2000  refugees 
from  Gander  to  St.  John’s,  Newfoundland.  Each  bus  has 
80  seats.  Assume  one  seat  per  passenger. 

(a)  Prove  that  one  of  the  buses  will  carry  at  least  67 
passengers. 
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(b)  Prove  that  one  of  the  buses  will  have  at  least  14 
empty  seats. 

8.  In  a  gathering  of  30  people,  there  are  104  different  pairs 
of  people  who  know  each  other. 

(a)  Show  that  some  person  must  have  at  least  seven 
acquaintances. 

(b)  Show  that  some  person  must  have  fewer  than  seven 
acquaintances. 

9.  [BB]  Try  to  solve  Problem  15  with  the  method  used  to 
solve  Problem  14.  What  goes  wrong? 

10.  [BB]  Brad  has  five  weeks  to  prepare  for  his  driver’s  test. 
His  mother  volunteers  to  drive  with  him  for  either  15 
minutes  or  a  half  hour  every  day  until  the  test,  but  not 
for  more  than  15  hours  in  all.  Show  that  during  some 
period  of  consecutive  days,  Brad  and  his  mother  will 
drive  for  exactly  eight  and  three  quarter  hours. 

11.  Linda  has  six  weeks  to  prepare  for  an  examination  and 
at  most  50  hours  available  to  study.  She  plans  to  study 
at  least  an  hour  a  day  and  a  whole  number  of  hours  each 
day.  Show  that  no  matter  how  she  schedules  her  study 
time,  there  is  a  period  of  consecutive  days  during  which 
she  will  have  studied  exactly  33  hours. 

12.  George  has  promised  to  increase  his  vocabulary  by 
learning  the  meanings  of  90  new  words  during  his 
summer  holidays.  Suppose  he  has  53  days  in  which 
to  accomplish  this  task  and  will  learn  at  least  one 
new  word  a  day.  Show  that  during  some  span  of 
consecutive  days,  George  will  learn  precisely  15  new 
words. 

13.  [BB]  In  a  12-day  period,  a  small  business  mailed  195 
bills  to  customers.  Show  that  during  some  period  of 
three  consecutive  days,  at  least  49  bills  were  mailed. 

14.  The  circumference  of  a  roulette  wheel  is  divided  into  36 

sectors  to  which  the  numbers  1,2,3 . 36  are  assig¬ 

ned  in  some  arbitrary  manner.  Show  that  there  must  be 
three  consecutive  sectors  whose  assigned  numbers  add 
to  at  least  56. 

15.  How  many  seats  in  a  large  auditorium  have  to  be  occu¬ 
pied  to  be  certain  that  at  least  three  people  seated  have 
the  same  first  and  last  initials? 

16.  fBB]  Of  any  26  points  within  a  rectangle  measuring 
20  cm  by  15  cm,  show  that  at  least  two  are  within  5  cm 
of  each  other. 

17.  Of  any  five  points  chosen  within  an  equilateral  triangle 
whose  sides  have  length  1,  show  that  two  are  within  a 
distance  of  ^  of  each  other. 


18.  A  cake  is  in  the  shape  of  a  regular  hexagon  with  each 
of  its  sides  exactly  30  cm  long.  Seven  flowers  of  icing 
adorn  the  top.  Show  that  at  least  two  flowers  are  not 
more  than  30  cm  apart. 

19.  Let  5  =  {2,  3,  5,  7,  1 1.  13,  17,  19)  be  the  set  of  prime 
numbers  less  than  20.  If  A  is  a  subset  of  S ,  we  can  form 
the  sum  and  product  of  the  elements  of  A.  For  example, 
if  A  =  {7,  1 1.  13},  then  the  associated  sum  is  7  +  1 1  + 
13  =  31  and  the  associated  product  is  7(1 1 )( 1 3)  = 
1001. 

(a)  Use  the  Pigeon-Hole  Principle  to  show  that  there 
are  four  nonempty  subsets  of  S  with  the  same  sum. 

(b)  Are  there  two  nonempty  subsets  of  S  with  the  same 
product?  Explain. 

20.  fBB]  Given  any  positive  integer  n ,  show  that  some  mul¬ 
tiple  of  n  is  an  integer  whose  representation  in  base  10 
requires  just  3’s  and  0's. 

[Hint:  Let  M\  =  3,  M2  =  33,  M3  =  333 . Then 

think  about  the  remainders  when  each  of  these  numbers 
is  divided  by  //.] 

21.  Show  that  some  multiple  of  2002  consists  of  a  string  of 
1  ’s  followed  by  a  string  of  0’s. 

22.  [BB]  Show  that  the  decimal  expansion  of  a  rational  num¬ 
ber  must,  after  some  point,  become  periodic  or  stop. 
[Hint:  Think  about  the  remainders  in  the  process  of  long 
division.] 

23.  One  hundred  and  one  numbers  are  chosen  from  the  set  of 
natural  numbers  { 1 , 2,  3, . . .  ,  200}.  Show  that  one  must 
be  a  multiple  of  another.  [Hint :  Any  natural  number  can 
be  written  in  the  form  2 ka  with  k  >  0  and  a  odd.] 

24.  In  a  room  where  there  are  more  than  50  people  with 
ages  between  I  and  100,  show  that 

(a)  [BB]  Either  two  people  have  the  same  age  or  there 
are  two  people  whose  ages  are  consecutive  integers. 

(b)  Either  two  people  have  the  same  age  or  one  person’s 
age  is  a  multiple  of  another’s. 

(c)  Some  of  the  people  shake  hands.  Show  that  at  least 
two  shook  the  same  number  of  hands.  (“No  hands” 
is  a  possibility.) 

(d)  Some  people  shake  hands.  Show  that  among  those 
who  shook  at  least  one  hand,  two  people  shook  the 
same  number  of  hands. 

25.  (a)  Let  A  be  a  set  of  seven  (distinct)  natural  numbers 

none  of  which  exceeds  21.  Prove  that  the  sums  of 
the  elements  in  all  the  nonempty  subsets  of  A  are 
not  distinct. 
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(b)  Improve  the  result  of  (a)  by  showing  that  the  result 
holds  under  the  assumption  that  the  integers  of  A 
do  not  exceed  23. 

(c)  Assume  none  of  the  elements  of  A  exceeds  12.  At 
least  how  many  subsets  of  A  must  have  the  same 
sum? 

26.  (a)  [BB]  Show  that  in  any  group  of  ten  people,  there 
is  either  a  set  of  three  mutual  strangers  or  a  set  of 

four  mutual  friends. 

(b)  Show  that  in  any  group  of  20  people,  there  is  either 
a  set  of  four  mutual  strangers  or  a  set  of  four  mutual 
friends. 


27.  Suppose  a\,a2 . a \ o  are  ten  integers  between  1  and 

100  (inclusive). 

(a)  Prove  that  there  exist  two  subsets  {a,-, ,  a;2, . . .  ,  a,r } 
and  { ctj ,  ,aj2,...  ,  ajs )  with  equal  sums. 

(b)  Prove  that  there  exist  two  disjoint  subsets  {a,-, , 

cij2 . a,r)  and  jay, ,  aj2 . ajs }  with  equal 

sums. 

28.  Prove  the  (first  form  of  the)  Pigeon-Hole  Principle 
by  mathematical  induction  on  m,  the  number  of 
boxes. 

29.  Given  any  52  integers,  show  that  there  exist  two  whose 
sum  or  difference  is  divisible  by  100.  [Hint:  x\  ±  .v, . | 
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1.  Suppose  A  and  B  are  nonempty  finite  sets  and  |AUB|  < 
| A |  +  |B|.  Show  that  4  0  5^0. 

2.  Using  the  Principle  of  Inclusion-Exclusion,  find  the 
number  of  integers  between  1  and  2000  (inclusive) 
which  are  divisible  by  at  least  one  of  2,  3,  5,  7. 

3.  John  Sununu  was  once  the  governor  of  New  Hamp¬ 
shire,  and  his  name  reminds  one  of  the  authors  of  a 
palindrome. 

(a)  What  is  a  palindrome  and  what  made  the  author 
think  of  this? 

(b)  How  many  seven-letter  palindromes  (not  necessarily 
real  words)  begin  with  the  letter  “S’"  and  contain  at 
most  three  different  letters? 

(c)  How  many  seven-letter  palindromes  (not  necessarily 
real  words)  are  there  in  all? 

(d)  Name  a  palindromic  rock  band  of  the  late  1970s 
which  is  still  popular  today. 

4.  Two  Math  2320  students  are  arguing  about  the  number 
of  palindromes  with  nine  and  ten  letters.  One  claims 
there  are  more  with  ten  letters  than  with  nine;  the  other 


says  there  are  the  same  number  in  each  case.  Who  is 
right?  Explain. 

5.  Four  sets  A\,  Ai,  A3,  A4  have  the  property  that  A,-  D 
Aj  C\  Aic  =  0  whenever  i,  j.  k  are  distinct.  In  addition, 
| A/  C\  A j\  —  1  whenever  i  ^  j  and  |A,|  =  5  for  all  i. 
Find  the  number  of  elements  that  belong  to  at  least  one 
of  the  sets. 

6.  Seventy  cars  sit  on  a  parking  lot.  Thirty  have  stereo 
systems,  30  have  air  conditioners  and  40  have  sun  roofs. 
Thirty  of  the  cars  have  at  least  two  of  these  three  options 
and  10  have  all  three.  How  many  cars  on  the  lot  have  at 
least  one  of  these  three  options?  How  many  have  exactly 
one? 

7.  State  the  strong  form  of  the  Pigeon-Hole  Principle. 

8.  Show  that  among  18  arbitrarily  chosen  integers,  there 
must  exist  two  whose  difference  is  divisible  by  17. 

9.  Use  the  Pigeon-Hole  Principle  and  the  definition  of  infi¬ 
nite  set  to  prove  that  Z  is  infinite. 

10.  Show  that,  of  any  ten  points  chosen  within  an  equilateral 
triangle  whose  sides  have  length  1,  there  are  two  whose 
distance  apart  is  at  most 
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7.1  Permutations 

The  registrar  of  a  school  must  schedule  six  examinations  in  an  eight-day  period 
and  has  promised  the  students  not  to  schedule  more  than  one  examination  per 
day.  How  many  different  schedules  can  be  made?  The  answer  to  this  question 
involves  a  straightforward  application  of  the  multiplication  rule.  The  first  exam 
can  be  given  on  any  of  the  eight  days,  but  once  this  day  has  been  settled,  the 
second  exam  can  be  given  on  only  one  of  the  remaining  seven  days.  There  are 
8  x  7  =  56  pairs  of  days  on  which  the  first  two  examinations  can  be  scheduled. 
Once  the  first  two  exams  have  been  scheduled,  there  remain  six  days  on  which 
to  schedule  the  third  exam.  Continuing  to  reason  this  way,  we  see  that  there  are 
8x7x6x5x4x3  =  20, 160  possible  schedules. 

In  how  many  ways  can  a  blue,  a  white,  and  a  red  marble  be  put  into  10 
numbered  boxes?  If  there  is  no  limit  on  the  number  of  marbles  which  can  be 
put  into  a  box,  there  are  10  possible  boxes  into  which  the  blue  marble  can  be 
placed  and  then  10  choices  for  the  white  one  and  10  choices  for  the  red  one, 
giving  103  =  1000  possibilities  altogether.  Suppose,  on  the  other  hand,  we  are 
not  allowed  to  put  more  than  one  marble  into  a  box.  As  before,  there  are  10 
possible  boxes  for  the  blue  marble,  but  now  there  are  just  9  possible  boxes  for 
the  white  one  and  8  for  the  red.  Altogether,  there  are  10  x  9  x  8  =  720  possi¬ 
bilities. 

The  solution  to  a  counting  problem  often  involves  the  product  of  consecutive 
integers.  Remember  that  »!  denotes  the  product  of  all  the  natural  numbers  from 
1  to  n  (inclusive): 


nl  =  n(n  -  1 )(«  -  2)  ■  •  •  (3)(2)(1) 


(see  Definition  5.1.2).  The  notation  P(n,r )  denotes  the  product  of  the  first  r 
factors  of  nl. 
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7.1.1  DEFINITION 


Pause  1 


For  integers  n  and  r,  n  >  1,  0  <  r  <  n,  the  symbol  P(n,r)  is  defined  by 
P(n,  0)  =  1  and,  for  r  >  0, 

Pin,  r)  =  n(n  —  l)(n  —  2)  •  •  •  (n  —  r  +  1) . 

r  factors 

For  example,  P(6,  2)  =  6  ■  5  =  30  (the  first  two  factors  of  6!),  P( 7,  3)  = 
7-6-5  =  210  (the  first  three  factors  of  7!),  P( 8,  5)  =  8-  7-  6-  5-  4  =  6720, 
P(  12,  0)  =  1,  and  so  on. 

Find  P( 5,3),  P(4,4),  and  P( 7,2). 

Among  the  combinatorial  problems  to  which  P(n,r)  is  the  answer  is  the 
following  one,  which  we  have  already  illustrated. 


7.1.2  PROPOSITION 


Given  natural  numbers  r  and  n  with  r  <  n,  the  number  of  ways  to  place  r  marbles 
of  different  colors  into  n  numbered  boxes,  at  most  one  marble  to  a  box,  is  P(n,  r). 


Notice  that 

n(n  —  1  )(n  —  2)  •  •  •  (n  —  r  4-  1)  (n  —  r)(n  —  r  —  1)  •  •  •  (3)(2)(1)  =  n\. 


Pin,  r)  = 


(a  —  r)! 


Pause  2 


a  formula  which  holds  also  for  r  —  0  and  r  =  n  because  P(n,  0)  =  1  and 

0!  =  1. 

Some  people  have  been  known  to  protest  that  this  formula  is  difficult  to  use 
because,  for  example,  they  are  unable  to  evaluate  numbers  like  P(25,  1)  =  ||| 
with  their  calculator,  25 !  being  too  large  to  compute.  It  is  good  for  us  all  to  notice 
that 


25!  _  25  -24  -23---X  -X 
24!  24  •  yi  ■  •  -X  -X 


Find  $},  and  P( 7,0). 

For  us,  the  symbol  P(n,  r)  is  primarily  a  notational  device  which  makes  it  easy 
to  write  down  the  answers  to  certain  combinatorial  problems.  We  caution  against 
trying  to  fit  a  problem  involving  a  straightforward  application  of  the  multiplication 
rule  into  the  context  of  marbles  and  boxes. 


PROBLEM  1.  Flow  many  pairs  of  dance  partners  can  be  selected  from  a  group 
of  12  women  and  20  men? 

Solution.  The  first  woman  can  be  paired  with  any  of  20  men,  the  second  woman 
with  any  of  the  remaining  19  men,  the  third  with  any  of  the  remaining  18,  and  so 
on.  There  are  20  ■  19  ■  18  •  •  ■  9  =  P(20,  12)  possible  couples.  I 
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7.1.3  DEFINITION 


EXAMPLES  2 


7.1.4  DEFINITION 


. . 

7.1.5  PROPOSITION 


There  is  a  second  question,  having  to  do  with  permutations,  to  which  P{n ,  r) 
is  the  answer  and  which  explains  the  “P”  in  “ P(n ,  r).” 

A  permutation  of  a  set  of  distinct  symbols  is  an  arrangement  of  them  in  a  line  in 
some  order. 

ab  and  ba  are  permutations  of  the  symbols  a  and  b\  1642,  4126,  and  6241  are 
permutations  of  the  symbols  1,  2,  4,  and  6.  A 

The  permutations  1642,  4126,  and  6241  are  also  examples  of  4 -permutations 
of  the  symbols  1,  2,  3,  4,  5,  6,  that  is,  permutations  of  the  symbols  1,  2.  3,  4, 
5,  6  taken  four  at  a  time.  Here  are  some  3-permutations  of  the  26  letters  of  the 
alphabet:  zxy,  aqr,  cat,  how. 

For  natural  numbers  r  and  n,  r  <  n,  an  r -permutation  of  n  symbols  is  a  permu¬ 
tation  of  r  of  them,  that  is,  an  arrangement  of  r  of  the  symbols  in  a  line  in  some 
order. 

As  a  direct  consequence  of  the  multiplication  rule,  we  have  the  following. 

The  number  of  permutations  of  n  symbols  is  n\.  The  number  of  /--permutations 
of  n  symbols  is  P(n,  r). 

For  example,  there  are  3!  =  6  permutations  of  a,  b,  c,  namely,  abc,  acb,  bac, 
bca,  cab,  and  cba.  (There  are  three  choices  for  the  first  symbol  and  then  two  for 
the  second  and  one  for  the  third.)  There  are  6  •  5  ■  4  •  3  =  360  =  P( 6,  4)  ways  in 
which  four  of  the  six  creatures — man,  woman,  boy,  girl,  dog,  cat — can  walk  in  a 
line  down  a  road,  one  after  the  other. 

PROBLEM  3.  There  are  7!  =  5040  ways  in  which  seven  people  can  form  a 
line.  In  how  many  ways  can  seven  people  form  a  circle? 

Solution.  A  circle  is  determined  by  the  order  of  the  people  to  the  right  of  any 
one  of  the  individuals,  say  Eric.  There  are  six  possibilities  for  the  person  on  Eric’s 
right,  then  five  possibilities  for  the  next  person,  four  for  the  next,  and  so  on.  The 
number  of  possible  circles  is  6!  =  720.  I 


Another  way  to  obtain  6!  is  to  relate  the  two  problems,  line  and  circle.  Each 
circle  determines  seven  lines,  determined  by  asking  the  people  to  join  hands  and 
then  breaking  the  circle  at  one  of  the  seven  people. 

No.  of  circles  x  7  =  no.  of  lines  =  7! 


7! 

Therefore,  no.  of  circles  =  —  =  6! 

7 


PROBLEM  4.  A  man,  a  woman,  a  boy,  a  girl,  a  dog,  and  a  cat  are  walking 
down  a  long  and  winding  road  one  after  the  other. 

(a)  In  how  many  ways  can  this  happen? 

(b)  In  how  many  ways  can  this  happen  if  the  dog  comes  first? 
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(c)  In  how  many  ways  can  this  happen  if  the  dog  immediately  follows  the 
boy? 

(d)  In  how  many  ways  can  this  happen  if  the  dog  (and  only  the  dog)  is  between 
the  man  and  the  boy? 

Solution. 

(a)  There  are  6!  =  720  ways  for  six  creatures  to  form  a  line. 

(b)  If  the  dog  comes  first,  the  others  can  form  5!  lines  behind. 

(c)  If  the  dog  immediately  follows  the  boy,  then  the  dog-boy  pair  should  be 
thought  of  as  a  single  object  to  be  put  into  a  line  with  four  others.  There 
are  5!  =  120  such  lines. 

(d)  If  the  man,  dog,  and  boy  appear  in  this  order,  then  thinking  of  man-dog- 

boy  as  a  single  object  to  be  put  into  a  line  with  three  others,  we  see  that 
there  are  4!  possible  lines.  Similarly,  there  are  4!  lines  in  which  the 
boy,  dog,  and  man  appear  in  this  order.  So,  by  the  addition  rule,  there 
are  4!  +  4!  =  48  lines  in  which  the  dog  (and  only  the  dog)  is  between  the 
man  and  the  boy.  I 


PROBLEM  5.  In  how  many  ways  can  ten  adults  and  five  children  stand  in  a 
line  so  that  no  two  children  are  next  to  each  other? 

Solution.  Imagine  a  line  of  ten  adults  named  A,  B,  . . .  ,  J , 

xDxJxHxCxIxExBxAxGxFx, 

the  x’s  representing  the  11  possible  locations  for  the  children.  For  each  such 
line,  the  first  child  can  be  positioned  in  any  of  the  1  1  spots,  the  second  child 
in  any  of  the  remaining  10,  and  so  on.  Hence,  the  children  can  be  positioned  in 
1 1  •  10  •  9  •  8  ■  7  =  P{  1 1,  5)  ways.  For  each  such  positioning,  there  are  10!  ways  of 
ordering  the  adults  A, . . .  ,  7,  so,  by  the  multiplication  rule,  the  number  of  lines 
of  adults  and  children  is  10!  P{  11, 5).  I 

mmzmi  h°w  many  ways  can  ten  adults  and  five  children  stand  in  a  circle  so  that  no 
two  children  are  next  to  each  other? 


PROBLEM  6.  In  how  many  ways  can  the  letters  of  the  English  alphabet  be 
arranged  so  that  there  are  exactly  ten  letters  between  a  and  z? 

Solution.  There  are  P(24.  10)  arrangements  of  the  letters  of  the  alphabet  (exclu¬ 
ding  a  and  z)  taken  ten  at  a  time,  and  hence  2-  P(24,  10)  strings  of  12  letters,  each 
beginning  and  ending  with  an  a  and  a  z  (either  letter  coming  first  in  a  string).  For 
each  of  these  strings,  there  are  15!  ways  to  arrange  the  14  remaining  letters  and 
the  string.  So  there  are  altogether  2  •  P{ 24,  10)  ■  15!  arrangements  of  the  desired 
type.  I 

The  answer  to  Problem  6  is  also  30(24!).  Why? 

1.  P(5,  3)  =  5  •  4  ■  3  =  60;  P( 4,  4)  =  4  •  3  ■  2  ■  1  =  24;  P( 7,  2)  =  7  •  6  =  42. 

2.  fj$  =  20  •  19  •  18  =  6840;  ^  =  100  •  99  =  9900;  P( 7,  0)  =  ||  =  1. 


Answers  to  Pauses 
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3.  Arrange  the  adults  into  a  circle  in  one  of  9!  ways.  There  are  then  10  locations 
for  the  first  child,  9  for  the  second,  8  for  the  third,  7  for  the  fourth,  and  6  for 
the  fifth.  The  answer  is  9!(10  ■  9  •  8  •  7  •  6)  =  9!  ■  P(10,  5). 

4.  First  we  count  the  number  of  arrangements  of  the  prescribed  type  in  which 
a  precedes  z-  For  each  of  the  24!  permutations  of  the  letters  b  through  y, 
there  are  15  locations  for  a  after  which  the  position  of  z  is  fixed.  Thus,  there 
are  15(24!)  arrangements  with  a  preceding  z.  Similarly,  there  are  15(24!) 
arrangements  with  z  preceding  a  giving,  altogether,  2(1 5) (24!)  arrangements. 


EXERCISES 


The  symbol  /  BR  /  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  How  many  ways  are  there  to  distribute  eight  dif¬ 
ferent  books  among  thirteen  people  if  no  person  is  to 
receive  more  than  one  book? 

2.  Eight  horses  are  entered  in  a  race  in  which  a  first,  sec¬ 
ond,  and  third  prize  will  be  awarded.  Assuming  no  ties, 
how  many  different  outcomes  are  possible? 

3.  [BB]  A  club  has  ten  members.  In  how  many  ways  can 
they  choose  a  slate  of  four  officers  consisting  of  a  pres¬ 
ident,  vice  president,  secretary,  and  treasurer? 

4.  There  are  30  people  in  a  class  learning  about  permu¬ 
tations.  One  after  another,  eight  people  gradually  slip 
out  the  back  door.  In  how  many  ways  can  this  exodus 
occur? 

5.  [BB]  Find  the  number  of  ways  in  which  six  children  can 
ride  a  toboggan  if  one  of  the  three  girls  must  steer  (and 
therefore  sit  at  the  back). 

6.  Four  cats  and  five  mice  enter  a  race.  In  how  many  ways 
can  they  finish  with  a  mouse  placing  first,  second,  and 
third? 

7.  (a)  In  how  many  ways  can  ten  boys  and  four  girls  sit 

in  a  row? 

(b)  [BB]  In  how  many  ways  can  they  sit  in  a  row  if 
the  boys  are  to  sit  together  and  the  girls  are  to  sit 
together? 

(c)  In  how  many  ways  can  they  sit  in  a  row  if  the  girls 
are  to  sit  together? 

(d)  In  how  many  ways  can  they  sit  in  a  row  if  just  the 
girls  are  to  sit  together? 

8.  Answer  all  parts  of  Exercise  7  with  "circle"  instead  of 
"row.”  (In  every  circle,  people  sit  facing  inward.) 

9.  [BB]  In  how  many  ways  is  it  possible  to  sit  seven 
knights  at  a  round  table  if 


(a)  Friday,  Saturday,  and  Sunday  insist  on  sitting 
together? 

(b)  Wednesday  refuses  to  sit  next  to  Saturday  or  Sun¬ 
day? 

10.  [BB]  Let  X  and  Y  be  sets  with  \X\  =  m  and  |K|  =  n. 

(a)  How  many  bijective  functions  are  there  X  — ►  XI 

(b)  How  many  injective  functions  are  there  X  — *  Y? 

11.  How  many  permutations  of  the  letters  a.  b,  c,  d ,  e,  f.  g 
contain  neither  the  pattern  bge  nor  the  pattern  eafl 

12.  [BB]  In  how  many  ways  can  two  couples,  the  Nosewor- 
thys  and  the  Abbotts,  form  a  line  so  that 

(a)  the  Noseworthys  are  beside  each  other? 

(b)  the  Noseworthys  are  not  beside  each  other? 

(c)  each  couple  is  together? 

(d)  the  Noseworthys  are  beside  each  other  but  the 
Abbotts  are  not? 

(e)  at  least  one  couple  is  together? 

(f)  exactly  one  couple  is  together? 

13.  Repeat  Exercise  12,  assuming,  in  each  part,  that  a  dog 
also  forms  part  of  the  line. 

14.  Three  couples,  the  Smiths,  Joneses,  and  Murphys,  are 
going  to  form  a  line. 

(a)  [BB]  In  how  many  such  lines  will  Mr.  and 
Mrs.  Jones  be  next  to  each  other? 

(b)  In  how  many  such  lines  will  Mr.  and  Mrs.  Jones 
be  next  to  each  other  and  Mr.  and  Mrs.  Murphy  be 
next  to  each  other? 

(c)  In  how  many  such  lines  will  at  least  one  couple  be 
next  to  each  other? 

15.  How  many  permutations  of  the  letters  a,  b,  c,  d ,  <?,  /,  g 
have  either  two  or  three  letters  between  a  and  bl 

16.  (a)  [BB]  How  many  seven-digit  numbers  have  no 

repeated  digits? 
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(b)  How  many  seven-digit  numbers  with  no  repeated 
digits  contain  a  3  but  not  a  6? 

(Leading  zeros  are  not  permitted  in  either  part  of  this 
question.) 

17.  (a)  In  how  many  numbers  with  seven  distinct  digits  do 
only  the  digits  1-9  appear? 

(b)  [BB]  How  many  of  the  numbers  in  (a)  contain  a  3 
and  a  6? 

(c)  In  how  many  of  the  numbers  in  (a)  do  3  and  6  occur 
consecutively  (in  either  order)? 


(d)  [BB]  How  many  of  the  numbers  in  (a)  contain  nei¬ 
ther  a  3  nor  a  6? 

(e)  How  many  of  the  numbers  in  (a)  contain  a  3  but  not 
a  6? 

(f)  In  how  many  of  the  numbers  in  (a)  do  exactly  one 
of  the  numbers  3,  6  appear? 

(g)  In  how  many  of  the  numbers  in  (a)  do  neither  of  the 
consecutive  pairs  36  and  63  appear? 

18.  Let  X  and  Y  be  finite  nonempty  sets,  |X|  =  m,  |K|  = 
n  <  m.  Let  fin,  m)  denote  the  number  of  partitions  of 
X  into  n  subsets.  Prove  that  the  number  of  surjective 
functions  X  ->  Y  is  n\f(n,  m). 


7.2  Combinations 

There  are  12  different  kinds  of  drinks  for  sale  in  a  store  and  a  customer  wants 
to  buy  two  different  kinds.  How  many  choices  does  this  person  have?  There  are 
12  women  and  20  men  in  a  club,  five  of  whom  will  be  chosen  to  organize  a 
Christmas  party.  In  how  many  ways  can  these  five  people  be  chosen?  If  a  coin  is 
tossed  eight  times,  in  how  many  ways  can  exactly  five  heads  turn  up?  These  are 
the  sorts  of  questions  which  we  answer  in  this  section. 

In  the  last  section,  we  found  that  the  number  of  ways  to  place  three  different 
colored  marbles  into  ten  numbered  boxes,  at  most  one  to  a  box,  is  P(10,  3)  =  720. 
Suppose  the  marbles  have  the  same  color.  Are  there  still  720  possibilities?  In  fact, 
the  number  is  much  smaller.  When  there  are  red,  blue,  and  green  marbles,  for 
each  choice  of  boxes,  there  are  3!  =  6  different  ways  of  assigning  the  marbles  to 
them  corresponding  to  the  six  permutations  of  “red,”  “blue,”  “green.”  Figure  7.1 
illustrates  the  six  ways  the  marbles  can  be  assigned  to  boxes  3,  4,  and  8,  for 
instance.  When  the  marbles  have  the  same  color,  these  six  assignments  are  all  the 
same. 


BOX  1 

BOX  2  BOX  3 

BOX  4  •  • 

BOX  8 

rbg 

red 

blue 

green 

rgb 

red 

green 

blue 

brg 

blue 

red 

green 

bgr 

blue 

green 

red 

grb 

green 

red 

blue 

gbr 

green 

blue 

red 

Figure  7.1  There  are  31  =  6  ways  to  assign  three  different  colored  marbles  to  three  boxes. 


It  is  useful  to  notice  that  when  the  marbles  have  the  same  color,  each  config¬ 
uration  of  marbles  in  boxes  simply  amounts  to  a  choice  of  three  boxes.  Thus,  the 
number  of  ways  to  put  three  identical  marbles  into  ten  boxes  is  just  the  number 
of  ways  to  select  three  boxes  out  of  ten.  Furthermore,  for  a  given  three  boxes, 
there  are  3  x  2  x  1  =  3!  =  6  ways  in  which  three  marbles  of  different  colors 
can  be  assigned  to  these  boxes.  By  the  multiplication  rule,  the  number  of  ways 
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of  putting  three  marbles  of  different  colors  into  ten  boxes  (at  most  one  to  a  box) 
is  the  product  of  the  number  of  ways  of  selecting  three  boxes  and  the  number  of 
ways  then  to  assign  the  marbles  to  these  boxes. 

P(10,  3)  =  (number  of  ways  to  select  3  boxes  from  10)  x  3! 

To  summarize, 


Number  of  ways  to  select  three  boxes  from  ten 


=  number  of  ways  to  put  three  identical  marbles  into  ten  boxes 


P(  10,  3) 
3! 


10! 

7!3! 


=  120. 


7.2.1  DEFINITION 


For  integers  r  and  n,  n  >  0  and  0  <  r  <  n,  the  binomial  coefficient  (”)  (read  “n 
choose  r”)  is  defined  by 


(")  =  , 

\r)  r!(«  —  r)! 


2.2  PROPOSITION 


The  reason  for  the  name  “binomial  coefficient”  will  become  clear  in  Section  7.5. 
The  reason  for  saying  “n  choose  r"  is  given  in  the  following  proposition. 

Let  n  and  r  be  integers  with  n  >  0  and  0  <  r  <  n.  The  number  of  ways  to  choose 
r  objects  from  n  is  ("). 


Proof  If  r  —  0,  the  result  is  true  because  there  is  just  one  way  to  choose  0  objects  (do 
nothing!),  while  (”,)  =  0!(”10)!  =  1  because  0!  =  1.  Thus,  we  may  assume  that 
r  >  1  and  hence  n  >  1.  Let  N  be  the  number  we  are  seeking;  that  is,  there  are  N 
ways  to  choose  r  objects  from  the  n  given  objects.  Notice  that  for  each  way  of 
choosing  r  objects,  there  are  r!  ways  to  order  them.  By  the  multiplication  rule,  the 
number  of  r -permutations  of  n  objects  [which  we  know  is  Pin,  r)]  is  the  number 
of  ways  to  choose  r  objects  multiplied  by  r\,  the  number  of  ways  to  order  the  r 
objects. 


P(n,  r)  =  N  x  r ! 


Therefore, 


N  = 


P(n,r ) 

7\ 


n\ 

r\(n  —  r)\ 


which  is  what  we  wanted  to  show. 


Explain  why 


I 


PROBLEM  7.  Wanda  is  going  to  toss  a  coin  eight  times.  In  how  many  ways 
can  she  get  five  heads  and  three  tails? 
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7.2.3  DEFINITIONS 


7.2.4  COROLLARY 


EXAMPLES  8 


Solution.  Wanda  might  get  a  string  of  five  heads  followed  by  three  tails 
(denote  this  possibility  HHHHH  III);  or  a  string  of  three  tails  followed  by  five 
heads,  I  I  I  HHHHH;  or  the  sequence  HTHHTHTH;  and  so  on.  The  number  of 
such  sequences  is  the  number  of  ways  of  selecting  five  occasions  (from  the 
eight)  on  which  the  heads  should  arise  or,  equivalently,  the  number  of  ways 
of  selecting  the  three  occasions  on  which  tails  should  come  up.  The  answer  is 
(58)  =  (38)  =  56.  I 

As  is  so  often  the  case  in  mathematics,  in  Proposition  7.2.2,  we  solved  one 
problem  (in  how  many  ways  r  objects  can  be  selected  from  «?)  by  relating  it  to 
another  problem  for  which  we  knew  the  answer  (how  many  /--permutations  of  n 
objects  are  there?).  This  is  an  extremely  useful  and  important  idea. 

A  combination  of  a  set  of  objects  is  a  subset  of  them.  A  subset  of  r  objects  is 
called  an  r -combination  or  a  combination  of  the  objects  taken  r  at  a  time. 

Thus, 


The  number  of  /  -combinations  of  n  objects  is 


The  number  of  2-combinations  of  the  digits  0,  1 ,  . . .  ,  9  is 


/10\  _  10! 
V2/  “  2!8!  “ 


10-9 


=  45. 


The  number  of  combinations  of  letters  of  the  alphabet  taken  six  at  a  time  is 

26  •  25  •  24  •  23  •  22  ■  21 


/26\  _  26! 

\  6  /  _  6!20!  “ 


=  230,230. 


6  •  5  •  4  •  3  •  2  •  1 

The  number  of  ways  to  put  three  identical  marbles  into  ten  boxes  is  the 
number  of  ways  of  selecting  three  of  ten  boxes: 

/10\  10!  10-9-8 

\  3  )  ~  3!7!  “  3-2 


=  120. 


The  number  of  ways  to  choose  two  kinds  of  drinks  from  a  dozen  different 
kinds  is 


(2)  - 


12! 


12  •  11 


=  66. 


2!  10!  2 

The  number  of  ways  to  choose  five  people  from  a  group  of  32  is 
32!  32-31 -30-29-28 


O- 


5!27! 


5-4-3-21 


=  201,376. 


The  distinction  between  permutation  and  combination  is  the  distinction 
between  order  and  selection.  Box  1,  box  2,  box  4  is  one  combination  of  boxes; 
box  2,  box  1 ,  box  4  is  the  same  combination,  but  a  different  permutation.  A 
permutation  takes  order  into  account;  a  combination  involves  only  selection. 
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There  are  (35“)  =  201,376  ways  to  choose  five  people  from  32,  but  5!  x 
201,376  =  P( 32,  5)  lines  of  five  people  which  can  be  formed  from  a  group  of 
32,  a  much  larger  number,  because  order  is  important  in  a  line.  The  line  Ruby, 
Mary,  Colin,  Tom,  Eileen  is  different  from  the  line  Mary,  Tom,  Eileen,  Ruby, 
Colin,  although  these  two  selections  of  people  are  the  same. 

Mr.  Hiscock  has  ten  children  but  his  car  holds  only  five  people  (including  driver). 
When  he  goes  to  the  circus,  in  how  many  ways  can  he  select  four  children  to 
accompany  him? 


PROBLEM  9.  In  how  many  ways  can  20  students  out  of  a  class  of  32  be 
chosen  to  attend  class  on  a  late  Thursday  afternoon  (and  take  notes  for  the 
others)  if 

(a)  Paul  refuses  to  go  to  class? 

(b)  Michelle  insists  on  going? 

(c)  Jim  and  Michelle  insist  on  going? 

(d)  either  Jim  or  Michelle  (or  both)  go  to  class? 

(e)  just  one  of  Jim  and  Michelle  attend? 

(f)  Paul  and  Michelle  refuse  to  attend  class  together? 

Solution. 

(a)  The  answer  is  (3(,)  =  84,672,315  since,  in  effect,  it  is  necessary  to  select 
20  students  from  the  3 1  students  excluding  Paul. 

(b)  Now  the  number  of  possibilities  is  ('3g)  =  141,120,525  since  19  students 
must  be  chosen  from  3 1 . 

(c)  The  answer  is  (jjj)  =  86,493,225,  it  being  necessary  to  choose  the  remaining 
1 8  students  from  a  group  of  30. 

(d)  Let  J  be  the  set  of  classes  of  20  which  contain  Jim  and  M  the  set  of  classes 
of  20  which  contain  Michelle.  The  question  asks  for  \J  U  M\.  Using  the 
Principle  of  Inclusion-Exclusion,  we  obtain  |/UM|  =  \  J\  +  \M\  —  \J  C\M\  = 
(19)  +  (19)  -  (is)  =  195,747,825. 

An  alternative  method  of  obtaining  this  answer  would  be  to  count  separately 
the  cases — Jim  (but  not  Michelle)  goes  to  class,  Michelle  (but  not  Jim)  goes 
to  class,  both  Jim  and  Michelle  go  to  class — and  add  the  results.  (This 
follows  because  \J  U  M\  =  \J  \  M\  +  \M  \  J\  +  \J  D  M |.)  We  obtain 

(19)  +  (19)  +  (is)  =  2(54,627,300)  +  86,493,225  =  195,747,825,  as  before. 
Yet  another  method  would  be  to  subtract  from  the  total  number  of  possible 
classes  of  20  the  number  containing  neither  Jim  nor  Michelle.  This  gives 

(20)  ~  (20)  =  225,792,840  -  30,045,015  =  195,747,825. 

(e)  Using  the  formula  \J  @  M\  =  |/|  +  \M\  —  2\J  n  M\,  we  obtain  (^)  +  (3g)  — 
2(|°)  =  109,254,600. 

(f)  The  number  of  classes  containing  Paul  and  Michelle  is  (3g)  by  part  (c),  so 
the  number  which  do  not  contain  both  is  (jq)  —  (3g)  =  139,299,615. 
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Alternatively,  observe  that  this  part  asks  for  \(P  fl  M)c\  =  \PC  U  Mc\  (by 
one  of  the  laws  of  De  Morgan).  So  the  answer  is  |/>r|  +  \  MC\  —  \PC  DMC\  = 


©  +  Q-Q  =  >39,299.615. 


■ 


PROBLEM  10.  How  many  committees  of  five  people  can  be  chosen  from  20 
men  and  12  women 

(a)  if  exactly  three  men  must  be  on  each  committee? 

(b)  if  at  least  four  women  must  be  on  each  committee? 

Solution. 

(a)  We  must  choose  three  men  from  20  and  then  two  women  from  12.  The 
answer  is  (230)('22)  =  1140(66)  =  75,240. 

(b)  We  calculate  the  cases  of  four  women  and  five  women  separately  and  add 

the  results  (using  the  addition  rule).  The  answer  is  (’42)(20)  +  (g2)(^)  = 
495(20)4-792=  10,692.  I 

The  combinatorial  symbols  we  have  presented  in  Sections  7.1  and  7.2  are 
summarized,  together  with  their  interpretations,  in  Table  7.1. 


Table  7.1  Some  combinatorial  symbols  and  their  uses. 


n !  •  number  of  permutations  of  n  distinct 

symbols 


P(n,r )  = - : — -  •  number  of  ways  to  put  r  different  colored 

■  ~  r’'  marbles  into  n  numbered  boxes,  at  most 
one  to  a  box 

.  number  of  permutations  of  n  distinct 
symbols  used  r  at  a  time 


n\  n ! 

r  )  r\(n  —  r ) ! 


.  number  of  ways  to  put  r  identical  marbles 
into  n  boxes,  at  most  one  to  a  box 


»  number  of  combinations  of  n  distinct 
symbols  used  r  at  a  time 


•  number  of  ways  to  choose  r  objects 
from  n 


Answers  to  Pauses 


5.  Suppose  we  have  n  white  marbles  and  we  wish  to  paint  r  of  them  black. 
Choosing  the  r  marbles  is  equivalent  to  choosing  the  n  —  r  marbles  which 
are  to  remain  white.  Thus,  each  choice  of  r  marbles  from  n  corresponds  to  a 
choice  of  the  remaining  n  -  r,  so  the  numbers  of  choices  are  the  same.  By 
Proposition  7.2.2,  these  are  (")  and  (n"r),  respectively. 
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6.  The  question  involves  choosing,  not  order.  There  are  (*°)  =  4^ j  =  210 
different  ways. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  A  group  of  people  is  comprised  of  six  from 
Nebraska,  seven  from  Idaho,  and  eight  from  Louisiana. 

(a)  In  how  many  ways  can  a  committee  of  six  be  formed 
with  two  people  from  each  state? 

(b)  In  how  many  ways  can  a  committee  of  seven  be 
formed  with  at  least  two  people  from  each  state? 

2.  In  how  many  ways  can  12  players  be  divided  into  two 
teams  of  six  for  a  game  of  street  hockey? 

3.  [BB]  How  many  12-digit  0-1  strings  contain  precisely 
five  l’s? 

4.  How  many  different  signals,  each  consisting  of  seven 
flags  arranged  in  a  column,  can  be  formed  from  three 
identical  red  flags  and  four  identical  blue  flags? 

5.  [BB]  In  how  many  ways  can  Tom,  Billie,  and  Peter  share 
15  salmon  of  different  sizes 

(a)  if  each  takes  five? 

(b)  if  the  youngest  boy  takes  seven  salmon  and  the  oth¬ 
ers  each  take  four? 

(c)  if  one  boy  takes  seven  salmon  and  the  others  each 
take  four? 

6.  [BB]  How  many  subsets  of  a  set  of  ten  apples  contain 
at  most  three  apples? 

7.  In  a  popular  lottery  known  as  “Lotto  6/49,”  a  player 
marks  a  card  with  six  different  numbers  from  the  inte¬ 
gers  1^19  and  wins  if  his  or  her  numbers  match  six 
randomly  selected  such  numbers. 

(a)  In  how  many  ways  can  a  player  complete  a  game 
card? 

(b)  How  many  cards  should  you  complete  in  order  to 
have  at  least  one  chance  in  a  million  of  winning? 

(c)  In  how  many  ways  can  a  player  complete  a  game 
card  so  that  no  number  matches  any  of  the  six 
selected? 

(d)  In  how  many  ways  can  a  player  complete  a  game 
card  so  that  at  least  one  number  matches  one  of  the 
six  selected? 

8.  (a)  [BB]  How  many  five-card  hands  dealt  from  a  stan¬ 

dard  deck  of  52  playing  cards  are  all  of  the  same  suit? 


(b)  How  many  five-card  hands  contain  exactly  two 
aces? 

9.  A  newcomers’  club  of  30  people  wants  to  choose  an 
executive  board  consisting  of  president,  secretary,  trea¬ 
surer,  and  two  other  officers.  In  how  many  ways  can 
this  be  accomplished? 

10.  [BB]  A  coin  is  tossed  ten  times  and  the  sequence  of 
heads  and  tails  observed. 

(a)  How  many  different  sequences  are  possible? 

(b)  In  how  many  of  these  sequences  are  there  exactly 
four  heads? 

11.  [BB]  An  urn  contains  15  red  numbered  balls  and  ten 
white  numbered  balls.  A  sample  of  five  balls  is  selected. 

(a)  How  many  different  samples  are  possible? 

(b)  How  many  samples  contain  all  red  balls? 

(c)  How  many  samples  contain  three  red  balls  and  two 
white  balls? 

12.  A  group  of  eight  scientists  is  composed  of  five  mathe¬ 
maticians  and  three  geologists. 

(a)  In  how  many  ways  can  five  people  be  chosen  to 
visit  an  oil  rig? 

(b)  Suppose  the  five  people  chosen  to  visit  the  rig  must 
be  comprised  of  three  mathematicians  and  two  geol¬ 
ogists.  Now  in  how  many  ways  can  the  group  be 
chosen? 

13.  [BB]  From  a  hundred  used  cars  sitting  on  a  lot,  20  are 
to  be  selected  for  a  test  designed  to  check  certain  safety 
requirements.  These  cars  will  then  be  put  back  onto  the 
lot  and,  again,  20  will  be  selected  for  a  test  designed  to 
check  antipollution  standards. 

(a)  In  how  many  ways  can  the  first  selection  be  made? 

(b)  In  how  many  ways  can  the  second  selection  be 
made? 

(c)  In  how  many  ways  can  both  selections  be  made? 

(d)  In  how  many  ways  can  both  selections  be  made  if 
exactly  five  cars  are  to  undergo  both  tests? 

14.  In  how  many  ways  can  a  team  of  six  be  chosen  from 
20  players  so  as  to 
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(a)  include  both  the  strongest  and  the  weakest  player? 

(b)  include  the  strongest  but  exclude  the  weakest 
player? 

(c)  exclude  both  the  strongest  and  weakest  player? 

15.  A  woman  has  nine  close  friends. 

(a)  In  how  many  ways  can  she  invite  six  of  these  to 
dinner? 

(b)  Repeat  (a)  if  two  of  her  friends  are  divorced  (from 
each  other)  and  will  not  attend  together. 

(c)  Repeat  (a)  if  the  friends  consist  of  three  single  peo¬ 
ple  and  three  married  couples  and,  if  a  husband  or 
wife  is  invited,  the  spouse  must  be  invited  too. 

16.  The  Head  of  the  Department  of  Mathematical  Sciences 
at  a  certain  university  has  12  mathematicians,  seven 
computer  scientists,  and  three  statisticians  in  his  employ. 
He  wishes  to  appoint  some  committees  from  among 
these  22  people. 

(a)  How  many  five-member  committees  can  he  appoint? 

(b)  How  many  five-member  committees,  each  contain¬ 
ing  at  least  one  statistician,  can  he  appoint? 

(c)  A  certain  professor  of  mathematics,  Dr.  G,  and  a 
certain  colleague,  Dr.  P,  refuse  to  serve  together  on 
the  same  committee.  How  many  five-member  com¬ 
mittees  can  be  formed  so  as  not  to  contain  both 
Dr.  G  and  Dr.  P? 

(d)  How  many  five-member  committees  can  be  formed 
so  that  the  number  of  mathematicians  is  greater  than 
the  number  of  computer  scientists  and  the  number 
of  computer  scientists  is  greater  than  the  number  of 
statisticians? 

17.  A  student  must  answer  exactly  eight  questions  out  of 
ten  on  a  final  examination. 

(a)  [BB]  In  how  many  ways  can  she  choose  the  ques¬ 
tions  to  answer? 

(b)  Repeat  (a)  if  she  must  answer  the  first  three  ques¬ 
tions. 

(c)  [BB]  Repeat  (a)  if  she  must  answer  at  least  three  of 
the  last  five  questions  and  at  most  four  of  the  first 
five. 

(d)  Repeat  (a)  if  she  must  answer  at  least  three  of  the 
last  five  questions. 

18.  Suppose  U  is  a  set  of  52  elements  which  contains  five 
subsets  A i,  A2,  A3,  A 4,  A5  with  the  following  proper¬ 
ties: 

•  each  set  contains  23  elements; 

•  the  intersection  of  any  two  of  the  sets  contains  ten 
elements; 


•  the  intersection  of  any  three  of  the  sets  contains  four 
elements; 

.  the  intersection  of  any  four  of  the  sets  contains  one 
element; 

•  the  intersection  of  all  the  sets  is  empty. 

How  many  elements  belong  to  none  of  the  five  sub¬ 
sets? 

19.  Ten  points  in  the  plane,  no  three  collinear,  are  given. 

(a)  [BB]  How  many  different  line  segments  are  formed 
by  joining  pairs  of  these  points? 

(b)  How  many  different  triangles  are  formed  by  the  line 
segments  in  (a)? 

(c)  If  A  is  one  of  the  ten  points,  how  many  of  the  tri¬ 
angles  in  (b)  have  A  as  a  vertex? 

20.  (a)  [BB]  How  many  triangles  are  determined  by  the 

vertices  of  a  regular  1 2-sided  polygon? 

(b)  Repeat  (a)  if  the  sides  of  the  polygon  are  not  to  be 
the  sides  of  any  triangle. 

21.  A  diagonal  of  a  polygon  is  a  line  joining  two  nonadja- 
cent  vertices. 

(a)  [BB]  How  many  diagonals  does  an  octagon  have? 

(b)  Loonie  is  the  colloquial  name  for  the  Canadian 
one  dollar  coin.  It  has  the  shape  of  a  regular  11- 
sided  polygon.  How  many  diagonals  does  a  loonie 
have? 

22.  (a)  How  many  diagonals  does  a  regular  n-sided  polygon 

have? 

(b)  Which  regular  n-sided  polygon  has  three  times  as 
many  diagonals  as  sides? 

23.  [BB]  Prove  that  the  product  of  any  n  consecutive  natural 
numbers  is  divisible  by  n\. 

24.  (a)  Use  Definition  7.2.1  to  prove  that  (£)(” ~tk)  = 

(")("«/)’  where  n,  k,  and  l  are  natural  numbers  with 
k  +  t  <  n. 

(b)  Establish  the  identity  in  part  (a)  without  appealing 
to  Definition  7.2.1.  [Hint:  In  how  many  ways  can 
one  choose  two  teams,  one  of  size  k  and  the  other 
of  size  f ,  from  a  group  of  n  people?] 

25.  (a)  Let  k  and  n  be  natural  numbers  with  k  <  n.  Use 

the  definition  of  (")  given  in  7.2.1  to  prove  that 

©  =  (;:!)  +  (■;')■ 

(b)  Establish  the  identity  in  (a)  without  appealing  to 
any  definition. 

26.  (a)  Use  Definition  7.2.1  to  prove  that  (2")  =  2(")  +  n 2 

for  any  natural  number  n. 

(b)  Establish  the  identity  in  (a)  without  using  the  defi¬ 
nition. 
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7.3  Repetitions 

In  the  previous  two  sections  we  counted  the  number  of  ways  of  putting  r  marbles 
into  n  boxes  with  at  most  one  marble  to  a  box  in  two  cases:  The  marbles  have 
different  colors  and  the  marbles  are  all  the  same  color.  Suppose  we  allow  any 
number  of  marbles  in  a  box.  If  the  marbles  are  all  colored  differently,  then  there 
are  n  choices  for  the  first  marble,  n  for  the  second,  and  so  forth.  There  are, 
altogether,  n  x  n  x  •  •  •  x  n  =  nr  possibilities.  When  the  marbles  are  all  the  same 
color,  however,  we  expect  far  fewer  possibilities. 

Suppose  we  want  to  place  three  white  marbles  into  ten  boxes  and  we  are 
allowed  to  put  as  many  marbles  into  a  box  as  we  like.  There  are  three  mutually 
exclusive  ways  in  which  this  can  be  accomplished:  Each  marble  goes  into  a 
different  box  [in  one  of  ( 'j1)  =  120  ways],  two  marbles  go  into  the  same  box  but 
the  third  goes  into  its  own  box  (in  one  of  10  x  9  =  90  ways),  or  all  the  marbles 
go  into  the  same  box  (in  one  of  10  ways).  By  the  addition  rule,  the  number  of 
ways  in  all  is  120  +  90  +  10  =  220.  We  note  in  passing  that  220  =  (l0+33  l)- 

In  how  many  ways  can  ten  marbles,  all  of  the  same  color,  be  put  into  three 
boxes?  While  this  problem  is  obviously  similar  to  the  previous,  it  becomes  quickly 
apparent  that  a  case-by-case  examination  of  the  possibilities  is  very  difficult  this 
time.  There  are  three  ways  in  which  the  marbles  all  go  into  the  same  box,  but 
in  how  many  ways  can  they  be  placed  into  exactly  two  boxes,  or  into  exactly 
three?  The  question  seems  quite  complicated,  so  we  consider  it  from  a  new  per¬ 
spective. 

One  way  to  put  the  marbles  into  the  boxes  is  to  put  three  marbles  into  box  1, 
five  into  box  2,  and  two  into  box  3.  This  situation  can  be  represented  by  the  12- 
digit  string  000100000100,  which  has  strings  of  three,  five,  and  two  0’s  separated 
by  two  l’s.  There  is  a  one-to-one  correspondence  between  ways  of  putting  the 
marbles  into  the  boxes  and  12-digit  strings  consisting  of  two  l’s  and  ten  0’s.  The 
string  001000000100  would  represent  the  situation  of  two  marbles  in  box  1,  six 
in  box  2,  and  two  in  box  3.  The  string  100000010000  would  indicate  that  there 
are  no  marbles  in  box  1,  six  in  box  2,  and  four  in  box  3.  To  count  the  number  of 
ways  of  putting  ten  marbles  in  three  boxes  is,  therefore,  just  to  count  the  number 
of  12-digit  0-1  strings  which  contain  precisely  two  l’s.  This  latter  number  is 
easy  to  find!  There  are  (’22)  such  numbers  corresponding  to  the  number  of  ways 
of  selecting  two  positions  for  the  l’s  from  12  positions.  Therefore,  the  number 
of  ways  to  put  ten  identical  marbles  into  three  boxes,  any  number  to  a  box,  is 
(,2)  =  66  =  (3+Io_1)-  We  have  seen  two  instances  of  the  next  proposition. 


7.3.1  PROPOSITION 


The  number  of  ways  to  put  r  identical  marbles  into  n  boxes  is  (n+;r  '). 


PROBLEM  11.  Doughnuts  come  in  30  different  varieties  and  Catherine  wants 
to  buy  a  dozen.  How  many  choices  does  she  have? 

Solution.  Imagine  that  the  30  varieties  are  in  n  ~  30  boxes  labeled  “choco¬ 
late  white,”  “Boston  creme,”  “peanut  crunch,”  and  so  on.  Catherine  can  indicate 
her  choice  by  dropping  r  =  12  (identical)  marbles  into  the  boxes.  So  there  are 
(30+;22— ')  =  7,898,654,920  possibilities.  I 
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|  David  wants  to  buy  30  doughnuts  and  finds  just  12  varieties  available.  In  how 
many  ways  can  he  make  his  selection? 

The  basic  facts  about  marbles  and  boxes  which  we  have  discussed  so  far  are 
summarized  in  Table  7.2. 


Table  7.2  The  number  of  ways  to  put  r 
numbered  boxes. 

marbles  into  n 

Same  Color  All  Different  Colors 

At  most  one  to  a  box 

C) 

P(n,  r) 

Any  number  in  a  box 

fn  +  r  —  1  \ 

nr 

V  r  ) 

In  Section  7.1,  we  counted  the  number  of  ways  to  put  three  marbles  of  dif¬ 
ferent  colors  into  ten  boxes,  at  most  one  marble  to  a  box;  and,  in  Section  7.2,  we 
considered  the  same  problem  for  identical  marbles.  What  about  the  intermediate 
possibility  that  two  marbles  have  the  same  color,  say  red,  and  the  third  is  a  dif¬ 
ferent  color,  perhaps  blue?  In  how  many  ways  can  these  three  marbles  be  put  into 
ten  boxes,  at  most  one  to  a  box?  There  are  (1,°)  ways  in  which  to  pick  boxes  for 

the  red  marbles  and  then  (,)  ways  to  choose  the  box  for  the  blue  marble.  By  the 
multiplication  rule,  the  number  of  ways  to  put  two  reds  and  one  blue  marble  into 
ten  boxes,  at  most  one  to  a  box,  is 


10! 

2!1!7! 


360. 


The  number  of  ways  to  put  two  red,  four  blue,  and  three  green  marbles  into 
ten  boxes,  at  most  one  to  a  box,  is  ( l1°)  (choose  boxes  for  the  reds)  x(^)  (choose 
boxes  for  the  blues)  x(jj)  (choose  boxes  for  the  greens);  that  is, 

10!  8!  4!  10! 

2!8!  X  4!4!  X  30!  ~  2!4!3!1 ! ' 


PROBLEM  12.  Suppose  there  are  ten  players  to  be  assigned  to  three  teams,  the 
Xtreme,  the  Maniax,  and  the  Enforcers.  The  Xtreme  and  the  Maniax  are  to  receive 
four  players  each  and  the  Enforcers  are  to  receive  two.  In  how  many  ways  can 
this  be  done? 

Solution.  The  assignment  of  players  is  accomplished  by  choosing  four  players 
from  ten  for  the  Xtreme,  then  choosing  four  players  from  the  remaining  six  for  the 
Maniax,  and  assigning  the  remaining  two  players  to  the  Enforcers.  The  number 
of  possible  teams  is 


=  210(15)  =  3150. 


I 
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In  how  many  ways  can  14  men  be  divided  into  six  named  teams,  two  with  three 
players  and  four  with  two?  In  how  many  ways  can  14  men  be  divided  into  two 
unnamed  teams  of  three  and  four  teams  of  two? 

In  how  many  ways  can  the  letters  of  the  word  easy  be  rearranged?  The 
question  just  asks  for  the  number  of  permutations  of  four  different  letters:  The 
answer  is  4!  ==  24.  In  how  many  ways  can  the  letters  of  the  word  ease  be 
rearranged?  This  is  a  slightly  different  problem  because  of  the  repeated  e\  When 
the  first  and  last  letters  of  easy  are  interchanged,  we  get  two  different  arrangements 
of  the  four  letters,  but  when  the  first  and  last  letters  of  ease  are  interchanged  we 
get  the  same  word.  To  see  how  to  count  the  ways  in  which  the  four  letters  of  ease 
can  be  arranged,  we  imagine  the  list  of  all  these  arrangements. 

s  e  a  e 

a  e  e  s 

e  e  s  a 


Pretending  for  a  moment  that  the  two  e’s  are  different  (say  one  is  a  capital  E), 
then  each  “word”  in  this  list  will  produce  two  different  arrangements  of  the  letters 
e  a  s  E.  (See  Fig  7.2.) 


seae 

aees 

eesa 


< 

< 

< 


seaE 

sEae 

aeEs 

aEes 

eEsa 

Eesa 


Figure  7.2 


The  list  on  the  right  contains  the  4!  =  24  arrangements  of  the  four  letters 
e  a  s  E,  so  the  list  on  the  left  contains  half  as  many.  There  are  ||  =  12  ways  in 
which  the  letters  of  ease  can  be  arranged. 

PROBLEM  13.  In  how  many  ways  can  the  letters  of  the  word  attention  be 
rearranged? 

Solution.  The  word  attention  has  nine  letters,  three  of  one  kind,  two  of  another, 
and  four  other  different  letters.  The  number  of  rearrangements  of  this  word  is 
=  30,240.  I 


This  answer  is  obtained,  as  before,  by  imagining  the  list  of  arrangements 
and  imagining  how  many  arrangements  could  be  formed  if  the  letters  were  all 
different.  If  the  two  n’ s  were  different,  each  rearrangement  would  produce  two 
more,  giving  a  second  list  twice  as  long.  If  the  three  t’  s  were  different,  each 
“word”  in  this  second  list  would  yield  3!  =  6  more.  For  instance,  replacing  the 
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three  t's  by  /,  T,  and  r,  then  tanetoeti  yields 

taneToexi,  taneroeTi,  Tanetoexi,  Tanexoeti,  xanetoiTi,  xaneToiti 


corresponding  to  the  3!  permutations  of  t,  T,  x. 

We  would  obtain  a  third  list  3!  times  as  long  as  the  second  and  3!  x  2  times 
as  long  as  the  first.  Since  this  third  list  consists  of  all  the  permutations  of  the  nine 
symbols  a,  t,  T,  e,  n,  x,  i,  o,  N,  it  contains  9!  “words,”  so  the  original  list  contained 
“words.” 

In  how  many  ways  can  the  letters  of  the  word  REARRANGE  be  rearranged? 


Answers  to  Pauses 


7.  As  in  Problem  11,  we  imagine  the  12  varieties  in  boxes.  Each  of  David’s 
possible  decisions  can  be  indicated  by  dropping  30  identical  marbles  into 
these  boxes.  There  are  (12+31 2 3o_1)  =  3,159,461,968  possibilities  (r  =  30, 
n  =  12). 

8.  If  the  teams  are  named,  the  answer  is 


(j4)  x  (‘3')  x  (l)  x  (2)  x  (4)  =  364(1 65) (28) (15) (6)  =  151,351,200. 

This  number  is  2!4!  =  48  times  the  number  of  divisions  into  unnamed  teams 
of  sizes  3,  3,  2,  2,  2,  and  2  because,  for  each  division  into  unnamed  teams, 
there  are  two  ways  to  name  the  teams  of  three  and  4!  ways  to  name  the  teams 
of  two.  The  answer  in  the  case  of  unnamed  teams  is  151,345g  ,200  =  3,153, 150. 

9.  There  are  =  15,120  rearrangements  of  the  letters  of  the  word  REAR¬ 
RANGE. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  In  how  many  ways  can  30  identical  dolls  be  placed 
on  seven  different  shelves? 

2.  There  are  15  questions  on  a  multiple-choice  exam  and 
five  possible  answers  to  each  question. 

(a)  In  how  many  ways  can  the  exam  be  answered? 

(b)  In  how  many  ways  can  the  exam  be  answered  with 
exactly  eight  answers  correct? 

3.  [BB]  How  many  different  outcomes  are  possible  when 
five  dice  are  rolled?  (Two  6’s,  two  5’s,  and  a  1  is  one 
outcome.) 

4.  [BB]  How  many  different  collections  of  ten  coins  can 
be  made  from  pennies,  nickels,  dimes,  and  quarters? 

5.  A  florist  sells  roses  in  five  different  colors. 

(a)  How  many  bunches  of  a  half  dozen  roses  can  be 
formed? 


(b)  How  many  bunches  of  a  half  dozen  can  be  formed 
if  each  bunch  must  contain  at  least  one  rose  of  each 
color? 

6.  There  are  20  varieties  of  chocolates  available  and  Linda 
wants  to  buy  eight  chocolates. 

(a)  [BB]  How  many  choices  does  she  have? 

(b)  How  many  choices  does  she  have  if  her  boyfriend 
insists  that  at  least  one  chocolate  should  have  a 
cherry  center? 

(c)  How  many  choices  does  she  have  if  there  remain 
only  two  chocolates  with  caramel  centers?  (At 
least  20  chocolates  of  all  other  varieties  are  avail¬ 
able.) 

7.  (a)  In  how  many  ways  can  five  different  mathemat¬ 

ics  books,  three  different  physics  books,  and  four 
different  chemistry  books  be  arranged  on  a  shelf? 
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(b)  Repeat  (a)  if  all  books  of  the  same  subject  are  to  be 
together. 

(c)  [BB]  Repeat  (a)  if  three  of  the  five  mathematics 
books  are  the  same. 

(d)  Repeat  (c)  if,  in  addition,  all  the  physics  books  are 
the  same. 

8.  [BB]  In  how  many  ways  can  12  people  form  four  groups 
of  three  if 

fa)  the  groups  have  names? 

(b)  the  groups  are  unnamed? 

9.  In  how  many  ways  can  18  different  books  be  given  to 
Tara,  Danny,  Shannon,  and  Mike  so  that  one  person  has 
six  books,  one  has  two  books,  and  the  other  two  people 
have  five  books  each? 

10.  (a)  Twenty  basketball  players  are  going  to  be  drafted  by 

the  professional  basketball  teams  in  Philadelphia, 
Boston,  Miami,  and  Toronto  such  that  each  team 
drafts  five  players.  In  how  many  ways  can  this  be 
accomplished? 

(b)  In  how  many  ways  can  20  players  be  divided  into 
four  unnamed  teams  of  five  players  each? 

11.  Suppose  there  are  12  baseball  players  to  be  drafted  by 
San  Diego,  Houston,  and  Toronto.  If  San  Diego  is  to  get 
six  players,  Houston  four  players,  and  Toronto  two,  in 
how  many  ways  can  this  be  accomplished? 

12.  (a)  [BB]  In  how  many  ways  can  ten  red  balls,  ten  white 

balls,  and  ten  blue  balls  be  placed  in  60  different 
boxes,  at  most  one  ball  to  a  box? 

(b)  In  how  many  ways  can  ten  red  balls,  ten  white 
balls,  and  ten  blue  balls  be  placed  in  60  different 
boxes? 

13.  In  how  many  ways  can  two  white  rooks,  two  black  bish¬ 
ops,  eight  black  pawns,  and  eight  white  pawns  be  placed 
on  a  prescribed  20  squares  of  a  chess  board? 

14.  [BB]  A  committee  wishes  to  award  one  scholarship  of 
$10,000,  two  scholarships  of  $5000,  and  five  scholar¬ 
ships  of  $1000.  The  list  of  potential  award  winners  has 
been  narrowed  to  13  possibilities.  In  how  many  ways 
can  the  scholarships  be  awarded? 

15.  Find  the  number  of  arrangements  of  the  letters  of  each 
of  the  following  words: 

(a)  [BB]  SCIENTIFIC 

(b)  SASKATOON 

(c)  PICCININI 

(d)  CINCINNATI 


16.  [BB]  A  department  store  in  downtown  Victoria,  British 
Columbia,  has  30  flags  to  hang  along  its  roof  line  to 
celebrate  Queen  Victoria’s  birthday.  If  there  are  ten  red 
flags,  five  white  flags,  seven  yellow  flags,  and  eight  blue 
flags,  how  many  ways  can  the  flags  be  displayed  in  a 
row? 

17.  In  a  university  residence  there  are  five  single  rooms,  five 
doubles,  and  five  rooms  which  hold  three  students  each. 
In  how  many  ways  can  30  students  be  assigned  to  the 
15  rooms?  (All  rooms  are  numbered.) 

18.  (a)  Show  that  there  is  a  one-to-one  correspondence 

between  the  number  of  ways  to  put  ten  identical 
marbles  into  three  boxes  and  the  number  of  ordered 
triples  (jc,  y,  z)  of  nonnegative  integers  which  sat¬ 
isfy  x  +  y  +  z=  10. 

(b)  How  many  triples  (x,  y,  z)  of  nonnegative  integers 
satisfy  x  +  y  +  z  =  10? 

(c)  How  many  5-tuples  (x,y,z,u,v)  of  nonnegative 
integers  satisfy  x  +  y  +  z  +  u  +  v  =  19? 

19.  (a)  [BB]  Show  that  there  is  a  one-to-one  correspon¬ 

dence  between  the  solutions  to 

(*)  X]  +  X2  +  *3  +  *4  =  21 

with  x\ ,  xi,  x-},  X4  nonnegative  integers,  x\  >  8,  and 
solutions  to 

(**)  -*1  +  X2  +  X3  +  X4  =  13 

with  x\,X2,  *3,  X4  nonnegative  integers.  How  many 
solutions  are  there? 

(b)  Show  that  there  is  a  one-to-one  correspondence  bet¬ 
ween  the  solutions  to  (*)  with  x\  >  8,  X2  >  8,  and 
solutions  to 

(***)  X,  +X2  +X3  +X4  =  5 

with  x\ ,  X2,  JC3,  X4  nonnegative  integers.  How  many 
solutions  are  there? 

(c)  How  many  solutions  are  there  to  (*)  with  all  vari¬ 
ables  nonnegative  integers  not  exceeding  7?  [Hint: 
Let  A,  be  the  set  of  solutions  to  (*)  with 

Xj  >  8.] 

(d)  How  many  solutions  (x\ ,  *2,  *3.  *4)  are  there  to  the 
equation 

x 1  +  X2  +  X3  +  X4  =  35 
with  all  xi  integers,  0  <  *,•  <  10? 

20.  How  many  integer  solutions  are  there  to  the  equation 
x  +  y  +  z  +  w  =  20  subject  to  x  >  1 ,  y  >  2,  z  >  3,  and 

w  >  4? 
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7.4  Derangements 

There  are  four  houses  on  a  street.  A  mischievous  postman  arrives  with  one  letter 
addressed  to  each  house.  In  how  many  ways  can  he  deliver  the  letters,  one  to 
each  house,  so  that  no  letter  arrives  at  the  correct  house?  The  person  in  charge  of 
coats  at  the  arts  center  loses  all  claim  checks  and  returns  coats  randomly  to  the 
patrons.  What  are  the  chances  that  nobody  gets  their  own  coat? 


7.4.1  DEFINITION 


A  derangement  of  n  distinct  symbols  which  have  some  natural  order  is  a  permu¬ 
tation  in  which  no  symbol  is  in  its  correct  position.  The  number  of  derangements 
of  n  symbols  is  denoted  Dn. 


EXAMPLES  14 


There  is  just  one  derangement  of  the  symbols  1,  2  (namely,  21),  so  Di  =  1. 
There  are  two  derangements  of  1,  2,  3  (312  and  231),  so  Dj,  =  2. 

There  are  nine  derangements  of  1 , 2,  3.  4, 


2341 

2413 

2143 

3142 

3412 

3421 

4123 

4312 

4321 

so  Z?4  =  9.  There  are  nine  ways  our  mischievous  postman  can  deliver  four 
letters,  each  to  the  wrong  address.  ▲ 


Finding  a  general  formula  for  Dn  involves  a  nice  application  of  the  Principle 
of  Inclusion-Exclusion,  binomial  coefficients,  and  a  famous  irrational  number.  We 
consider  again  £>4,  the  number  of  derangements  of  1,  2,  3,  4. 

Let  A 1  be  the  set  of  permutations  of  1,  2,  3,  4  in  which  the  number  1  is  in  the 
first  position.  Let  A2  be  the  set  of  permutations  of  1 . 2,  3,  4  in  which  the  number 
2  is  in  the  second  position.  Define  A?  and  A4  similarly.  The  set  of  permutations 
in  which  at  least  one  of  the  four  numbers  is  left  in  its  natural  position  is  then 
Ai  U  A2  U  A3  U  A4  and  the  complement  of  this  set  consists  precisely  of  the 
derangements  of  1,  2,  3,  4.  Since  there  are  4!  =  24  permutations  altogether,  the 
number  of  derangements  of  four  symbols  is 

D4  =  4!  —  |A|  U  A2  U  A3  U  A4|. 

How  many  elements  are  in  the  union  A\  U  A2  U  A3  U  A4?  By  the  Inclusion- 
Exclusion  Principle, 


(1)  |A,  UA2UA3UA4I  =  X)|A/|-X;|AfnA;|+  12  \Air\AjnAk\ 

i  i <j  i <j<k 

—  I Aj  n  a2  n  a 3  n  a4|. 

Now  | Ai|  =3!  (1  is  in  the  first  position,  numbers  2,  3,  and  4  go  to  any  of  the 
next  three  positions).  Similarly,  |A,  |  =  3!  for  any  i. 

The  set  Ai  fl  Ao  contains  those  permutations  of  1,  2,  3,  4  in  which  1  and  2 
are  in  the  correct  positions.  There  are  just  two  such  permutations,  1234  and  1243. 
Thus,  |A,-  n  A/|  =  2!  for  each  of  the  6  =  (2)  terms  in  the  second  sum  on  the  right 
of  equation  1. 


7.4.2  PROPOSITION 
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Similar  reasoning  shows  that  each  of  the  4  =  (3)  terms  |A,  D  Aj  fl  A*|  equals 
1,  as  does  the  last  term.  Therefore, 


| At  UA2UA3UA4I  =4(3!)  - 


(2!)+  f  (1!)-1 


and  hence. 


Z>4  =  4!  -  (4(3!)  -  (2)2!  +  (3)1!  -  l) 


4!  4! 

=  4!  -4!  + - 2! - hi 

2!2!  3! 

4!  4!  4! 

=  4!  —  4!  + - +  — 

2!  3!  4! 


=  4!  ( 1  — -  +  - - -  +  — ) 

V  1!  2!  3!  4!/ 

/  1  1  1  \ 

=  24('-'  +  2-6  +  5) 

=  24-24  +  12  -4  +  1  =9. 


To  find  a  general  formula  for  Dn,  for  general  n,  we  mimic  this  calculation 
of  D4. 


The  number  of  derangements  of  n  >  1  ordered  symbols  is 


/ 

1  1 

1 

„  1  \ 

Dn  =  nil 

1  —  77  +  rr 

—  ~  +  • 

■  +  (— 1)  — 

\ 

1!  2! 

3! 

n!/ 

For  example, 

/  1  1  1  1  1  \ 

D5  =  5!(  1 - + - + - ) 

V  1!  2!  3!  4!  5 !/ 


=  5!  —  51+5-4-3  —  5-4  +  5  —  1  =  60  —  20  +  5—  1  =  44. 

Some  readers  may  have  encountered  the  number  e,  the  base  of  the  natural 
logarithm,  in  a  calculus  course.  The  Taylor  expansion  for  ex  is  the  formula 

„2  x3 


(2) 


e  =  H - 1 - h 

1!  2! 


3! 


+ 


which,  among  its  uses,  allows  us  to  approximate  various  powers  of  e.  To  approxi¬ 
mate  e  itself,  for  instance,  we  evaluate  the  first  several  terms  of  the  Taylor  expan¬ 
sion  for  ex  with  x  =  1: 


1  T 


T 


l4 


1 


1  1 


e  e  ^  1  TT  “f-  77  —  +  —  —  \  — 

1!  2!  3!  4!  2  6  24 


2.708. 
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Better  approximations  can  be  obtained  by  including  more  terms  of  the  Taylor 
expansion.  To  approximate  s/e,  we  have 

^  \n  s  -5  ( -5  )2  (.5)3 

v  1!  2!  3! 

=  1  +  .5  +  .125  +  .02083  %  1.646. 


With  x  =  —  1 ,  we  have 

1 


=  e 


1 


1-  —  +  +  (-l)'!  — 

1!  2!  3!  n! 


for  any  n  >  1. 

Comparison  with  the  formula  for  D„  given  in  Proposition  7.4.2  shows  that 
Dn  %  n!e~'  =  j-.1  Equivalently,  jjf  %  ~e  %  0.368,  an  observation  which  shows 
that  the  derangements  of  n  symbols  comprise  about  36.8%  of  all  the  permuta¬ 
tions  of  n  symbols.  This  percentage  can  also  be  viewed  as  the  probability  that  a 
randomly  selected  permutation  of  n  symbols  will  be  a  derangement.  It  is  curious 
that  for  any  n  >  5,  this  probability  is  36.8%  to  one  decimal  place.  The  chance 
that  a  postman,  delivering  letters  randomly,  will  deliver  six  letters  to  six  incorrect 
addresses  is  just  about  the  same  as  the  chance  that  he  will  deliver  50  letters  to 
incorrect  addresses. 


OiL 


Answers  to  Pauses 


10.  =  ^  =  0.375.  Even  for  n  =  4,  is  remarkably  close  to 

jf-  =  -$$5  =  .36.  In  fact,  for  n  >  5.  =  .368 . . .  agrees  with  j  to  three 

decimal  places. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Find  Df,,  Dj,  and  Ds. 

2.  A  simple  code  is  made  by  permuting  the  letters  of  the 
alphabet  such  that  every  letter  is  replaced  by  a  different 
letter.  How  many  different  codes  can  be  made  in  this  way? 

3.  [BB]  Eleven  books  are  arranged  on  a  shelf  in  alphabet¬ 
ical  order  by  author  name.  In  how  many  ways  can  your 


little  sister  rearrange  these  books  so  that  no  book  is  in 
its  original  position? 

4.  Fifty  students  take  an  exam.  For  the  purposes  of  grading, 
the  teacher  asks  the  students  to  exchange  papers  so  that 
no  one  marks  his  or  her  own  paper.  In  how  many  ways 
can  this  be  accomplished? 


1  Students  who  have  studied  Taylor  polynomials  should  quickly  observe  that  the  error  in  approximating 
Dn  by  —  is  less  than 
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5.  Your  letter  carrier  has  a  drinking  problem.  When  she 
delivers  letters  to  the  seven  houses  in  your  block,  she 
delivers  completely  at  random.  Suppose  she  has  seven 
letters,  one  addressed  to  each  house,  and  she  delivers 
one  letter  to  each  house.  In  how  many  ways  can  this  be 
accomplished  if 

(a)  no  letter  arrives  at  the  right  house? 

(b)  [BB]  at  least  one  letter  arrives  at  the  right  house? 

(c)  all  letters  arrive  at  the  right  house? 

6.  Twenty  people  check  their  hats  at  a  theater.  In  how  many 
ways  can  their  hats  be  returned  so  that 

(a)  no  one  receives  his  or  her  own  hat? 

(b)  at  least  one  person  receives  his  or  her  own  hat? 

(c)  [BB]  exactly  one  person  receives  his  or  her  own 
hat? 

(d)  at  least  two  people  receive  their  own  hats? 

(e)  at  most  two  people  receive  their  own  hats? 

7.  (a)  [BB]  In  how  many  ways  can  the  integers  1  through 

9  be  permuted  such  that  no  odd  integer  will  be  in 
its  natural  position? 

(b)  In  how  many  ways  can  the  integers  1  through  9  be 
permuted  such  that  no  even  integer  is  in  its  natural 
position? 


(c)  In  how  many  ways  can  the  integers  1  through  9  be 
permuted  such  that  exactly  four  of  the  nine  integers 
are  in  their  natural  positions? 

8.  Without  any  calculation,  prove  that 


9.  Prove  Proposition  7.4.2. 

10.  (a)  [BB]  Prove  that  D„  =  (—1)"  (mod  n). 

(b)  [BB]  Prove  that  Dn  is  even  if  and  only  if  n  is  odd. 

11.  (a)  [BB]  Prove  that  D„  =  (n  —  l)(D„_i  +  Dn-i)  for 

all  n  >  3. 

(b)  Prove  that  D„  =  n  Dn _ j  +  (—!)"  for  all  n  >2. 

(c)  Use  (b)  to  find  Dn  and  to  six  decimal  places  for 
n  =  1,  2,  3,  . . .  ,  10.  What  is  the  approximation  to 
j  to  six  decimal  places? 

12.  While  the  concept  of  a  derangement  of  zero  objects  is 
rather  dubious,  the  formula  for  D„  given  by  Proposi¬ 
tion  7.4.2  does  make  sense  for  n  —  0  and  gives  £>o  =  1. 
Taking  Do  =  1,  show  that  the  generating  function  for 


7.5  The  Binomial  Theorem 

The  arrangement  of  integers  in  Fig  7.3  is  named  after  Blaise  Pascal  (1623-1662) 
because  of  the  many  applications  in  combinatorics  and  probability  which  Pascal 
found  for  it.2  Each  row  begins  and  ends  with  a  1;  each  other  number  in  a  row 
sits  between  two  consecutive  numbers  in  the  row  above  it  and  is  the  sum  of  these 
two.  For  example,  10  =  4  + 6,  6  =  1+5  and  21  =  15  +  6,  as  shown  in  the 
figure. 

This  famous  triangle  has  many  fascinating  properties.  For  example,  if  we 
number  the  rows  starting  at  zero  and  interpret  each  row  as  the  “digits”  of  a  single 
number  written  in  base  10, 3  then  row  k  of  the  triangle  is  11*.  For  example,  row 
zero  is  1  =  11°,  row  one  is  11  =  1 1 1 ,  row  two  is  121  =  1 1 2 ,  row  three  is 
1331  =  ll3,  and  row  five  is 

1  ( 1 05 )  +5(104)+  10(103)  +  10(102)  +  5(10')  +  1(10°)  =  ll5, 
and  so  on. 


2The  triangle  itself  was  known  in  India  as  early  as  the  third  century  B.C. 

3In  this  context,  by  the  “digits”  of  a  number,  we  mean  the  coefficients  required  to  write  the  number  as 
a  linear  combination  of  powers  of  10. 


232  Chapter  7  Permutations  and  Combinations 


1 

1 


1 

2 

1 

1 

3 

3 

1 

1 

® 

® 

4  1 

1  5 

© 

10 

5  1 

6 

15 

20 

©  ®  1 

7  21 

35 

35 

©  7  1 

Figure  7.3  Pascal’s  triangle. 


If,  beginning  at  each  initial  1,  we  follow 
the  diagonal  pattern  suggested  by  the  figure 
to  the  right,  adding  alternate  numbers  along 
the  way  (at  the  positions  denoted  •),  we  dis¬ 
cover  the  terms  of  the  Fibonacci  sequence. 

For  example,  after  two  l’s,  we  obtain  the 
sums  1  +  1  =2,  1+2  =  3,  1+3+1  =  5  (see  Fig  7.4),  1+4  +  3  =  8, 
1  +  5  +  6+1  =  13,  and  so  on. 


o''® 
Cl)  2  1 


o'1®'1 

Cl)  3  3 


1 

1 


0 

i 

4  1 


1+1=2 


1+2=3 


l+3+l=5 


Figure  7.4  The  Fibonacci  sequence,  hidden  within  Pascal’s  triangle. 


Our  major  interest  in  Pascal’s  triangle  concerns  its  connection  with  the  expan¬ 
sion  of  ( x  +  y)n,  for  the  numbers  in  row  n  of  the  triangle  give  the  requisite  coef¬ 
ficients.  As  shown  in  the  following  equations,  which  we  number  starting  with  0, 
the  coefficients  needed  to  expand  ( x  +  y)°  are  in  row  0.  In  row  1,  we  see  the 
coefficients  needed  to  expand  (x  +  y)1 ;  in  row  2,  the  coefficients  needed  to  expand 
(x  +  y)2;  and  so  forth. 


(x  +  y)°  =  1 

(x  +  y)1  =  x  +  y  =  lx+Iy 

(x  +  y)2  =  x2  +  2xy  +  y2  =  J_x2  +  2xy  +  iy2 
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(x  +  y)}  —  x 3  +  3  x2y  +  3  xy2  +  y3  =  lx 3  +  3 x2y  +  3 xy2  +  1  y3 

(x  +  y)4  =  X4  4-  4x3 y  +  6 x2y2  +  4xy3  +  y4 

=  lx4  +  4 x3y  +  6x2y2  +  4 xy3  +  l_y4 

How  might  we  obtain  the  expansion  of  (x  +  y) 1 1  ?  We  could  continue  to  build 
Pascal’s  triangle  as  far  as  row  1 1 ,  but  it  would  be  better  to  observe  that  the  numbers 
in  Pascal’s  triangle  are  just  the  values  of  the  binomial  coefficients  (”).  Figure  7.5 
shows  Pascal’s  triangle  again,  but  in  terms  of  these  combinatorial  symbols. 

0 

©  (!) 

(o)  ©  © 

0  ©  ©  © 

(4o)  ©  ©  0  © 

©  (i)  ©  ©  ©  © 

©  ©  ©  ©  ©  ©  © 

©  (I)  ©  ©  (I)  ©  ©  © 


Figure  7.5  A  useful  presentation  of  Pascal's  triangle. 

Note  that  (JJ)  =  (")  =  ~  =  1  because  0!  =  1,  so  each  row  begins  and  ends 
with  a  1  as  it  should.  The  reader  is  encouraged  to  evaluate  all  the  symbols  in 
this  second  presentation  of  Pascal’s  triangle  in  order  to  verify  that  they  do  indeed 
agree  with  the  numbers  in  Fig  7.3.  A  proof  that  this  correspondence  extends  to 
further  rows  of  the  triangle  depends  upon  the  identity 


(the  proof  of  which  was  Exercise  25,  Section  7.2)  and  is  left  to  the  exercises. 
The  advantage  in  identifying  the  entries  of  the  triangle  as  the  symbols  (”)  should 
be  apparent:  Any  particular  row  can  be  obtained  without  first  constructing  all  the 
rows  above  it.  Row  1 1 ,  for  instance,  which  gives  the  coefficients  needed  to  expand 
(x  +  y)11,  is 

(o).  (V).  (V).  (") .  O’  (!!) 

and  these  numbers  are,  respectively,  the  coefficients  of  the  terms 
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7.5.1  THE  BINOMIAL 
THEOREM 


Notice  that  each  of  these  terms  is  of  the  form  x 1  yJ ,  that  the  sum  i  +  j  of  the 
exponents  is  always  1 1 ,  and  that  the  exponent  of  x  decreases  one  at  a  time  from 
11  to  0  as  the  exponent  of  y  increases  from  0  to  11. 

For  any  x  and  y  and  any  natural  number  n. 


It  is,  of  course,  from  this  theorem  that  binomial  coefficients  take  their  name. 

While  the  Binomial  Theorem  can  be  proven  by  mathematical  induction,  there 
is  a  different  argument  which  explains  why  the  binomial  coefficients  ('^)  are 
involved.  Computing  (x  +  y )2  from  first  principles,  we  obtain 

(x  +  y)2  =  (x  +  y)(x  +  y) 

=  x  (x  +  y)  +  y(x  +  y) 

=  xx  +  xv  +  yx  4-  yy  =  x2  +  2xy  +  y2. 

Computing  (x  +  y)3  the  same  way,  we  have 

(x  +  y)3  =  (x  +  y)(x  +  y)(x  +  y) 

=  (x  +  y)(xx  +  xy  +  yx  +  yy) 

=  xxx  +  xxy  +  xyx  +  x  yy  +  yxx  +  yxy  +  yyx  +  yyy 
=  x3  +  3x2y  +  3xy2  +  y3. 

In  the  third  line  of  this  expansion,  each  of  the  8  =  23  terms  xxx,  xxy,  and  so 
on,  corresponds  to  the  selection  of  an  x  or  a  y  from  each  of  the  original  three 
factors  x  +  y.  Since  xxx  arises  only  when  an  x  is  selected  from  each  factor, 
there  is  just  one  x3  term  in  the  final  expansion.  The  coefficient  of  x2y  in  the 
final  expansion  is  3  because  there  are  three  terms  involving  two  x’s  and  one 
y;  namely,  xxy,  xyx,  and  yxx.  The  reason  why  there  are  three  terms  is  that 
there  are  three  ways  to  select  one  y  (and  hence  two  x’s)  from  the  three  factors 
x  +  y. 

In  the  full  expansion  of  (x  +  y)",  before  any  simplification  and  before  like 
terms  are  collected,  there  are  2n  terms.  Each  such  term  is  a  string  of  x’s  and 
y’s,  n  symbols  in  all,  and  corresponds  to  a  selection  of  x  or  y  from  each  of  the 
n  factors  x  +  y.  Any  term  containing  k  y’s  (and  hence  n  —  k  x’s)  simplifies  to 
xn~kyk  -phe  eventual  coefficient  of  xn~kyk  is  the  number  of  terms  which  simplify 
to  xn~kyk,  this  number  being  (j.)  because  there  are  (”)  ways  to  select  k  factors 
for  the  v’s. 

We  conclude  this  section  with  a  sampling  of  problems  which  can  be  solved 
using  the  Binomial  Theorem. 
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PROBLEM  15.  Using  the  Binomial  Theorem,  expand  (3x2+2_y)5  and  simplify. 

Solution. 

(3x2  +  2y)5  =  (3x2)5  +  Q(3x2)4(2y)  +  Q(3x2)3(2y)2 

+  Q(3x2)2(2y)3  +  Q(3x2)(2y)4  +  (2y)5 
=  243x10  +  5(81x8)(2y)  +  10(27x6)(4y2)  +  10(9x4)(8y3) 

+  5(3x2)(16y4)  +  32y5 

=  243x10  +  810x8y  +  1080x6y2  +  720x4y3  +  240x2y4  +  32y5  I 


/  4\6 

PROBLEM  16.  Using  the  Binomial  Theorem,  expand  (x - J  and  simplify. 

Solution. 


c /  4\  i/16\  ,  / 

64  \ 

9 /256\ 

6x5  (  —  )  +  1 5x4  (  — )  -F  20x3  ( 

-3) 

+  15x2( — j-j 

V  x>  \x2'  V 

x3/ 

V  x4  / 

/  1 024  \  4096 

+  6X(  xO+x* 

4  2  3840 

6144 

4096 

24x4  +  240x2  1280+  , 

xz 

X4 

+  x<> 

PROBLEM  17.  Find  the  coefficient  of  x16  in  the  expansion  of  ^2x2  —  . 

Solution.  The  general  term  in  the  expansion  of  this  expression  by  the  Binomial 
Theorem  is 


24 -k 


We  want  24  —  k  =  16;  thus,  k  =  8.  The  coefficient  is  (g"")24(— ^)8 

495 
16  • 


- 

—  16  V  8  / 


I 


PROBLEM  18.  Prove  that 
numbers  n. 


=  2”  for  all  natural 
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Solution. 

(i  + 1  r  = 


Consider  (x  +  y)n  =  y  (  \xn  kyk .  Setting  x  =  y  =  1,  we  obtain 
k= 0  W 

y,  i«-<: i&;  tjjat  js  2”  =  y  as  desired.  I 

k= 0  yt=0 


EXERCISES 


77ze  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  For  each  of  the  following,  expand  using  the  Bino¬ 
mial  Theorem  and  simplify. 

(a)  (x  +  y)6 

(b)  (2x  +  3 y)6 

2.  Use  the  Binomial  Theorem  to  expand  (a+4b)5.  Simplify 
your  answer. 

3.  For  each  of  the  following,  expand  using  the  Binomial 
Theorem  and  simplify. 

(a)  (2x3  —  y2)8 

(b)  [BB]  (2x3  -x2)8 

(c)  y  -  rd* 

4.  [BB]  Find  the  fourth  term  in  the  binomial  expansion  of 

(x3  -  2y2)12. 

5.  Consider  the  binomial  expansion  of  ( x  +  y)20. 

(a)  What  are  the  first  three  terms? 

(b)  What  are  the  last  three  terms? 

(c)  [BB]  What  is  the  seventh  term?  The  fifteenth  term? 

(d)  What  is  the  coefficient  of  xl3y7? 

6.  [BB]  Consider  the  binomial  expansion  of  (2x  —  y)16. 

(a)  How  many  terms  are  there  altogether? 

(b)  Is  there  a  middle  term  or  are  there  two  middle  terms? 
Find  and  explain. 

7.  Consider  the  binomial  expansion  of  (x  +  3y)17. 

(a)  How  many  terms  are  there  altogether? 

(b)  Is  there  a  middle  term  or  are  there  two  middle  terms? 
Find  and  explain. 

8.  [BB]  Find  the  coefficient  of  x3v7  in  the  binomial  expan¬ 
sion  of  (4x  +  5v)10. 


9.  [BB]  Find  the  coefficient  of  the  term  containing  y8  in 
the  binomial  expansion  of  (x  +  3 y2)17. 

10.  What  is  the  coefficient  of  x5  in  the  binomial  expansion 
of  (x  -  2x~2)20? 

11.  [BB]  What  is  the  coefficient  of  x27  in  the  binomial 

/  3  ,\18 

expansion  of  ^ — bx  j  ? 

12.  Find  the  coefficient  of  x25  in  the  binomial  expansion  of 

(*-?)"• 

13.  Prove  that  4"  >  n4  for  all  integers  n  >  5. 

14.  Sequences  of  integers,  {x„},  [y„],  n  >  0,  are  defined  by 
( 1  +  s/2)n  =  x„  +  yn  Jt.. 

(a)  [BB]  Find  the  first  three  terms  and  the  fifth  term  of 
each  sequence. 

(b)  Find  formulas  for  x„  and  y„ . 

(c)  Without  using  the  result  of  (b),  establish  the  recur¬ 
sion  formulas 

Xn+i  —  Xn  +  2v„ 

yn- h  —  xn  +  yn 

for  n  >  0. 

15.  (For  students  who  have  studied  calculus)  In  Problem  8 
on  p.  14,  we  proved  that  s/l  is  irrational,  that  is,  not 
the  quotient  of  integers.  In  this  exercise,  we  give  an 
alternative  proof  using  the  binomial  theorem. 

(a)  If  n  is  an  integer,  show  that  there  exist  integers  x„ 
and  y„,  depending  on  n,  such  that  (s/2  —  1)"  = 
x„  +  yns/2. 

(b)  Explain  why  lim^ootV?  —  1)"  =  0. 

(c)  Suppose  that  s/2  is  rational  and  show  that  (a)  and 
(b)  lead  to  a  contradiction. 
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16.  [BB]  At  the  beginning  of  this  section,  we  said  that  the 
numbers  in  row  n  of  Pascal’s  triangle,  interpreted  as  the 
“digits’  of  a  single  number  in  base  10,  give  the  integer 
11".  Prove  this. 

17.  Suppose  that  we  construct  Pascal’s  triangle  as  described 
at  the  beginning  of  this  section.  Prove  that  the  entries  of 
row  n  are  (£),  (”),  Q,  ...  ,  (").  [Hint:  Identity  (3).] 

18.  Prove  the  Binomial  Theorem  by  mathematical  induction. 
[Hint:  Identity  (3)  will  be  useful.] 

19.  (a)  Show  that  J2k= 2  (2)  =  (3)  anc*  interpret  this  result 

with  reference  to  Pascal’s  triangle. 

(b)  Show  that  J2k=r  C)  =  ("+')  tor  any  r  and  w»  1  - 
r  <  n,  and  interpret  with  reference  to  Pascal’s  tri¬ 
angle. 

20.  [BB]  Show  that  Q  -  ('/)  +  (")-■■■  +  (-!)"(”)  =  0 
for  all  natural  numbers  n. 


21.  (a)  By  direct  calculation,  show  that  34  =  1  +  2(4 6 *)  + 

4(a)  +  8 *(:s)  +  16- 

(b)  Find  a  simple  expression  for  (q)+2(")+4(2)  +  -  ■  •  + 
2" (”)  valid  for  all  natural  numbers  «  and  prove  your 
answer. 

22.  (a)  [BB]  Find  a  simple  expression  for  ^(k)  anc* 

prove  your  answer. 

(b)  Find  a  simple  expression  for  J2k=  \  ^G!)  an^  Prove 
your  answer. 

[Hint:  In  both  of  these  questions,  to  begin,  it  may  prove 
helpful  to  evaluate  the  sum  for  small  values  of  n.] 

23.  Consider  the  Fibonacci  sequence  ao,  a\ ,  <5(2,  ■  •  • ,  where 
do  =  a i  =  l  and,  for  n  >  1,  a„+\  =  a„  +a„-\.  Express 
an  for  n  >  I  as  the  sum  of  certain  binomial  coeffi¬ 
cients  and  prove  your  answer.  [Hint:  See  remarks  at  the 
beginning  of  this  section.] 


REVIEW  EXERCISES  FOR  CHAPTER  7 


1.  In  how  many  ways  can  seven  boys  and  six  girls  stand 
in  a  row  if  the  girls  are  to  stand  together  but  the  boys 
must  not  stand  together? 

2.  Do  Exercise  1  again  with  “circle"  instead  of  "row.” 

3.  How  many  permutations  of  the  letters  a,  b,  c,  d,  e,  f 
contain  at  least  one  of  the  patterns  aeb  and  bcfl 

4.  Find  an  expression  for  the  number  of  five-card  poker 
hands  which  contain  exactly  three  kings. 

5.  A  committee  of  seven  is  to  be  chosen  from  eight  men 
and  nine  women. 

(a)  How  many  such  committees  contain  at  least  six 
women? 

(b)  How  many  such  committees  contain  either  Bob  or 
Alice  but  not  both? 

6.  A  Middle  East  peace  conference  will  be  attended  by 

five  Arab  countries,  four  Western  countries,  and  Israel. 

Each  delegation  will  sit  together  at  a  round  table  sub¬ 

ject  only  to  the  requirement  that  no  Arab  delegation  be 


seated  next  to  the  Israeli  delegation.  How  many  seating 
arrangements  are  possible? 

7.  Frank  wants  to  buy  12  muffins  and  finds  seven  different 
types  available.  In  how  many  ways  can  he  make  his 
selection? 

8.  In  how  many  ways  can  the  letters  of  the  word 
NUNAVUT  be  rearranged? 

9.  Find  the  coefficient  of  x5  in  the  binomial  expansion  of 
(,v  —  2.v~2)20.  Do  not  simplify. 

/  ?  1  \12 

10.  Find  the  coefficient  of  x  in  p6.r  —  —J  .  Simplify 
your  answer. 

11.  (a)  Give  a  verbal  argument  for  the  truth  of  the  identity 

C)  =  ("*:!) +  (";')• 

(b)  If  n  >  k+2  and  k  >  2,  show  that  ('^)  —  (”  k  2 3)  —  (^_?) 
is  even. 

12.  Find  a  simple  expression  for  3"  -  (")3"_1  +  •••  + 
(-l)k(nk)3"-k  +  •••  +  (-!)". 


CHAPTER 


Algorithms 


8.1  What  is  an  Algorithm? 

Computers!  Computers!  Computers!  Are  they  the  bane  of  our  existence  or  the 
most  important  invention  since  the  discovery  of  the  wheel?  Certainly  there  is  no 
denying  that  computers  have  changed  almost  every  aspect  of  our  lives.  They  are 
used  by  stockbrokers  to  track  the  performance  of  stocks  and  to  provide  guidance 
as  to  which  should  be  bought  and  sold.  Much  of  the  blame  for  the  October  1987 
North  American  stock  market  crash  was  attributed  to  computers  all  giving  the 
same  "sell”  advice  at  the  same  time.  Nowadays,  virtually  every  company  puts 
personal  computers  on  the  desks  of  their  employees,  in  the  belief  that  computers 
both  increase  productivity  and  cut  revenue  losses.  Upon  discovery  of  a  potential 
flaw  in  some  part,  a  car  manufacturer  is  able  almost  instantly  to  produce  a  list 
of  the  names  and  addresses  of  all  the  people  across  North  America  who  have 
purchased  a  car  containing  that  particular  part.  The  list  of  computer  applications 
is  literally  endless.  But  what  is  a  computer? 

A  computer  is  fundamentally  just  a  box  containing  wires  and  chips  (these 
being  tiny  circuits  built  onto  small  boards)  all  cleverly  put  together  so  that  the 
machine  can  follow  instructions  given  to  it.  The  box  of  wires  and  chips,  and 
peripheral  items  such  as  monitor  and  keyboard,  are  called  hardware ;  the  instruc¬ 
tions  fed  to  the  machine  are  called  software.  Computer  programming  involves  writ¬ 
ing  software  that  allows  a  machine  to  perform  various  tasks  which,  by  hand,  would 
be  tedious  or  so  time  consuming  as  to  be  essentially  impossible  to  do.  Machines 
work  incredibly  quickly,  never  get  tired,  and  are  excellent  at  following  orders; 
however,  they  will  only  perform  as  well  as  the  instructions  presented  to  them. 
One  of  the  oldest  adages  in  the  computer  business  is  “Garbage  in — garbage  out!”. 

There  are  two  parts  to  a  computer  program.  There  is  the  process  or  sequence 
of  steps  which  is  necessary  to  complete  the  given  task  and  the  translation  of  this 
process  into  a  language  which  the  computer  can  understand.  In  this  chapter,  we 
explore  the  first  of  these  ideas,  asking  the  reader  to  think  about  the  process  or 
algorithm  by  which  familiar  tasks  are  accomplished. 
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The  word  “algorithm”  evolved  from  the  older  word  “algorism”  which  is  a 
corruption  of  the  surname  of  a  ninth-century  Persian,  Abu  Ja‘far  Muhammad 
ibn  Musa  al-Khwarizmi,  who  wrote  an  important  book  setting  forth  rules  for 
performing  arithmetic  with  the  arabic  numerals  1, 2,  3  . . .  we  use  today.1  While 
the  older  term  “algorism”  referred  primarily  to  arithmetical  rules,  “algorithm” 
today  is  used  in  a  more  general  sense  as  a  virtual  synonym  for  “procedure.”  Its 
ingredients  are  an  input,  an  output,  and  a  sequence  of  precise  steps  for  converting 
the  input  to  the  output.  In  Chapter  4,  we  studied  the  Euclidean  algorithm  which, 
upon  input  of  positive  integers  a  and  b,  outputs  the  greatest  common  divisor  of 
a  and  b  through  a  precise  series  of  steps. 

As  children,  after  learning  the  addition  rules  for  single-digit  numbers,  we  were 
taught  a  procedure  for  adding  numbers  with  any  number  of  digits;  upon  input  of 
two  arbitrary  integers,  the  procedure  outputs  their  sum. 

12  3  4 

5  6,  7,  8 

6  9  12 

We  memorized  multiplication  tables  for  single-digit  numbers  and  then  were  taught 
a  procedure  which,  upon  input  of  two  arbitrary  numbers,  outputs  their  product. 

Folklore  has  it  that  Russian  peasants  have  an  instinctive  and  flawless  ability 
to  multiply  and  divide  by  the  number  2  and  are  able  to  use  this  successfully 
to  multiply  any  two  numbers.  For  example,  they  would  compute  211  x  453  by 
the  scheme  shown  in  Fig  8.1.  Each  number  on  the  left  is  the  quotient  when  the 
number  above  it  is  divided  by  2;2  the  numbers  on  the  right  are  the  product  of  2 
and  the  number  above.  After  reaching  1  in  the  left  column,  the  rows  containing 
even  numbers  on  the  left  are  crossed  out.  The  desired  product  is  then  the  sum  of 
the  numbers  which  remain  uncrossed  on  the  right! 


211  x 
105 

CT 

453 

906 

1  9  1  T 

J  Z. 

O  A 

lOiA 

^AO/1 

1.XJ 

13 

A _ 

7248 

3 

1 

28992 

57984 

95583 

Figure  8.1  The  Russian  peasant  method  of  multiplication. 

Before  the  days  of  pocket  calculators,  school  children  were  taught  a  procedure 
for  finding  the  square  root  of  a  number.  To  find  the  square  root  of  2,  for  instance, 
a  student  was  taught  to  produce  the  pattern  shown  in  Fig  8.2.  First,  we  find  the 


1  Interestingly,  the  word  “algebra”  derives  from  the  Latin  title  of  this  book,  Ludus  algebrae  et  almuc- 
grabalaeque. 

^Remember  that  when  a  natural  number  a  is  divided  by  another  natural  number  b.  the  quotient  is  the 
integer  part  of  the  fraction  .  See  Section  4.1. 
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Figure  8.2 


Figure  8.3 


largest  integer  whose  square  is  less  than  2  and  write  this  number,  1,  in  the  two 
places  shown.  The  product  of  1  and  1  is  1  and  2—1  =  1.  Write  down  1  followed 
by  two  0’s.  Next,  we  double  the  number  on  top,  giving  2,  and  search  for  the  largest 
integer  x  such  that  x  x2x  does  not  exceed  1 00.  We  have  x  =  4,  4  x  24  =  96,  and 
100  —  96  =  4,  as  shown.  Follow  this  4  by  another  two  0’s,  double  the  number  on 
top,  giving  28,  and  search  for  the  largest  single  digit  x  such  that  x  x  28x  does  not 
exceed  400.  We  have  x  —  1,  1  x  281  =  281,  and  400  —  281  =  119,  and  so  forth. 
Whether  or  not  you  understand  this  scheme  is  much  less  important  today  than  it 
was  formerly.  We  mention  it  here  not  just  as  a  historical  curiosity,  but  as  another 
example  of  an  arithmetic  algorithm,  one  which  allows  the  computation  of  square 
roots  to  any  degree  of  accuracy. 


1.  4  1  4  2 


2.  0  0  |  0  0  |  0  0  |  0  0 

1 

1 

24 

1  0  0 

9  6 

281 

4  0  0 

2  8  1 

2824 

119  0  0 

112  9  6 

28282 

6  0  4  0  0 

5  6  5  6  4 

3  8  3  6 


Part  of  an  algorithm  for  determining  s/2. 

In  a  course  on  Euclidean  geometry,  we  learn  a  vast  number  of  geometric 
algorithms  usually  called  constructions.  To  bisect  an  angle  with  vertex  A,  for 
instance,  we  draw  an  arc  with  center  A.  If  B  and  C  are  the  points  where  this  arc 
meets  the  arms  of  the  angle,  we  then  draw  arcs  with  centers  B  and  C  and  some 
suitably  large  radius  (for  example,  the  length  of  AB).  If  these  arcs  meet  at  the 
point  P,  then  AP  is  the  bisector  of  the  angle  at  A.  (See  Fig  8.3.) 


In  high  school  and  college,  students  learn  algorithms  for  multiplying  poly¬ 
nomials,  for  multiplying  matrices,  for  solving  systems  of  linear  equations,  for 
differentiating  polynomials;  the  list  goes  on.  In  the  rest  of  this  chapter,  we  dis¬ 
cuss  important  algorithms  the  reader  may  not  have  seen  before  and  their  relative 
strengths  and  weaknesses.  Since  this  is  not  a  text  in  computer  programming,  our 
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description  of  algorithms  is  mostly  in  ordinary  English,  subject  only  to  the  require¬ 
ments  of  clarity  and  precision:  It  should  always  be  clear  to  a  programmer  how  to 
implement  any  of  our  algorithms  in  his  or  her  favorite  language. 

PROBLEM  1.  Describe  an  algorithm  whose  input  is  a  list  ai,a2, ...  ,an  of 
integers  and  whose  output  is  their  sum. 

Solution.  This  question  requires  us  to  confront  what  an  expression  like  5+17  + 
3  +  6  +  4  really  means.  How  do  the  basic  rules  for  addition  of  two  numbers 
extend  to  permit  the  addition  of  several  numbers?  Most  people  would  calculate 
5  +  17  +  3  +  6  +  4  by  adding  from  the  left,  like  this: 

{[(5  +  17)  +  3]  +  6}  +  4  =  [(22  +  3)  +  6]  +  4  =  (25  +  6)  +  4  =  31  +  4  =  35. 

This  approach  is  the  idea  behind  our  algorithm. 

To  find  the  sum  of  n  integers  a\ ,  a2,  ■  ■  ■  ,an. 

Step  1.  set  5  =  0; 

Step  2.  for  i  =  1  to  n,  replace  S  by  S  +  a, ; 

Step  3.  output  S. 

The  value  of  S  output  at  Step  3  is  the  desired  sum.  I 


Step  2  calls  for  a  loop ,  a  feature  of  every  programming  language.  Whenever 
we  write  “for  i  =  1  to  n"  as  part  of  an  algorithm,  we  mean 

•  set  i  =  1  and  execute  the  indicated  statement  (or  statements); 

«  increase  the  value  of  i  by  1  and,  if  i  <  n,  execute  the  indicated  statement(s); 

•  continue  to  increase  the  value  of  i  by  1  and,  as  long  as  i  <  n,  execute  the 

indicated  statement(s); 

•  if  the  stage  i  >  n  is  reached,  skip  to  the  next  step. 

The  variable  i  which  is  used  in  this  process  is  called  a  counter.  For  example, 
with  a\ ,  a2,  fl4,  «5,  respectively,  equal  to  5,  17,  3,  6,  4,  the  preceding  algorithm 
begins  by  setting  S  =  0.  At  Step  2,  the  counter  i  is  set  equal  to  1  and  S  is 
replaced  by  5  +  a|  =  0  +  5  =  5.  Then  i  is  set  equal  to  2  and  S  is  replaced  by 
5  +  02  =  5  +  17  =  22.  Continuing,  S  is  replaced  by  S  +  as  =  22  +  3  =  25,  then 
by  S  +  a 4  =  25  +  6  =  31  and,  finally,  by  S  +  a$  =  31  +  4  =  35.  Then  i  is  set 
equal  to  6,  at  which  point  the  algorithm  skips  to  Step  3  and  outputs  S  =  35,  the 
sum  of  a\ , . . .  ,  as. 


PROBLEM  2.  (Polynomial  evaluation)  Describe  an  algorithm  which,  upon  input 
of/7  +  1  integers  uq,  a  i,  a2, .. .  ,  an  and  an  integer  x,  outputs  the  integer  ao  +  a|X  + 
a2x2  -) - 1-  anxn. 

Solution.  Perhaps  the  most  obvious  approach  to  this  problem  is  first  to  compute 
the  powers  of  .r  from  x  to  xn  and  then  to  evaluate  the  expression  ao  +  a\x  + 
a2x2  +  •  •  •  +  anxn  term  by  term. 


EXAMPLE  3 


8.1.1  HORNER'S 
ALGORITHM 
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Given  integers  ao,a\, . . .  ,  an  and  an  integer  x,  to  compute  the  number  ao  + 
a\x  +  d2X~  +  •  •  •  +  anxn , 

Step  1.  set  P  =  1; 

Step  2.  for  i  =  1  to  n,  replace  P  by  Px,  call  this  element  jc,  and  store  (note  that 
xi  =x')-. 

Step  3.  set  S  —  ao; 

Step  4.  for  i  =  1  to  n,  replace  S  by  S  +  cijXi ; 

Step  5.  output  S. 

The  value  of  S  output  at  Step  5  is  the  desired  integer  aQ+a\x+a2X2  -\ - \-a„xn . 

I 


To  evaluate  the  polynomial  f(x)  =  —  1  +  2x  +  4x2  —  3x3  at  x  =  5,  the  algorithm 
takes  as  input  ao  =  —  1,  a\  =  2,  az  —  4,  03  =  —3  and  x  —  5  and  proceeds  as 
follows. 

Step  1.  P  =  1; 

Step  2.  i  =  1:  replace  P  by  Px  =  1  (5)  =  5  (P  is  now  5)  and  let  jcj  =  5; 

i  =  2:  replace  P  by  Px  =  5(5)  =  25  (now  P  =  25)  and  let  X2  =  25; 

i  =  3:  replace  P  by  Px  =  25(5)  =  125  (now  P  =  125)  and  let  *3  =  125. 
Since  i  =  n  =  3,  Step  2  is  complete. 

Step  3.  S  =  cio  =  —  1 ; 

Step  4.  i  =  1:  replace  S  by  S  +  a\x\  =  —  1  +  2(5)  =  9  (now  S  —  9); 

i  =  2:  replace  5  by  S  +  02x2  =  9  +  4(25)  =  109  (now  S  =  109); 

i  =  3:  replace  S  by  5  +  03x3  =  109  —  3(125)  =  —266  (now  S  =  —266). 
Since  i  =  n  —  3,  Step  4  is  complete. 

Step  5.  Output  S  =  —266.  ▲ 

There  is  a  much  more  efficient  way  to  evaluate  a  polynomial,  more  effi¬ 
cient  in  the  sense  that  it  requires  fewer  arithmetical  operations  and  fewer  stor¬ 
age  requirements.  It  is  based  upon  the  observation,  due  to  William  Horner,  an 
eighteenth-century  English  school  headmaster,  that 

ao  T  a\x  T  aix~  T  •  •  •  T  anxn  =  ao  T  {a\  T  aix  T  a^x~~  T  •  •  •  -f  anxn  ^  )x 

=  ao  +  (fl|  +  («2  +  a->,x  +  ■  ■  ■  +  anxn~2)x)x 


=  ao  +  (a\  +  (02  -I - b  (a„_i  +  a„x)x)  ■  ■  ■  )x. 


Given  integers  ao,a\, ...  ,an  and  an  integer  x,  to  evaluate  the  expression  ao  + 
ci\x  +  Q2X 2  +  •  •  •  +  anxn , 
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EXAMPLE  4 


Answers  to  Pauses 


Step  1.  set  S  =  a„ ; 

Step  2.  for  i  =  1  to  n,  replace  S  by  a„-i  +  Sx ; 

Step  3.  output  S. 

The  final  value  of  S  is  the  desired  number  ciq+  a\x  +  aix~  +  •  •  •  +  anx" . 

We  apply  Horner’s  method  to  the  same  polynomial  considered  in  Example  3, 
f(x)  =  —  1  +  2x  +  4jc2  —  3jc3,  and  again  with  x  =  5. 

Step  1.  set  S  =  =  —3. 

Step  2.  i  =  1:  replace  S  by  ai  +  Sx  =  4  —  3(5)  =  —11  (now  S  =  —  1 1); 

i  =  2:  replace  5  by  a\  +  Sx  =  2  —  11(5)  =  —53  (now  S  =  —53); 

i  =  3:  replace  S  by  ao  +  Sx  =  —  1  —  53(5)  =  —266  (now  S  =  —266). 

Since  i  =  n  =  3,  Step  2  is  complete. 

Step  3.  output  S  =  —266.  A 

Show  the  successive  values  of  S  when  Homer’s  method  is  used  to  evaluate  the 
polynomial  f(x)  =  4x3  —  2x  +  1  at  x  =  —2. 

Many  of  the  uses  to  which  modern-day  computers  are  put  do  not  involve 
arithmetical  calculations  but  take  advantage  of  a  machine’s  ability  to  perform 
certain  logical  operations.  Of  these,  the  most  fundamental  is  its  ability  to  compare, 
that  is,  to  decide  if  two  elements  are  equal.  The  next  two  sections  are  devoted  to 
such  important  nonarithmetical  procedures  as  searching,  sorting,  and  enumerating 
permutations.  We  conclude  this  section  with  one  example  of  an  algorithm  in  which 
comparison  is  the  basic  operation. 


PROBLEM  5.  Given  a  “small”  list,  it  is  easy  for  a  person  to  note  the  distinct 
items  it  contains.  The  distinct  items  in  abc,  dbc,  abc,  xbc,  dbc  are,  of  course,  abc, 
dbc,  and  xbc.  To  record  the  distinct  items  in  a  list  of  a  thousand  or  more  items 
would  be  an  almost  intolerable  task  for  most  people,  but  it  is  a  simple  matter 
for  a  computer.  Describe  an  algorithm  which,  upon  input  of  a  list  a\ ,  02.  •  ■  •  ,  an, 
outputs  the  distinct  items  in  this  list. 

Solution.  The  idea  is  straightforward.  We  compare  each  element  in  the  list  with 
those  preceding  it  and,  if  it  is  different  from  all  its  predecessors,  output  it. 

Step  1.  output  a  i ; 

Step  2.  if  n  =  1,  stop; 

else  for  i  =  2  to  n, 

if  a;  does  not  equal  any  of  a\,aj . i ,  output  a,. 

The  algorithm  outputs  the  distinct  items  among  a\,a2, ...  ,an-  I 


1.  We  have  n  =  3,  ao  =  L  a\  =  —2,  ci2  =  0  and  aj  =  4.  The  first  value  of  S 
is  S  =  4.  Then  5  =  0  +  4(— 2)  =  -8,  S  =  -2  -  8(— 2)  =  14  and,  finally, 
5=1  +  14(— 2)  =  -27. 
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EXERCISES 


The  symbol  [ BB ]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 

In  the  first  three  exercises,  ruler  means  an  unmarked  straight  edge  and  compass  means  cm  instrument  which  allows  the 
describing  of  circles  or  arcs  thereof. 


1.  [BB]  Describe  a  procedure  for  finding  the  mid-point  of  a 
line  segment,  with  only  ruler  and  compass,  and  explain 
why  your  procedure  works. 

2.  Given  a  line  l  in  the  plane  and  a  point  P  not  on  i , 
describe  a  ruler  and  compass  procedure  for  constructing 
the  line  through  P  parallel  to  l,  and  explain  why  your 
procedure  works. 

3.  Given  a  point  P  and  a  line  l,  describe  a  ruler  and  com¬ 
pass  procedure  for  constructing  the  line  through  P  per¬ 
pendicular  to  l  in  each  of  the  following  cases.  In  each 
case,  explain  why  your  procedure  works. 

(a)  [BB]  P  is  not  on  i. 

(b)  P  is  on  l. 

4.  Find  each  of  the  following  products  by  the  Russian  peas¬ 
ant  method. 

(a)  168  x  413 

(b)  461  x  973 

(c)  141  x  141 

5.  [BB]  Explain  why  the  Russian  peasant  method  works. 

6.  Find  the  square  roots  of  3,  12,  and  153,  to  four  decimal 
places  of  accuracy,  by  the  method  used  to  compute  sfl 
in  the  text  and  illustrated  in  Fig  8.2. 

7.  [BB]  Describe  an  algorithm  which,  upon  input  of  n  real 
numbers,  outputs  their  average. 

8.  [BB]  Describe  an  algorithm  which,  upon  input  of  n  real 
numbers,  outputs  the  maximum  of  these  numbers. 

9.  Describe  an  algorithm  which,  upon  input  of  n  real 

numbers,  a\ ,  aj . a„,  and  another  number,  x, 

determines  how  many  items  in  the  list  are  equal 
to  x. 

10.  Describe  an  algorithm  which,  given  n  real  numbers, 

tfi,«2 . outputs  the  number  of  a,  which  lie  in 

the  range  85-90,  inclusive. 

11.  [BB]  Given  an  ordered  list  a \  <  a2  <  •  ■  •  <  an  of  real 
numbers  and  a  real  number  x.  describe  an  algorithm 
which  inserts  x  into  its  correct  position  in  the  list  and 
outputs  the  ordered  list  of  n  +  1  numbers. 


12.  Describe  an  algorithm  which,  upon  input  of  integers  a 
and  b  and  a  natural  number  n,  outputs  all  solutions  of 
ax  =  b  (mod  n)  in  the  range  0  <  x  <  n,  if  there 
are  solutions,  and  otherwise  outputs  the  words 
no  solution. 

13.  [BB]  Let  n  be  a  given  natural  number.  Find  an  algorithm 
for  writing  an  integer  a  in  the  form 

a  =  a„n  \  +  a„-\(n  —  1)1  +  •  •  •  +  as  2!  +  a  i 

with  0  <  a,  <  i  for  i  =  1,2 . n  —  1.  [Hint:  Mod¬ 

ify  the  procedure  for  converting  from  base  10  to  base  b 
described  in  Section  4.1.] 

14.  Describe  an  algorithm  which,  upon  input  of  a  number  n 
given  in  base  10,  outputs  the  digits  of  n  (from  right  to 
left). 

15.  [BB]  Describe  an  algorithm  which,  upon  input  of  n  dis¬ 

tinct  symbols  a\.  U2,  ...  .  an,  outputs  all  the  subsets  of 
(«[,  a 2 . a„}.  [Hint:  One  way  to  do  this  is  to  recog¬ 

nize  that  the  subsets  of  [a \ ,  <22.  •  •  •  ,  a,,}  are  in  one-to- 
one  correspondence  with  the  binary  representations  of 
the  numbers  between  0  and  2"  —  I.] 

16.  For  each  polynomial  fix)  and  each  value  of  x,  list  in 
order  the  successive  values  S  which  occur  in  the  calcu¬ 
lation  of  / (x)  by 

i.  the  algorithm  described  in  Problem  2; 

ii.  Horner’s  algorithm. 

Show  your  calculations. 

(a)  [BB]  f  (x)  =  2,v2  —  3jc  4-  1;  x  =  2 

(b)  fix)  =  3*2  +  1;  x  =  5 

(c)  f(x)  =  -4x3  +  6a:2  +  5x  -  4;  x  =  - 1 

(d)  f(x)  =  17x5  —  40.r3  +  16.r  —  l;x  =  3 

17.  [BB]  Explain  why  Horner’s  algorithm  works. 

18.  Describe  an  algorithm  which  upon  input  of  a  natural 
number  n  outputs  the  set  of  primes  in  the  range  1-/?. 
[Hint:  Use  the  Sieve  of  Eratosthenes  and  take  advantage 
of  Lemma  4.3.4.] 

19.  Cite  examples  of  five  algorithms  not  mentioned  in  this 
text  which  you  learned  after  leaving  high  school. 
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8.2  Complexity 

Suppose  that  two  mathematicians  working  in  the  research  department  of  a  large 
corporation  are  given  a  problem  to  solve.  Each  of  these  people  has  a  solid  back¬ 
ground  in  discrete  mathematics  and  so  has  no  difficulty  in  finding  a  suitable 
algorithm,  but  the  two  algorithms  obtained  are  very  different.  When  implemented 
on  a  computer,  one  of  the  algorithms  took  far  less  time  to  produce  an  answer 
than  the  other.  The  head  of  the  research  department  is  pleased  that  an  efficient 
algorithm  was  discovered,  but  also  concerned  because  of  the  huge  amount  of 
computer  time  wasted  running  the  inefficient  algorithm.  Is  it  possible  to  estimate 
the  amount  of  time  an  algorithm  will  require  before  actually  implementing  it?  The 
answer  is  yes,  and  we  shall  begin  to  see  why  in  this  section. 

Just  like  a  person,  a  computer  requires  a  certain  amount  of  time  to  carry 
out  an  arithmetical  operation,  a  multiplication  or  an  addition,  for  instance.  So 
we  can  estimate  the  time  an  algorithm  requires  by  calculating  the  number  of 
arithmetical  operations  it  involves.  This  is  at  best  only  approximate,  of  course, 
because  different  operations  may  well  require  different  amounts  of  time. 

EXAMPLE  6  In  order  to  determine  the  distance  between  two  points  (jq,jC2 . xn)  and 

(yi,  T2.  •  •  •  .  v«)  in  ^-dimensional  Euclidean  space,  we  must  calculate  the  number 

7 (*t  -  y\)2  +  (X2  -  yi)2  H - b  {xn  -  vn)2. 

If  n  =  2,  this  requires  two  subtractions,  two  squarings,  one  addition,  and  one 
square  root,  six  operations  in  all.  (While  we  might  expect  multiplication  and 
especially  the  square  root  operation  to  be  more  time  consuming  than  addition 
and  subtraction,  we  ignore  this  potential  complication  for  now.)  If  n  =  3,  nine 
operations  are  required;  if  n  =  4,  12  operations  are  required,  and  so  on.  In 
general,  the  number  of  operations  in  the  n -dimensional  case  is  3 n.  As  is  typical, 
the  number  of  operations  depends  on  the  size  of  the  input,  in  this  case,  the  2 n 
numbers  jq, ..., xn,  yi, ...,  yn.  ▲ 

There  are  other  measures  of  the  efficiency  of  algorithms  in  addition  to  time. 
For  instance,  different  algorithms  require  different  amounts  of  space  to  hold  num¬ 
bers  in  memory  for  later  use;  the  less  space  required  the  better.  We  shall,  however, 
concentrate  entirely  upon  time  estimates  of  efficiency,  as  measured  by  operation 
counts. 

Often,  and  particularly  in  the  graph  theoretical  algorithms  which  we  will  study 
in  later  chapters,  it  is  uncertain  exactly  how  many  operations  will  be  needed.  Our 
approach  will  be  to  seek  upper  bound,  also  called  worst-case,  estimates  of  the 
number  of  operations  which  we  know  will  never  be  exceeded. 

In  general,  for  a  given  algorithm,  we  try  to  find  the  complexity  func¬ 
tion  f :  N  — >  N,  where,  for  some  measure  n  of  the  size  of  the  input,  f(n)  is 
an  upper  bound  for  the  number  of  operations  required  to  carry  out  the  algorithm. 
In  the  distance  algorithm  described  in  Example  6,  it  is  logical  to  let  n  be  the 
dimension  of  the  Euclidean  space;  then  /(«)  =  3 n.  As  we  shall  see,  it  is  unusual 
to  have  such  an  exact  count  for  the  number  of  operations. 

We  have  noted  that  estimating  the  running  time  of  an  algorithm  by  estimating 
the  number  of  operations  required  is  an  inherently  inexact  process  because  it 
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cannot  be  expected  that  all  operations — addition,  multiplication,  square  root — 
require  equal  amounts  of  time.  There  is  yet  another  difficulty.  The  time  required 
to  perform  the  basic  operations  of  arithmetic  depends  on  the  size  of  the  integers 
involved.  Clearly  it  takes  more  time  to  add  or  multiply  two  50-digit  numbers 
than  it  does  to  add  or  multiply  two  three-digit  numbers.  The  following  problem 
indicates  that  it  is  possible  to  account  for  this  additional  complication. 

PROBLEM  7.  Find  the  complexity  function  for  adding  two  n-digit  integers  if 
the  basic  operation  is  addition  of  single-digit  integers. 

Solution.  Suppose  the  integers  to  be  added  are  a  =  (an-ia„-2  ■  ■  ■ «  1  a0 ) l o  and 
b  =  (bn-\bn_2  ■  ■  ■  b\bo)\o,  expressed  in  base  10  using  the  notation  of  Section  4.1. 
The  units  digit  of  a  is  ao  and  the  units  digit  of  b  is  bo;  the  tens  digit  of  a  is  a\ 
and  the  tens  digit  of  b  is  b\,  and  so  forth.  The  units  digit  of  a  +  b  is  obtained 
by  adding  ao  and  bo,  a  single  operation.  To  obtain  the  tens  digit,  we  add  a\  and 
b\  \  then,  perhaps,  we  add  1,  depending  on  whether  or  not  there  is  a  carry  from 
the  previous  step.  Hence,  at  most  two  single-digit  additions  (two  operations)  are 
required  for  the  tens  digit  of  a  +  b.  Similarly,  at  most  two  operations  are  required 
for  each  digit  of  a  +  b  after  the  units  digit.  An  upper  bound  for  the  number  of 
operations  is  /(«)  =  1  +  2(n  —  1)  =  2n  —  1.  I 


8.2.1  DEFINITION 


In  this  problem,  and  typical  of  most  complexity  problems,  we  were  unable  to 
obtain  an  exact  count  for  the  number  of  single-digit  additions  required.  Remember 
that  for  us,  complexity  is  measured  in  worst-case  terms.  The  addition  of  two  « -digit 
numbers  requires  at  most  2 n  —  1  single-digit  additions. 

Because  of  the  approximate  nature  of  operation  counts,  it  is  useful  to  have 
some  notation  by  which  we  can  easily  suggest  the  size  of  a  function. 

Let  /  and  g  be  functions  N  — »  R.  We  say  that  /  is  Big  Oh  of  g  and  write  /  =  O(g) 
if  there  is  an  integer  no  and  a  positive  real  number  c  such  that  |/(n)|  <  c\g(n)\ 
for  all  n  >  no? 


8.2.2  REMARK 


1.  Instead  of  saying  “There  exists  an  integer  no  such  that  \f(n)\  <  c|g(«)|  for 
all  n  >  no,  we  often  say  simply  “\f(n)\  <  c\g(n)\  for  sufficiently  large  n.” 

2.  If  /,  g :  N  ->  R  are  functions  which  count  operations  or,  more  generally,  as 
long  as  f(n )  and  gin)  are  positive  for  sufficiently  large  n ,  then  the  abso¬ 
lute  value  symbols  around  f(n)  and  g(n)  in  Definition  8.2.1  are  not  neces¬ 
sary.  Whenever  we  apply  this  definition  without  absolute  value  symbols,  it 
will  be  because  the  functions  in  question  are  either  positive  or  positive  for 
large  n. 


EXAMPLE  8 


Let  f(n)  =  15n3  and  g(n)  =  n3.  With  n o  =  1  and  c  —  15,  we  see  that  /  = 
Ofg).  '  A 


It  is  also  true  that  g  =  Off).  Why? 


3There  is  also  a  Little  Oh  notation,  which  we  do  not  introduce  since  f(n)  =  o(g(n))  is  the  same  as 
f(n)  <  gin),  notation  which  we  prefer.  See  Definition  8.2.4. 
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EXAMPLE  9 


8.2.3  PROPOSITION 


Proof 


EXAMPLE  10 


8.2.4  DEFINITIONS 


EXAMPLES  11 


Let  f(n)  =  n  +  1  and  g(n)  =  n~ .  If  n  >  1,  /(«)  <  n  +  n  =  2 n  <  2 nr.  Taking 
no  =  1  and  c  =  2,  we  see  that  /  =  O(g).  On  the  other  hand,  g  ^  O(f),  for 
suppose  that  n~  <  c(n  +  1)  eventually,  that  is,  for  all  n  greater  than  or  equal  to 
some  number  hq.  Dividing  by  n ,  we  have  n  <  c(l  +  £)  <  2c  for  all  n  >  no.  This, 
however,  is  impossible  because  2c  is  a  constant.  ▲ 

Two  fundamental  properties  of  Big  Oh  are  summarized  in  the  next  proposition. 

Let  f,  g,  f],  g\  be  functions  N  ->  R. 

(a)  If  /  =  O(g),  then  f  +  g  =  0(g). 

(b)  If  /  =  0(f\)  and  g  =  G(g i),  then  fg  =  0(f\g\). 

We  prove  each  part  directly  from  the  definition  and,  in  part  (a),  use  the  triangle 
inequality  which  states  that  \a  +  b\  <  \a  \  +  |b|  for  any  real  numbers  a  and  b. 

(a)  There  exists  a  positive  constant  c  and  an  integer  no  such  that  \f(n)\  <  c\g(n)\ 
for  all  n  >  no-  So,  for  all  n  >  no,  \f(n)  +  g(n)|  <  \f(n)\  +  \g(n)  < 
c\g(n)\  +  |g(n)|  =  (c  +  l)|g(n)|.  With  no  and  c  +  1  in  Definition  8.2.1,  we 
see  that  /  +  g  =  O(g). 

(b)  There  exists  a  constant  c  and  positive  integer  no  such  that  \f(n)\  <  c\f\(n)\ 

for  all  n  >  no .  There  exists  a  constant  d  and  positive  integer  n  \  such  that 
\g(n)\  <  d\g\{n)\  for  all  n  >  n\.  Let  n*  =  max{«o»«i}-  Then,  if  n  >  n*, 
\f(n)g(n)\  =  |/(«)||g(n)|  <  c|/i(n)|  d\g\{n)\  =  cd\f\(n)g\(n)\.  Replacing 
the  constants  no  and  c  in  Definition  8.2.1  with  n*  and  cd,  respectively,  we 
see  that  fg  =  0(f\g\).  I 

We  showed  in  Example  9  that  n  +  l  =  0(n2).  Thus,  by  part  (a)  of  Proposi¬ 
tion  8.2.3,  we  know  also  that  n2  +  n  +  l  —  0(n2).  Since  n  +  1  =  O(n),  part  (b) 
tells  us  that  (n  +  1)(«2  +  n  +  1)  =  0(n2).  A 

Big  Oh  gives  us  a  way  to  compare  the  relative  sizes  of  functions. 

If  /  and  g  are  functions  N  -»  R,  we  say  that  /  has  smaller  order  than  g  and 
write  f  <  g  if  and  only  if  /  =  0(g )  but  g  ^  0{f).  If  /  =  O(g)  and  g  =  0(f), 
then  we  say  that  /  and  g  have  the  same  order  and  write  /  x  g. 

•  Example  9  shows  that  n  +  1  n2;  thus,  n  +  1  has  smaller  order  than  n2. 

•  Example  8  and  the  Pause  which  follows  show  that  15n3  x  n2:  15n3  and  n2 

have  the  same  order.  A 


PROBLEM  12.  Show  that  n\  has  smaller  order  than  nn ,  that  is,  «!  -<  nn . 

Solution.  First  note  that  n !  =  n(n  —  1  )(n  —  2)  •  ■  •  (3)(2)  <nnn-nn  =  n" 
so,  with  no  =  c  =  1  in  Definition  8.2.1,  it  follows  that  n\  =  0(nn).  On  the  other 
hand,  n "  ^  O(nl),  which  we  establish  by  contradiction.  If  nn  =  0(n\),  then, 
for  some  constant  c,  nn  <  cn\,  so  ^  <  c  for  all  sufficiently  large  n.  This  is 
impossible  because 
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8.2.5  REMARK 


nECEzmai 


EXAMPLES  13 


n 


n 


n\ 


n  n 
n  n  —  1 


n 

-  >  n 

1 


for  n  >  2. 


I 


Suppose  that  lim^oo  f{n)  and  lim^oo  g(n)  both  exist,  are  positive,  and 
lim^oo  g(n)  0.  Students  of  calculus  may  notice  that  if  /  =  O(g),  then 
<  c  for  all  sufficiently  large  n,  and  thus,  lim„_,;0O  <  c.  In  particular,  if 

lim„^oo  |^  =  oo,  then  /  ^  O(g). 

Students  familiar  with  limits  should  appreciate  the  next  proposition.  We  leave 
its  proof  to  the  exercises. 


Let  /  and  g  be  functions  N  — >  R. 

(a)  If  lim  =  o,  then  f  <  g. 

n-*oo  g(n ) 

(b)  If  lim  ^  ^  =  oo,  then  g  <  f  ■ 

n^oo  g(n) 

f  (n  ^ 

(c)  If  lim  - =  L  for  some  number  L  ^  0,  then  /xj. 

"->0°  g(n) 


n  +  1  ^ 

•  Since  lim  — r—  =  0,  n  +  1  -<  n  ,  as  earlier  noted. 

n-^oo  nl 

2  n2  -  3n  +  6  „  , 

•  Since  lim  - r - =  2,  2 n2  —  3n  +  6  x  n1.  A 

n-*  oo  nl 


We  have  seen  that  15n3  x  n3  and  2«2  —  3n  +  6  x  n2.  These  examples  illustrate 
a  general  principle  whose  proof  we  first  describe  in  a  specific  case. 


PROBLEM  14.  Let  /(«)  =  5 n2  —  6n  +  3  and  g(n)  =  n 2 .  Show  that  /  and  g 
have  the  same  order. 

Solution.  For  n  >  1,  we  have  5 n2  —  6n  +  3  <  5n2  +  6n  +  3  <  5n2  +  6«2  +  3n2  = 
14«2  so,  with  c  =  14  and  no  =  1  in  Definition  8.2.1,  we  see  that  /  =  O(g).  On 
the  other  hand, 

f(n)  =  5n2  —  6/i  +  3  =  «2(5 - b  >  n2(5 - 

V  n  nz/  V  n  n- / 

Since 

6  3 

lim  -  =  lim  —  =  0, 

n—^oo  fi  n-±oo  n ^ 

there  exists  an  integer  n o  such  that  both 


6 

n 


3  5 

n2  4 


and 
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for  all  n  >  no-  (Evidently,  no  =  5  will  suffice.)  It  follows  that 


so  that 


6  5  J  3  5 

—  >  —  and  — »  >  — 


f(n )  =  5 n~  —  6n  +  3  >  n2( 5  —  —  —  —)  =  n2( 5 - )  =  -n~ 

_  4  4  2  2 


PROPOSITION 


Thus  g(n)  =  n2  <  f /(«)  so,  with  c  =  |  in  Definition  8.2.1,  we  see  that  g  = 
O(f).  Thus,  /  x  g.  ‘  I 

Here  is  the  general  result. 

A  polynomial  has  the  same  order  as  its  highest  power;  that  is,  if  f(n )  =  atn'  + 
a,_in'~'  +  ■  •  ■  +  ao  is  a  polynomial  of  degree  t,  then  f(n)  x  n(. 


Proof  This  follows  immediately  from  part  (c)  of  Proposition  8.2.6,  but  we  provide  here  a 
direct  proof  which  does  not  require  any  knowledge  of  calculus.  Our  proof  mimics 
the  solution  to  Problem  14. 

For  any  natural  number  n,  the  triangle  inequality  gives 

l/(«)l  <  \a,n'\  +  |a,-in'_1|  4 - f  \a{n\  +  |n0| 

=  \a,\n‘  +  \at-\\n‘~'  4 - h  \a\\n  4-  |flo| 

<  \a,\n'  4-  \a,-\ \n'  4 - f  \ai\n1  +  \ao\n’ 

=  (\a,  |  +  |fl/-i  |  4-  •  •  •  4- 1 a 1 1  4-  Iflol)^- 

Thus,  \f(n)\  <  cn‘  for  any  n  >  no  =  1  with  c  =  |af  |4-|flf-i  |4 - b|«i  1 4- |«ol-  By 

Definition  8.2.1,  we  see  that  f(n)  =  0(n‘).  Now  we  prove  that  n'  =  0(f(n)). 

Since  /  x  — /  for  any  / :  N  — »■  R  (see  Exercise  15  at  the  end  of  this  section), 
we  may  assume  that  a,  >  0.  We  have 

/ (n)  —  n  (a,  4 - 1 - 5 — b  •  •  •  4 — -) 

n  nl  nl 

>n  (a, - 5 - Ti¬ 
ll  n~  iv 

As  n  gets  large,  the  numbers 

Iflf-il  \at-2\  l«ol 


approach  0,  so  all  of  them  are  eventually  smaller  than  the  positive  number  — . 
Thus,  there  is  an  no  such  that  for  any  n  >  n 0, 


Therefore, 


_«/_  I  at-. 

It  ’  n2 


\at-2\ 

n2 

at 

<  — , 

2 1 

|aol  at 

nl  2 1 

\a,-2\ 

at 

at 

> - 

It 
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and 


Thus,  n’  <  — /(«)  for  all  n  >  no  and,  with  c  =  —  in  Definition  8.2.1,  we  obtain 
at  at 

n1  =  0(f{n)),  as  required.  I 

In  the  exercises,  you  are  asked  to  verify  that  the  relation  same  order  defines 
an  equivalence  relation  on  the  class  of  functions  N  —*■  R;  that  is,  the  relation  is 

reflexive:  /  x  /  for  all  /; 

symmetric:  if  /  x  g,  then  g  x  /;  and 
transitive:  if  /  x  g  and  g  x  h,  then  /  x  h. 

(See  Section  2.4.) 

EXAMPLE  1 5  Since  5n2  —  6n  +  3  x  n2  and  2n2  —  3n  +  6  x  n2,  it  follows  that  5n2  —  6n  +  3  and 
2rr  —  3n  +6  have  the  same  order,  something  which  is,  perhaps,  not  so  easy  to  see 
directly  from  the  definition  of  Big  Oh.  In  fact,  transitivity  and  Proposition  8.2.7 
imply  that  any  two  polynomials  of  the  same  degree  have  the  same  order.  A 


There  are  also  some  natural  connections  between  the  binary  relations  x  and 
-<.  For  example,  if  /  -<  g  and  g  x  h,  then  /  -<  h.  We  have  seen  that  n  +  1  <  n2 
and  n2  x  5 n2  +  6 n  +  3,  so  we  may  conclude  that  n  +  1  -<  5 n2  +  6n  +  3. 

Proposition  8.2.7  says  that  any  polynomial  has  the  same  order  as  its  highest 
power.  The  next  proposition  shows  that  different  powers  of  n  never  have  the  same 
order. 


8.2.8  PROPOSITION 


Suppose  a  and  b  are  real  numbers  and  1  <  a  <  b.  Then  na  -<  nb. 


Proof  Take  no  =  1  and  c  =  I  in  Definition  8.2.1  to  obtain  na  =  0(nh).  To  see  that 
the  reverse  is  not  true,  suppose  nh  <  cnu  for  some  constant  c  and  all  sufficiently 
large  n.  Dividing  by  n“ ,  we  have  nb~a  <  c.  The  left-hand  side  is  a  positive  power 
of  n  and,  hence,  for  large  «,  it  is  bigger  than  c.  Thus,  nb  is  not  0(na).  I 

Students  of  calculus  should  note  that  Proposition  8.2.8  follows  directly  from 
part  (b)  of  Proposition  8.2.6  because 

r  nb 
lim  —  =  co. 

n — >  og  fia 

As  a  particular  case  of  Proposition  8.2.8,  we  see  that  1  x  n°  for  any  positive 
power  na  of  n.  Thus,  we  begin  to  see  that  the  idea  of  order  permits  the  definition 
of  a  hierarchy  among  functions.  For  instance,  we  now  know  that 

1  x  s/n  <  n  <  n-Jn  <  n2  <  n2'01  -<■••. 

A  straightforward  induction  argument  shows  that  n2  <  2"  for  n  >  5  (see 
Exercise  9  of  Section  5.1).  It  follows  that  2"  ^  O(n);  otherwise,  we  would  have 
2”  <  cn  and,  hence,  n2  <  2n  <  cn  for  some  constant  c  and  sufficiently  large  n. 
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This  implies  n  <  c  which,  for  large  n,  is  not  true.  Thus,  2n  ^  0(n).  Since 
n  =  0(2 "),  we  conclude  that  ri  <  2". 

In  a  similar  fashion,  we  can  prove  that  n3  <  2"  for  n  >  10  and,  hence,  that 
n2  <  2".  (See  Exercise  14  at  the  end  of  this  section.)  In  fact, 

na  <  //' 

for  any  real  numbers  a  and  b  with  b  >  1.  This  follows  directly  from  Proposi¬ 
tion  8.2.6  since 

b" 

lim  —  =  oo 

>00  7 la 

for  any  real  constants  a,b  with  b  >  1.  (For  students  familiar  with  l’Hopital’s 
Rule,  this  result  will  be  elementary.) 

Our  examination  of  the  relative  orders  of  common  functions  continues. 


8.2.9  PROPOSITION 


\ogbn  -<  n  for  any  real  number  b,  b  >  1. 


Proof  We  apply  logb  to  the  inequality  n  <  bn ,  which  is  true  for  sufficiently  large  n, 
obtaining  logbn  <  n.  With  c  =  1  in  Definition  8.2.1,  we  see  that  log^n  =  0(n). 
Now  n  /  0(\ogbn)  since  n  <  c  log/;  n  implies  b"  <  nc  (applying  the  function 
x  i-»  bx)  which,  for  large  n,  is  not  true.  Thus,  log^n  <n.  I 

In  the  Exercises,  you  are  asked  to  show  that  any  two  logarithm  functions 
(with  bases  larger  than  1)  have  the  same  order;  that  is, 

log an  x  log hn 

for  any  a,b  >  1.  For  this  reason,  we  now  frequently  omit  mention  of  the  base 
of  a  logarithm  because  most  assertions  about  order  which  involve  logarithms  are 
true  for  any  base  b  >  1 . 

Propositions  8.2.8  and  8.2.9.  together  with  the  facts  that 

n\  -<  nn 


(Problem  12)  and 


b"  x  r\ 

(Exercise  13  at  the  end  of  this  section),  allow  our  hierarchy  of  functions  to  be 
extended  considerably.  For  a,  b  >  1,  we  have 

1  x  log  n  <  n  <  na  <  b"  <  n  l  -<  n" . 


Figure  8.4  shows  the  graphs  of  the  natural  logarithm  and  of  the  functions 
defined  by  f(x)  =  x,  fix)  —  x2,  and  fix)  =  3X  and  provides  dramatic  evidence 
of  the  relative  orders  and  rates  of  growth  of  these  functions.  In  particular,  note  the 
very  slow  growth  of  the  logarithm  and  the  very  rapid  growth  of  the  exponential. 

How  does  all  of  this  relate  to  efficiency  of  algorithms?  Suppose  A  and  23 
are  algorithms  which  accomplish  the  same  task.  If  the  problem  to  which  these 
algorithms  is  applied  is  “small,”  they  may  work  so  quickly  on  even  a  microcom¬ 
puter  that  differences  in  performance  are  not  noticeable.  Few  of  us  can  detect  the 
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y 


-5 


Figure  8.4  Some  common  complexity  functions. 


difference  in  a  running  time  of  10~6  seconds  and  one  that  is  100  times  as  fast! 
As  the  problem  grows  in  size,  however,  differences  in  running  time  become  very 
significant.  Suppose  A  has  complexity  0(n)  while  3  is  0(n3).  With  n  =  5000 
and  a  computer  which  performs  a  million  operations  per  second,  algorithm  A 
requires  on  the  order  of  5000  x  10~6  =  0.005  seconds  while  3  requires  about 
50003  x  1CT6  =  125,000  seconds,  or  about  a  day  and  a  half!  Some  algorithms 
may  involve  so  many  operations  that  they  are  simply  impractical  to  implement.  We 
become  very  concerned,  therefore,  with  the  relative  sizes  of  complexity  functions 
when  n  is  large.  This  is  precisely  what  the  concept  of  order  is  designed  to  measure. 

If  complexity  functions  /  and  g,  which  correspond  respectively  to  algorithms 
A  and  3,  have  the  same  order  (/  x  g),  then  A  and  3  are  considered  to  be 
very  similar  with  respect  to  efficiency,  whereas,  if  /  has  smaller  order  than  g, 
then  A  is  more  efficient  than  3.  An  algorithm  whose  complexity  function  has 
the  same  order  as  n  is  said  to  be  linear  or  to  run  in  linear  time.  An  algorithm 
with  complexity  function  the  order  of  n~  is  called  quadratic.  More  generally, 
algorithms  with  complexity  functions  of  order  na  for  some  positive  real  number 
a  are  polynomial  or  said  to  run  in  polynomial  time.  Algorithms  with  complexity 
functions  of  order  log  n  are  logarithmic  and  algorithms  with  complexity  of  order 
a "  for  some  a  >  0  are  exponential. 

We  have  seen  that  an  algorithm  which  runs  in  logarithmic  time  is  more  effi¬ 
cient  than  a  linear  algorithm,  which,  in  turn,  is  more  efficient  than  a  quadratic 
algorithm.  Any  polynomial  algorithm  is  more  efficient  than  one  which  is  expo¬ 
nential. 

Here  a  word  of  caution  is  in  order.  Saying  that  an  exponential  algorithm  A 
is  less  efficient  than  a  polynomial  algorithm  3  which  runs  in  polynomial  time 
means  only  that  when  n  is  sufficiently  large,  algorithm  A  takes  more  time.  For 
small  values  of  //,  the  differences  may  be  inconsequential;  indeed,  A  may  even  be 
faster!  For  example,  if  f(n)  =  100«3  and  g(n)  =  2",  then  f  <  g,  but  we  must 
have  n  >  20  for  the  polynomial  algorithm  to  be  faster.  Similarly,  f(n )  =  n2  is 
“better”  than  g(n)  =  because,  once  n  becomes  sufficiently  large,  an  algorithm 
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Table  8.1  Approximate  time  required  to  run  an  algorithm  requiring  f(n) 
operations  assuming  a  machine  speed  of  1  million  operations  per  second. 

tin) n 

10 

100 

5000 

25 

25  x  10“6  s 

25  x  10“6  s 

25  x  10“6  s 

log2  n 

3.3  x  10  6  s 

6.6  x  10“6  s 

1.2  x  10~5  s 

n 

10”5  s 

10-4  s 

5  x  10~3  s 

n  log2  n 

3.3  x  10-5  s 

6.6  x  10“6  s 

6.1  x  10“2  s 

n2 

© 

1 

•4^ 

on 

.01  s 

25  s 

2n 2  +  5n  +  100 

3.5  x  10“4  s 

.021  s 

50  s 

n3/100 

o 

1 

L/i 

cm 

.01  s 

21  min 

2" 

10~3  s 

4.0  x  1016  yr 

4.5  x  101491  yr 

with  the  complexity  function  /  will  take  less  time.  This  is  clearly  not  the  case 
for  small  values  of  n,  however. 

These  remarks  are  illustrated  in  Table  8.1,  which  compares  a  number  of  com¬ 
plexity  functions  /(«)•  The  entries,  which  are  only  approximate,  show  the  time 
required  to  run  an  algorithm  with  f (n)  operations  on  a  machine  performing  one 
million  operations  each  second  (like  a  snail).  The  functions  are  ordered  in  decreas¬ 
ing  order  of  efficiency  for  sufficiently  large  n  (except  for  n2  and  2n 2  +  5 n  +  100, 
which  have  the  same  order).  From  this  table,  we  see  that  an  algorithm  with 
f(n)  =  25  operations  runs  much  more  quickly  than  an  algorithm  with  f(n )  =  n 

3 

or  f(n )  =  jqq,  once  n  is  big  enough.  While  for  n  as  large  as  100,  an  algorithm 

with  f(n)  =  runs  more  quickly  than  one  with  f(n)  =  2 n2  +  5 n  +  100,  it 
is  apparent  that  once  n  —  5000,  the  situation  is  reversed.  Note  how  quickly  an 
algorithm  with  log2  n  operations  runs  and,  conversely,  how  slowly  one  with  2" 
operations  runs  (for  large  n).  (When  viewing  the  numbers  in  the  last  line  of  the 
table,  it  is  interesting  to  bear  in  mind  that  the  universe  has  been  around  for  about 
7  x  109  years!)  Algorithms  with  complexity  functions  which  are  0(log2  n )  are 
highly  efficient  whereas  exponential  algorithms  are  so  slow  as  to  be  useless  from 
a  practical  point  of  view,  once  n  is  any  reasonable  size  at  all. 

In  recent  years,  following  the  initiative  of  Stephen  Cook  of  the  University  of 
Toronto,  computer  scientists  have  identified  a  collection  of  problems  for  which 
no  polynomial  time  algorithm  is  currently  known.  These  problems,  which  are 
equivalent  in  the  sense  that  the  existence  of  a  polynomial  time  algorithm  for 
one  of  them  would  imply  polynomial  time  algorithms  for  them  all,  are  called 
NP-complete.  The  NP  stands  for  nondeterministic  polynomial  time.  Some  of  the 
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most  interesting  NP-complete  problems  are  related  to  graphs  and  will  be  identi¬ 
fied  in  later  chapters.  (See,  for  instance,  the  “Traveling  Salesman’s  Problem”  in 
Chapter  10  or  the  problem  of  coloring  the  vertices  of  a  graph,  which  is  discussed 
in  Chapter  14.) 

We  conclude  this  section  with  a  discussion  of  the  complexity  of  some  algo¬ 
rithms  which  will  already  be  familiar  to  the  reader. 

PROBLEM  16.  Show  that  the  number  of  single-digit  additions  required  in  the 
addition  of  m  /(-digit  numbers  does  not  exceed  m(2n  +  m).  Conclude  that  the 
complexity  of  such  addition  is  0(n2)  if  m  <  n  and  0(m2)  otherwise. 

Solution.  This  addition  problem  has  the  following  form. 

r  =  (r„_i/-„_2...nro)io  1 

C  —  (cn—  |  Ch_2 

t>  =  {bn~\bn—2 

a  =  (fln-ifln-2 

We  use  repeatedly  the  result  of  Problem  7  that  the  addition  of  two  //-digit  numbers 
requires  at  most  2n- 1  single-digit  additions.  Thus,  the  addition  of  a  and  b  requires 
at  most  2 n  —  1  additions.  The  sum  is  a  number  with  at  most  n  +  I  digits.  The 
addition  of  this  number  and  c  (which  we  think  of  as  an  n  +  1 -digit  number 
beginning  with  0)  requires  another  2 (n  +  1)  —  1  =2/7  +  1  operations,  at  most, 
and  gives  a  number  with  at  most  n  +  2  digits.  Adding  the  next  number  requires 
at  most  2 (n  +  2)  -  1  =  2n  +  3  operations.  Continuing  this  way,  the  final  addition 
with  r  requires  at  most  2[/z  +  (m  —  2)]  —  1  =2/7  +  2/77—5  operations.  Altogether, 
the  addition  requires  at  most 

(2/7  —  1 )  +  (2/7  +  1)  +  (2/7  +  3)  +  •  ■  ■  +  (2/7  +  2//?  —  5) 

operations.  This  is  the  sum  of  an  arithmetic  sequence  with  /??  —  1  terms,  first  term 
a  =  2/7  —  1  and  common  difference  d  —  2,  so  it  equals 

^[2(2 n  -  l)  +  (777  -  2)2]  =  (777  -  1 ) (2/7  -  1)  +  (777  -  2)(m  -  1) 

=  (/77  —  1)(277  +  777  —  3) 

<  777(2/7  +  /??). 

[See  formula  (7)  on  p.  166.]  If  777  <  n,  the  number  of  operations  is  at  most 
77(2/7  +  n)  =  ?z(3/7)  =  3//2,  so  the  addition  has  complexity  0(n2).  If  77  <  m, 
777(2/7  +  m)  <  /?7(2/?7  +  7/?)  =  3//7“,  so  the  complexity  is  (D(m2).  I 

PROBLEM  17.  Show  that  the  usual  method  of  multiplying  two  77 -digit  numbers 
has  complexity  0{n2).  Count  as  equivalent  operations  the  addition  and  multipli¬ 
cation  of  two  single-digit  numbers.4 

4There  are  faster  methods  of  multiplying.  In  1962,  Anatolii  Karatsuba  found  a  method  which  is  Otn1,58). 
By  December  1988,  the  fastest  known  method  was  (9(n(logH)(loglogn)).  No  algorithm  better  than  O(n) 


...ci c0)  10  m  numbers 

•  •  ■  bib0)\o 
■  ■  ■  a\ao)io  . 
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Solution.  In  the  multiplication  of  two  n -digit  numbers  by  the  usual  method, 
there  are  n  multiplications  of  an  n-digit  number  by  single-digit  numbers.  Each 
such  multiplication  requires  at  most  2 n  —  1  operations.  (See  Exercise  2.)  Then  it 
is  necessary  to  add  n  numbers,  each  of  length  at  most  2 n.  Using  the  result  of 
Problem  16,  this  requires  at  most  an  additional  n[2(2n)  +  n  \  =  n(5n)  operations 
for  a  total  not  exceeding 

n(2n  —  1)  +  n(5n)  x  n2, 

by  Proposition  8.2.7.  I 


PROBLEM  18.  One  straightforward  way  to  compute  a  positive  integral  power 
of  a  real  number  x  is  to  compute  x{x)  =  x 2,  then  x(x2)  —  .v3,  then  ,v(.r3)  =  x4, 
and  so  forth. 

(a)  Find  the  complexity  of  this  algorithm  in  terms  of  the  number  of  multiplica¬ 
tions. 

(b)  Find  an  algorithm  which  is  more  efficient  than  the  one  described  and  indicate 
why  your  algorithm  is  more  efficient. 

Solution. 

(a)  This  algorithm  requires  n  -  1  multiplications. 

(b)  A  seemingly  more  efficient  way  to  compute  xn  is  to  compute  the  powers 
x,x2,  (x2)2  =  x4,  C-v 4 ) 2  =  .V8,  and  so  on,  stopping  upon  calculation  of  „v2  , 
where  2*-'  <  n  <  2k.  This  requires  k  multiplications.  Now  consider  how  the 
Russian  peasant  method  computes  the  product  n  =  n  x  1 ,5  On  the  left,  n  is 
divided  by  2  a  total  of  k  times,  while  on  the  right,  1  is  multiplied  by  2  a  total 
of  k  times.  The  product  n  is  determined  as  the  sum  of  certain  numbers  on  the 
right,  that  is,  as  the  sum  of  at  most  k  powers  of  2.  Then  xn  is  the  product  of 
the  corresponding  powers  of  x.  For  example, 

37  =  1  +  22  +  25,  so  x31  =  jc1  jc22.v25 


and  generally, 

/7  =  ^2\  so  xn=xZ2ki  =x2k'x2h-.-- 

for  which  at  most  a  further  k  multiplications  are  required.  Thus,  after  at 
most  2k  multiplications,  we  can  compute  xn .  Applying  log2  to  the  inequality 
2*-1  <  n,  we  have  k—\<  log2 n  and  so  k  <  1  -F log2  n.  Thus,  the  number  of 
multiplications  required  this  time  is  0(logn).  Since  logn  -<  n,  the  proposed 
algorithm  is  more  efficient  than  the  one  described  in  (a).  I 


is  possible.  We  refer  the  interested  reader  to  "Ramanujan,  Modular  Equations,  and  Approximations  to  Pi  or 
How  to  Compute  One  Billion  Digits  of  Pi,”  by  J.  M.  Borwein,  P.  B.  Borwein,  and  D.  H.  Bailey,  American 
Mathematical  Monthly  96  (1989),  201-219,  and  to  the  bibliography  of  this  article. 

■’The  Russian  peasant  method  can  be  very  useful! 
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PROBLEM  19.  Find  a  Big  Oh  estimate  of  the  complexity  of  the  Euclidean 
algorithm  in  terms  of  the  number  of  divisions.  (Given  integers  a  and  b,  a  “division” 
of  a  by  b  means  the  determination  of  q  and  r,  0  <  r  <  b,  such  that  a  =  bq  +r.) 

Solution.  Given  positive  integers  a  and  b  with  b  <  a,  recall  that  the  Euclidean 
algorithm  finds  the  greatest  common  divisor  of  a  and  b  by  the  following  sequence 
of  divisions, 

a  =  q\b  +  n,  r\  <  b, 

b  —  <?2n  +  n,  r2<ri, 

r\  =  q^n  +  r3,  r3  <  r2, 

n-2  =  qkn-\+n,  n<n- 1, 

n- 1  =  qk+m, 

the  process  terminating  with  the  first  zero  remainder  rk+\ .  The  gcd  is  then  rk,  the 
last  nonzero  remainder.  (See  Section  4.2.)  The  number  of  divisions  and,  hence, 
the  number  of  basic  operations  is  k  + 1.  How  big  is  kl  Note  that  qk+\  >  1  because 
rk  <  rk-i  and,  since  the  other  quotients  q,  are  positive  integers,  we  have  q,  >  1 
for  1  <  i  <  k.  Therefore, 

n  >  1 

n-l  >2rk>2 

n-2  >  n- 1  +n  >  3 

n- 3  >  n-2  +  rk-\  >3  +  2  =  5 


r  i  >  r2  +  r3 
b  >  r\  +  r2. 

Consider  the  two  sequences 

L  n,  n- 1,  n-2,  n- 3,  ■■■,  n,  b 

1,  1,  2,  3,  5, 

Each  term  in  the  first  sequence  is  at  least  as  big  as  the  corresponding  term  in  the 
second  sequence.  In  particular,  b  is  at  least  as  large  as  the  (k  +  2)th  term  in  the 
second  sequence.  The  second  sequence,  however,  is  the  Fibonacci  sequence  (see 

5.2.3),  whose  /cth  term  is  the  nearest  integer  to  ( 1+2^  j  •  Thus, 


from  which  it  follows  that  ^  1+2V^  <  Ab  +  B,  where  A  and  B  are  positive 

constants  independent  of  b.  Since  -+^  =  C  is  also  a  constant  and  Ck  <  Ab+  B, 
an  application  of  the  logarithm  (to  any  base)  gives  k  log  C  <  log(Ab  +  B).  Since 
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log(An  -f  B)  x  logn  for  A,  B  >  0  (see  Exercise  22),  it  follows  that  the  number 
of  operations  required  is  (9  (log  b).  The  Euclidean  algorithm  is  very  efficient.  I 


Answers  to  Pauses 


2.  Take  n  o  =  1  and  c  —  1. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Consider  the  distance  algorithm  described  in 
Example  6.  Assume  that  addition  and  subtraction  require 
the  same  amount  of  time,  that  addition  is  twice  as  fast 
as  multiplication,  and  that  multiplication  is  ten  times  as 
fast  as  the  square  root  operation.  Find  the  complexity 
function. 

2.  Show  that  the  number  of  operations  required  to  mul¬ 
tiply  an  /1-digit  number  by  a  single-digit  number  does 
not  exceed  2n  —  1.  Count  the  addition  and  the  mul¬ 
tiplication  of  two  single-digit  numbers  as  equivalent 
operations. 

3.  [BB]  Show  that  the  number  of  operations  required  to 
divide  an  n-digit  number  by  a  single-digit  number  does 
not  exceed  3 n.  Count  as  equivalent  the  following  oper¬ 
ations: 

•  the  multiplication  of  two  single-digit  numbers, 

.  the  division  of  a  single-digit  number  or  a  two-digit 
number  by  a  single-digit  number,  provided  the  quo¬ 
tient  has  only  a  single  digit  (e.g.,  37-f9  but  not  374-3), 
and 

•  the  subtraction  of  two  two-digit  numbers  provided  the 
difference  has  only  a  single  digit  (e.g.,  37  —  32,  but 
not  37  —  22). 

4.  Consider  the  problem  of  finding  the  minimum  number 
in  a  set  of  n  real  numbers.  Find  an  algorithm  for  this 
and  find  its  complexity  function.  Take  comparison  in 
the  form  “x  <  y?”  as  the  basic  operation. 

5.  Let  x  be  a  real  number  and  n  a  positive  integer.  Consider 
the  following  two  algorithms  for  calculating  x2  . 

ALGORITHM  A  : 

Step  1.  Set  a  —  1 . 

Step  2.  For  i  =  1  to  2",  replace  a  by  xa. 

Step  3.  Output  a. 


ALGORITHM  3: 

Step  1.  Set  a  =  x. 

Step  2.  For  i  =  1  to  n,  replace  a  by  a2. 

Step  3.  Output  a. 

Find  complexity  functions  for  each  of  these  algorithms 
and  explain  why  3  is  more  efficient.  Assume  that  mul¬ 
tiplication  is  the  basic  operation. 

6.  (a)  [BB]  Justify  the  statement  made  in  Section  8.1  that 

Horner’s  method  of  polynomial  evaluation  requires 
fewer  arithmetic  operations  than  the  more  obvious 
method  described  in  Problem  2.  Assume  that  pow¬ 
ers  of  x  cannot  be  stored. 

(b)  Repeat  part  (a)  assuming  it  is  possible  to  store  pow¬ 
ers  of  x. 

7.  Use  Definition  8.2.1  to  show  that  /  =  Oig)  in  each  of 
the  following  cases  of  functions  /,j:N->R. 

(a)  [BB]  f(n)  =  5n,  g(n)  =  n 3 

(b)  / (n)  =  17n4  +  8n3  +  5 n2  +  6/t  +  1 ,  g(n)  =  n4 

(c)  [BB]  f(n)  =  8n3  +  4 n2  +  5n  +  1,  g(n)  =  3 n4  + 
6  n2  +  8//  +  2 

(d)  f{n)  =  2",  gin)  =  3" 

(e)  f(n)  =  2/r  +  3//  +  1 ,  g(n)  —  n2  +  1 

(f)  fin)  =  2n2  —  3n  +  5,  g(n)  —  n2  —  In 

(g)  fin )  =  Iog2/r\  gin)  =  n 

8.  [BB;  (a),  (c)]  For  each  part  of  Exercise  7  decide  whether 
/  -<  g  or  /  x  g.  Use  Propositions  8.2.7  and  8.2.8,  if 
appropriate. 

9.  If  /,  g,  h\  N  -*■  R,  /  =  O(h)  and  g  =  0{h),  show  that 
f  +  g  =  Oih). 

10.  (a)  Show  that  O  defines  a  transitive  operation  on  the 
class  of  functions  N  — >  R. 

(b)  [BB]  Prove  that  -<  defines  a  transitive  relation  on 
the  class  of  functions  N  -4  R. 
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11.  [BB]  Let  /,  g  and  h  be  functions  N  ->  R  and  suppose 
that  /  x  g  and  g  <  h.  Prove  that  f  <  h. 

12.  [BB]  Show  that  an  -<  bn  if  0  <  a  <  b. 

13.  (a)  [BB]  Show  that  2"  -<  n\. 

(b)  Show  that  bn  <  n\  for  any  b  >  1. 

14.  (a)  Prove  that  n 3  <  2"  for  all  n  >  10. 

(b)  Use  the  result  of  part  (a)  to  show  that  n2  •<  2". 

15.  [BB]  Let  / :  N  — »  R  be  a  function  and  O^leR.  Show 
that  kf  x  /. 

16.  Let  A  be  the  set  of  all  functions  N  — *■  R  and,  for 
f,geA,  define  /  -<  g  as  in  Definition  8.2.4. 

(a)  Does  -<  define  a  partial  order  on  A? 

(b)  For  f,geA,  define  /  :<  g  if  /  -<  g  or  /  =  g.  Is 
(A,  <)  a  partially  ordered  set? 

17.  Show  that  x,  as  defined  in  Definitions  8.2.4,  defines  an 
equivalence  relation  on  the  class  of  functions  N  R. 

18.  [BB]  Use  any  of  the  results  of  this  section  to  show  that 
n  log2  n  -<  n2. 

19.  Find  a  function  in  the  list 

«2,  1,  «3,  n  log  77,  log n,  «4,  2",  nn ,  n!,  n ,  n" ,  n5 
which  has  the  same  order  as 

(a)  /(«)  =  3/7 4  +  6 n2  —  8/7  —  5 

(b)  [BB]  /(/?)  =  n  logn  +  n2 

(c)  f(n)  =  5 

(d)  f(n)  =  2"  +4 nn 

(e)  f(n )  =  3 /?!  —  17n4 

(f)  f(n)  —  n  +  logn 

Justify  your  answers  appealing  to  any  of  the  results  in 
this  section  that  prove  helpful. 

20.  Repeat  Exercise  19  for  each  of  the  following  functions: 

(a)  [BB]  f(n)  =  48«5  —  lnA  +  6n3  +  5 n2  —  2/7  —  7 

(b)  f(n)  =  T  +  375/7 1990 

(c)  f  (n)  ~  7Tn  +  nn 


(d) 

(e) 

(f) 

(g) 


/ (n)  =  3/7  +  5/7  log  /7  +  In 2 

/(//)  =  2  +  4  +  6+ - 1-2/7 

f(n)  =  2  +  4  +  8  +  --  -+2" 
(5/?  +  l)(n2  log 2  n) 


fin) 


+  4/7 


21.  [BB]  Show  that  logfl  /?  x  log;,  n  for  any  real  numbers 
a,b  >  1. 

22.  Suppose  A  and  B  are  positive  real  numbers.  Show  that 
log(An  +  B)  and  log/7  have  the  same  order.  (Use  log2, 
for  instance.) 

23.  [BB]  Show  that  log//!  =  Oinlogn). 

24.  (a)  Find  a  Big  Oh  estimate  of  the  complexity  of  the 

algorithm  for  writing  an  integer  a  in  base  2  in  terms 
of  the  number  of  divisions.  (See  Section  4.1.)  As  in 
Problem  19,  by  a  “division”,  we  mean  the  determi¬ 
nation  of  q  and  r  such  that  a  given  integer  b  = 
2  q  +  r. 

(b)  Suppose  a  division  counts  as  three  basic  operations. 
How  does  this  affect  your  answer  to  (a)? 

25.  The  Russian  peasant  method  is  used  to  multiply  two 
//-digit  numbers. 

(a)  Find  a  reasonable  upper  bound  for  the  number  of 
rows  required  by  this  process,  that  is,  for  the  number 
of  multiplications  and  divisions  by  2. 

(b)  Find  a  reasonable  upper  bound  for  the  number  of 
basic  operations  required  in  the  multiplying  and  divi¬ 
ding  by  2. 

(c)  Find  a  reasonable  upper  bound  for  the  number  of 
basic  operations  required  by  the  final  addition. 

(d)  Show  that  the  number  of  basic  operations  in  all  is 
Oin2). 

Count  as  basic  operations 

•  the  addition  of  two  single-digit  numbers, 

•  the  multiplication  of  two  single-digit  numbers,  and 

.  the  division  of  a  2-digit  number  by  a  single  digit 
number. 

26.  (For  students  of  calculus)  Prove  Proposition  8.2.6. 


8.3  Searching  and  Sorting 

Perhaps  the  two  most  important  tasks  required  of  modem  computers  are  the  search¬ 
ing  and  sorting  of  lists.  Since  they  are  required  by  so  many  other  processes  and 
often  must  be  repeated  many  times,  efficient  algorithms  to  search  and  sort  lists 
are  of  supreme  importance. 
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8.3.1  A  UNEAR 
SEARCH  ALGORITHM 


EXAMPLE  20 


8.3.2  REMARK 


A  car  manufacturer,  realizing  that  a  certain  part  is  a  potential  safety  hazard, 
needs  to  search  the  records  of  all  cars  sold  in  recent  years  for  those  which  contain 
this  part.  A  university  wishes  to  implement  a  system  whereby,  each  semester, 
students  will  register  in  order  of  decreasing  grade  point  average  (those  with  the 
highest  grade  point  average  register  first).  In  order  to  determine  the  numbers  of 
students  with  grade  point  averages  in  various  ranges,  student  records  will  have  to 
be  searched. 

How  often  is  it  required  to  sort  a  list  alphabetically?  Surely  this  is  a  key  step 
in  many  tasks,  from  the  creation  of  telephone  directories  to  dictionaries.  Although 
computers  can  work  very  quickly,  given  a  large  input  and  an  inefficient  algorithm, 
they  can  be  slowed  to  a  crawl  easily. 

In  this  section,  we  discuss  several  algorithms  for  searching  and  sorting  and 
discuss  the  complexity  of  each.  Our  approach  to  complexity  will  again  always  be 
worst  case.  While  we  would  hope  that  a  sorting  algorithm  would  not  take  long  to 
alphabetize  a  list  of  names  input  already  in  alphabetical  order,  our  concern  is  the 
length  of  time  such  an  algorithm  could  take  when  the  input  is  as  disorganized  as 
possible. 

We  begin  with  an  algorithm  which  searches  a  list  for  one  particular  entry. 

To  search  a  list  a \,  02 . a„  for  the  element  x, 

for  i  =  1  to  n 

if  x  =  ai,  output  “true”  and  set  /  =  2 n; 
if  i  ^  2 n,  output  “false”. 

The  algorithm  proceeds  in  an  obvious  way.  Setting  i  =  2 n  is  a  little  trick 
which  stops  the  loop  as  soon  as  x  has  been  found  and,  if  x  is  not  in  the  list, 
ensures  that  “false”  is  output  at  the  end. 

For  x  —  — 2  and  an  input  list 

6  0-21 
a\  a2  0-2  a\ 

the  algorithm  sets  i  =  1,  determines  that  x  ^  a\  =6,  and  sets  i  =  2.  Since 
x  7^  02  =  0,  it  sets  i  =  3.  Since  x  =  ai,  =  —2,  it  outputs  “true”  and  sets 
i  —  2 n  =  8.  Since  i  is  no  longer  in  the  range  from  1  to  n,  the  loop  stops.  Since 
i  =  2n.  the  algorithm  knows  that  x  was  found  and  does  not  output  “false”. 

On  the  other  hand,  for  x  =  2  and  the  same  input  the  algorithm  determines 
that  x  ^  a\  =  6,  x  02  =  0,  x  =  —2,  and  x  ^  04  —  1.  At  the  final  step, 
i  =  4  ^  2n,  so  the  algorithm  outputs  “false.”  ▲ 

In  the  worst  of  circumstances,  when  the  element  x  is  not  in  the  list  a\, 
02,  ■■  ■  ,  an,  the  loop  is  executed  n  times  and  the  algorithm  requires  n  +  1  compar¬ 
isons.  In  most  searching  and  sorting  algorithms,  there  are  few  arithmetic  calcula¬ 
tions.  The  best  basic  operation  with  which  to  measure  efficiency  is  a  comparison, 
that  is,  a  statement  of  the  form  x  =  a,  x  <  a,  x  <  a,  x  >  a,  or  x  >  a.  This  linear 
search  algorithm  requires  at  most  n  +  1  comparisons,  so  it  is  0(n). 

Big  Oh  notation  is  sufficiently  imprecise  that  we  can  be  relatively  careless  when 
determining  the  complexity  of  an  algorithm.  A  more  thoughtful  approach  to  the 
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8.3.3  A  BINARY 
SEARCH  ALGORITHM 


EXAMPLE  21 


linear  search  algorithm  might  have  led  us  to  count  two  comparisons  for  each  value 
of  i,  for  after  checking  if  x  —  a\ ,  the  algorithm  must  also  check  to  see  if  the  end 
of  the  list  has  been  reached,  that  is,  if  i  =  n.  Such  analysis  would  lead  us  to 
complexity  2 n  rather  than  n,  but,  as  we  know,  n  x2 n,  so  the  Big  Oh  estimate  of 
complexity  remains  O(n). 

It  is  often  the  case  that  a  list  to  be  searched  is  in  some  natural  order.  When 
searching  a  dictionary  for  the  meaning  of  obfuscate,  for  instance,  it  is  helpful  to 
know  which  way  to  turn  if  the  dictionary  is  open  at  trisect.  When  a  list  is  already 
in  order,  it  stands  to  reason  that  there  should  be  a  procedure  for  searching  which  is 
more  efficient  than  a  linear  search.  One  commonly  used  such  procedure  is  called 
binary  search.  The  Binary  Search  Algorithm  determines  which  half  of  an  ordered 
list  would  have  to  contain  the  sought-after  element,  then  which  half  of  that  half, 
and  so  on.  At  each  stage,  the  list  to  be  searched  is  one  half  the  length  of  the 
previous  list. 

To  search  for  an  element  x  in  an  ordered  list  a\  <  fl2  <••■<««,  proceed  as 
follows. 

while  n  >  0 

if  n  =  1  then 

if  jt  =  a  i  output  “true”  and  set  n  —  0; 
else  output  “false”  and  set  n  =  0; 
else 

set  m  = 

if  x  =  am ,  output  “true"  and  set  n  =  0; 
else  if  x  <  am 

set  n  =  m  and  replace  the  current  list  with  a i ,  . . .  ,  am\ 
else  replace  n  with  n  —  m  and  the  current  list  with  am+\,  . . .  ,  an. 
end  while 

This  is  the  first  time  we  have  encountered  the  “while/end  while”  construct, 
which  works  pretty  much  as  we  would  imagine.  In  this  particular  case,  as  long 
as  there  is  a  list  to  examine  (“while  n  >  0”),  the  algorithm  decides  in  which  half 
of  the  list  the  sought-for  element  may  reside  and  replaces  the  current  list  with 
this  half. 

With  x  =  12  and  the  input 

5  6  7  10  11  12  15  17  19  20 

a  i  02  ag  a4  05  0(,  a-/  ag  ag  ajo 

of  length  n  =  10  >  0,  the  algorithm  sets  m  =  [^rj  =  5.  Since  x  =  am  =  1 1  and 
x  <  am  are  both  false,  the  algorithm  replaces  n  by  n  —  m  =  5  and  replaces  the 
original  list  with 

12  15  17  19  20 
a  i  02  a  3  a  4  05. 

The  integer  n  is  still  positive,  so  the  algorithm  executes  the  statements  inside  the 
“while  . ..  end  while”  again.  Since  n  1,  it  sets  m  =  L§J  =  2  and,  because 
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x  <  ao  =  15,  it  sets  n  =  2  and  replaces  the  current  list  with 

12  15 

a  i  ciT. 

Now  n  is  still  positive,  so  the  algorithm  executes  the  statements  between  “while" 
and  “end  while”  again.  Since  n  ^  1,  the  algorithm  sets  m  =  |_fj  =  1.  Since 
x  —  a\  =  12,  the  algorithm  outputs  “true”  and  sets  n  =  0.  The  “while”  statement 
is  no  longer  true,  so  the  algorithm  terminates.  A 

To  determine  the  complexity  of  binary  search,  first  suppose  that  n  =  2k  is 
a  power  of  2.  The  algorithm  proceeds  by  progressively  halving  the  length  of  the 
list;  hence,  it  terminates  after  executing  the  statements  between  “while”  and  "end 
while”,  in  the  worst  case,  a  total  of  k  times.  This  involves  2k  +  2  comparisons, 
including  the  two  required  when  n  =  1.  If  n  is  not  a  power  of  2,  find  k  such  that 
2^_l  <  n  <  2k  and  increase  the  length  of  the  input  list  by  adding  terms  all  equal 
to  an  until  the  extended  list  has  length  2k.  This  list,  and  hence  the  original,  is 
searched  after  2k  +  2  comparisons.  Since  2*-1  <  n,  we  have  k  —  1  <  log2n,  so 
k  <  1  +  log2  n  and  2k  +  2  <  4  +  2  log2  n  x  log2  n.  The  binary  search  algorithm 
is  0( log2/i). 

We  now  turn  our  attention  to  sorting  algorithms.  A  sorting  algorithm  will  put 
a  list  of  numbers  into  increasing  order  or  a  list  of  words  into  alphabetical  order. 
We  begin  with  perhaps  the  simplest  of  all  sorting  algorithms,  the  bubble  sort. 
The  Bubble  Sort  Algorithm  makes  a  number  of  passes  through  a  list,  each  time 
interchanging  any  consecutive  pair  of  elements  which  are  not  in  proper  order.  On 
the  first  pass,  the  largest  element  is  shuffled  to  the  end.  On  the  next  pass,  the  next 
largest  element  is  shuffled  to  second  from  the  end.  In  general,  after  the  fcth  pass 
the  list  looks  like  this: 


a\,  ao, ....  ot <  an-k-\.\  <  •  •  •  <  an- 1  <  an 

s-  ^  s - 

unsorted  sorted 

Gradually,  the  largest  elements  “bubble”  to  the  end  in  order. 

EXAMPLE  22  Suppose  the  input  list  is  6,  3,  1, 4,  9,  2.  Initially,  the  first  two  numbers  are  com¬ 
pared  and  (in  this  case)  interchanged,  giving  the  list  3,6,  1,4,  9,  2.  Then  the 
second  and  third  elements  are  compared  and.  in  this  case,  interchanged,  giving 
3,  1,6,  4,  9,  2.  Next,  the  third  and  fourth  elements  are  compared  and  switched, 
giving  3,  1,  4,  6,  9,  2.  No  interchange  of  the  fourth  and  fifth  numbers  is  required. 
The  first  pass  is  completed  with  an  interchange  of  the  fifth  and  last  numbers, 
giving  3,  1,4,  6,  2,  9.  Notice  that  the  largest  number,  9,  has  bubbled  to  the  end. 

The  second  pass  through  the  list  is  like  the  first,  but  with  one  exception.  Since 
the  largest  element  is  now  at  the  end,  it  is  not  necessary  to  compare  the  second  last 
element  with  the  last.  Here  are  the  four  steps  of  the  second  pass  in  our  example. 
The  numbers  being  compared  at  each  stage  are  underlined. 

3,  1,  4,  6,  2,  9  -»  1,3, 4,  6,  2,  9  -»  1,  3,  A_6,  2,  9 
->  1,  3, 4,  6,  2,  9  — >  1,3,4,  2,6,  9 
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Notice  that  the  two  largest  numbers,  6  and  9,  are  now  at  the  end  of  the  list 
in  correct  order.  The  third  pass  is  like  the  second,  but  one  step  shorter  since  it  is 
not  necessary  to  compare  the  fourth  and  fifth  elements.  Here  are  the  three  steps 
of  the  third  pass. 

1,3,4,  2,  6,  9  — >•  1,3,4,  2,  6,  9 ->  1,3,4,  2,  6,  9  -»  1,3,  2,  4,  6,  9. 

The  fourth  pass  requires  two  comparisons, 

1,3,  2, 4, 6.  9  -»•  1,  X2, 4,  6,  9  -*  1,  2,  3, 4,  6,  9 
and  the  fifth  pass  just  one, 

1,2,  3,4,  6,9  — >  1,2,  3,4,  6,  9.  A 


8.3.4 

A  BUBBLE  SORT 

ALGORITHM 

In  the  general  description  of  the  Bubble  Sort  Algorithm  which  follows,  we  use 
the  term  swap  to  denote  the  exchange  of  values  of  two  variables.  For  instance, 
if  a  =  4  and  b  =  2  and  we  swap  a  and  b,  we  mean  that  henceforth,  a  =  2 
and  b  =  4.  In  the  description  of  any  sorting  algorithm,  we  always  assume  that 
the  elements  to  be  sorted  come  from  a  totally  ordered  set,  for  example,  the  real 
numbers  or  the  words  of  a  language  (with  lexicographic  ordering). 

To  sort  n  elements  a i ,  #2, . . .  , an  from  least  to  greatest, 
for  i  =  n  —  1  down  to  1 , 
for  j  =  1  to  i 

if  aj  >  cij+\,  swap  a/  and  oy+i. 


The  algorithm  consists  of  a  loop  inside  a  loop.  In  the  “outer’"  loop,  we  set 
i  =  n  —  1  and  then  execute  the  “inner”  loop  from  j  =  1  to  j  =  n  —  1.  Then 
we  drop  i  to  n  -  2  and  execute  the  inner  loop  from  j  =  1  to  j  =  n  —  2, 
and  so  on  until  i  =  1,  at  which  point  the  inner  loop  is  executed  just  once, 
with  j  =  1.  The  Bubble  Sort  Algorithm  is  popular  with  beginning  programmers 
because  it  is  short  and  simple  to  implement  in  any  language.  Let  us  make  sure 
we  understand  it. 


EXAMPLE  23  For  the  input  list 

4  0  3  2 

a\  02  ai  ai 

we  have  n  =  4.  The  algorithm  sets  i  —  n  —  1=3  and  runs  the  inner  loop  from 
j  =  1  to  3.  With  j  =  1,  we  have  a\  >  02,  so  a\  and  02  are  swapped,  giving  the 
list 

0  4  3  2 

01  02  03  04. 

With  j  =  2,  we  have  02  >03,  so  02  and  03  are  swapped,  giving 

0  3  4  2 

O]  02  03  04. 

With  j  =  3,  we  have  03  >  04,  so  03  and  04  are  swapped,  giving 

0  3  2  4 

a  1  02  03  04. 
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EXAMPLE  24 


Having  completed  the  inner  loop,  the  algorithm  returns  to  the  outer,  drops  i  to 
i  =  2,  and  executes  the  inner  loop  from  /  =  1  to  /  =  2.  With  j  =  1,  the 
inequality  a\  >  ai  is  not  true,  so  there  is  no  swap.  With  j  =  2,  we  have  ci2  >  a?,, 
so  a2  and  0.3  are  swapped,  giving 

0  2  3  4 

a\  02  03  04. 

Again,  the  algorithm  returns  to  the  outer  loop,  drops  i  to  i  —  I  and  executes  the 
inner  loop  just  for  j  =  1.  Since  a\  >  02  is  false,  there  is  no  swap.  The  algorithm 
now  terminates.  A 


What  is  the  complexity  of  a  bubble  sort?  With  7  —  n  —  1,  the  inner  loop  is 
executed  n  —  1  times  and  there  are  n  —  1  comparisons.  With  i  =  n  —  2,  there  are 
n  —  2  comparisons,  and  so  on.  With  i  =  1 ,  just  one  comparison  is  required.  In  all, 
the  number  of  comparisons  is 


n- 1 


n—  1 


n(n  —  I) 
2 


so  that  this  algorithm  is  0{n2).  Note  that  this  estimate  of  the  complexity  of  the 
bubble  sort  is  best  possible:  In  any  bubble  sort,  all  5 n(n  -  1)  comparisons  must 
be  made  before  it  is  completed. 

Many  sorting  algorithms  do  better  than  the  bubble  sort.  On  the  other  hand,  it  is 
known  that  no  sorting  algorithm  has  worst  case  complexity  better  than  0(n  log  n). 
With  a  view  toward  an  eventual  discussion  of  one  n  log  n  algorithm,  we  first 
introduce  an  algorithm  for  merging  two  sorted  lists,  an  important  task  in  its  own 
right.  Instructors  with  numerous  examination  booklets  to  put  into  alphabetical 
order  have  long  since  learned  that  speed  can  be  improved  by  first  dividing  the 
booklets  into  smaller  alphabetized  groups  and  then  merging  the  groups. 

Our  merging  algorithm  takes  two  ordered  lists  and  produces  a  third  ordered 
list  comprised  of  the  elements  in  the  two  given  lists,  in  order. 


Here  is  how  our  Merging  Algorithm  would  sort  the  ordered  lists 

1,3,5,7,10,11  and  2, 5,6, 9. 

We  compare  the  first  elements  of  each  list.  Since  1  <  2,  we  start  our  third  list 
with  1,  removing  1  from  the  first  list.  This  leaves  us  with 

3,5,7,10,11  2, 5,6, 9  and  1. 

We  compare  the  first  elements  of  the  first  two  lists.  Since  2  <  3,  we  place  2  after 
1  in  the  third  list  and  remove  2  from  the  second  list.  At  this  stage,  we  have  the 
three  lists 


3,5.7,10,11  5,6,9  and  1,2. 

We  continue  to  compare  the  first  elements  in  the  first  two  lists,  appending  the 
smaller  to  the  third  list  while  removing  it  from  the  list  from  which  it  came.  If  the 
first  element  in  the  first  list  equals  the  first  element  in  the  second,  we  append  that 
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EXAMPLE  25 
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element  from  the  first  list  to  the  third  (and  remove  it  from  the  first).  Here  are  the 
next  six  steps  in  our  merge. 


5.7.10.11  5,6,9 

7.10.11  5.6,9 

7,10,11  6,9 

7.10.11  9 

10.11  9 

10.  11 


1,2,3 

1,2, 3, 5 

1,2,  3,  5,  5 

1.2,  3,  5,  5,6 

1.2,  3,  5,  5,6,  7 

1.2,  3,  5,  5,  6,  7,  9 


At  this  stage  the  second  list  is  empty,  so  we  simply  append  the  remaining  elements 
10,  11  of  the  first  list  to  the  third,  giving  the  final  list  1,  2,  3,  5,  5,  6,  7,  9, 
10,  11.  A 


To  merge  two  given  sorted  lists 

L\  \  a\  <02  <  ■  ■  ■  <  as  and  £2:  b\  <  bi  <  •  •  •  <  bt, 
of  lengths  s  and  t,  into  a  single  sorted  list 

£3 :  O  <  C2  <  ■  ■  •  <  cs+l 
of  length  .5  +  t,  proceed  as  follows. 

Step  1.  Set  £3  equal  to  the  empty  list. 

Step  2.  If  £|  is  empty,  set  £3  =  £2  and  stop.  If  £ 2  is  empty,  set  £3  =  £|  and 
stop. 

Step  3.  Suppose  a\  <b\.  Then  append  ci\  to  £3  and,  if  this  empties  £|,  append 
the  remaining  elements  of  £2  to  £3  and  stop.  If  r  >  0  elements  remain  in  £  1 , 
label  them  a\,  02, . . .  ,  a,-  in  increasing  order  and  repeat  Step  3. 

Suppose  o  1  >  b\.  Then  append  b\  to  £3  and,  if  this  empties  £2,  append 
the  remaining  elements  of  £|  to  £3  and  stop.  If  r  >  0  elements  remain  in  £2, 
label  them  b\,  fa,  •  •  •  ,br  in  increasing  order  and  repeat  Step  3. 


We  apply  the  merging  algorithm  to  the  lists 


£1: 


3 

a  1 


5  8 

02  03 


and 


£2 


1 

b\ 


1  8 
fa  fa. 


Since  neither  list  is  empty,  we  proceed  directly  to  Step  3.  Since  3  =  a\  >  b\  =  1, 
we  append  b\  to  the  list  £3,  which  was  initially  empty.  Then  we  relabel  the 
remaining  elements  7  and  8  of  £2  as  b\,  fa  respectively  so  that  our  lists  are 


3  5  8  .  7  8 

0\  02  «3  b\  fa 


and  £3 : 

c  1  • 


We  then  repeat  Step  3.  Since  a\  <  b\,  we  append  a\  =  3  to  £3  and  relabel  the 
remaining  elements  5  and  8  of  £1  as  a\  and  02,  respectively.  Our  lists  are  now 


£,: 


5 

a\ 


8 

02 


£2: 


7 

b\ 


8 

fa 


and 


£3: 


1 

Cl 


3 

C’2- 


We  repeat  Step  3.  Since  a\  <  b\,  we  append  a\  =  5  to  £3  and  relabel  the  elements 
of  £],  obtaining 


£1 


8  .  7  8 

£  2- 


a\  “  b\  b2 

After  the  next  repetition  of  Step  3  we  have 


and  £3 : 


£ 


8 


1  • 


£2 : 


8 


and  £3 : 


1 


1  3  5 

c  1  c2  c3. 


7 


a  1  “  b  1  -  c  1  c2  c  3  C4. 

Now,  since  8  =  a\  <  b\  =  8,  we  append  a\  to  £3,  giving 


£ 


„  8  ,  r  1  3  5  7  8 

£2:  ,  and  £3: 


b\  -  c  1  C2  C3  C4  C5. 

Since  £1  is  now  empty,  we  append  8,  the  remaining  element  of  £2,  to  £3, 
producing  the  final  merged  list: 

1  3  5  7  8  8 

£3 :  c  1  c2  C3  C4  C5  C6-  A 


How  many  comparisons  does  the  Merging  Algorithm  require  to  merge  ordered 
lists  £1  and  £2  of  lengths  s  and  t  into  a  single  ordered  list  £3  of  length  s  +  f? 
Notice  that  while  elements  remain  in  £|  and  £2,  each  element  goes  into  £3  after 
one  comparison.  Eventually  one  list  is  empty  and  at  least  one  element  remains  in 
the  other;  such  remaining  elements  go  to  the  end  of  £3  with  no  comparisons.  It 
follows  that,  in  all,  at  most  s  +  t  —  1  comparisons  are  needed. 


PROBLEM  26.  Find  ordered  lists  of  total  length  s  +  t  =7  which  require  exactly 
7-1=6  comparisons  to  merge. 

Solution.  The  maximum  number  of  comparisons,  s  +  t  —  1,  occurs  when  just 
one  number  remains  in  one  of  the  lists  when  the  other  is  emptied.  Here  is  such  a 
situation. 

£|  £2  £3  Comparisons  Needed 


1, 3,5,7 

2,4,6 

3,5,7 

2,4,6 

1 

1 

3,5,7 

4,6 

1,2 

1 

5,7 

4,6 

1,2,3 

1 

5,7 

6 

1,2,3, 4 

1 

7 

6 

1,23,4,5 

1 

7 

1,2,3,43,6 

1 

1,23,4,5.6,7 

0 

Total 

6 

Find  ordered  lists  of  total  length  7  which  require  just  three  comparisons  to  merge. 

We  are  now  able  to  describe  an  algorithm  for  sorting  a  list  which  is  not 
only  more  efficient  than  the  bubble  sort,  but,  in  fact,  achieves  the  theoretical  best 
possible  worst  case  complexity.  Here  is  the  idea. 
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EXAMPLE  27 


8.3.6  A  MERGE  SORT 
ALGORITHM 


Suppose  we  are  given  the  list 

7,  11,5,9,  11,4,  10,  15,  17,3,9,6,21,  1 

of  length  14.  We  group  the  elements  into  seven  pairs  and  order  each  pair.  This 
gives  us  seven  pairs  of  ordered  lists. 

7,11;  5,9;  4,11;  10,15;  3,17;  6,9;  1,21. 

Next,  we  merge  the  first  two  ordered  lists,  merge  the  third  and  fourth  ordered 
lists,  and  merge  the  fifth  and  sixth.  The  last  ordered  list — 1,  21 — is  unchanged. 

5,7,9,11;  4,10,11,15;  3,6,9,17;  1,21. 

At  this  point,  we  have  four  ordered  lists.  We  merge  the  first  two  and  the  last  two, 
giving 

4, 5, 7, 9,  10,  11,  11,  15;  1,3, 6, 9,  17,21. 

Finally,  we  merge  the  two  lists  that  remain,  obtaining  the  final  ordered  list 

1,3,4,  5,6,  7,9,9,  10,  11,  II,  15,  17,21.  A 

In  this  example,  it  is  useful  to  think  of  the  given  list  as  14  ordered  lists,  each 
of  length  1.  After  the  first  step,  we  were  left  with  seven  ordered  lists  of  length  2; 
after  the  second,  with  four  ordered  lists;  after  the  third,  with  two  ordered  lists,  and 
after  the  fourth,  with  the  desired  single  ordered  list.  Since  the  number  of  ordered 
lists  is  essentially  halved  at  each  stage,  in  general  k  steps  will  be  needed  to  sort 
a  list  of  n  elements,  where  2*“’  <  n  <  2k .  For  example,  23  <  14  <  24,  so  four 
steps  are  required  to  sort  a  list  of  length  14. 

Here  is  a  general  description  of  our  Merge  Sort  Algorithm. 

To  sort  a  list  a\,ai, . . .  ,an  into  increasing  order,  proceed  as  follows. 

Step  1.  set  F  =  0. 

Step  2.  for  i  =  1  to  n,  let  the  list  XL,  be  the  single  element  a,-. 

Step  3.  while  F  =  0 

if  n  =  1,  set  F  =  1  and  output  XL| ; 
if  n  =  2m  is  even 
for  i  =  1  to  m 

•  merge  the  sorted  lists  XL2/-1  and  £2, 
and  label  the  resulting  sorted  list  XL,-; 

•  set  n  =  m. 

if  n  =  2m  +  1  >  1  is  odd 
for  /  =  1  to  m 

•  merge  the  sorted  lists  £2,-1  and  £2/ 
and  label  the  resulting  sorted  list  £,  ; 

•  change  the  label  of  the  former  list  £„  to  £m+i; 

•  set  n  =  m  +  1 . 


end  while 
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The  variable  F  in  our  algorithm  is  called  a  flag,  a  useful  parameter  to  introduce 
in  an  algorithm  to  enable  fast  termination  of  a  loop.  Here,  it  enables  us  to  get  out 
of  the  “while”  loop  as  soon  as  the  merging  is  complete. 


PROBLEM  28.  What  would  the  merge  sort  algorithm  do  with  the  list 

2  9  1  4  6  5  3  7 

a\  £?2  #3  CI5  «6  al 

Solution.  Initially  F  =  0  and  lists  £1 ,  £2.  •  •  •  ,  £7  are  defined,  each  of  length  1. 

£1 :  2  £2:  9  £3:1  £4:4  £5:6  £6:5  £7:  3. 

At  Step  3,  since  F  —  0  and  n  —  2(3)  +  1  is  odd  (m  =  3),  we  form  four  new  lists 
£1 ,  £2,  £3,  £4  by  merging  the  first  six  former  lists  in  pairs  into  three  and  adding 
the  seventh.  We  now  have 

£1 :  2,9  £2:  1,4  £3:  5,6  £4:  3. 

At  this  point,  n  is  replaced  bym  +  l  =  3+  l=  4  and  we  repeat  Step  3.  Since 
n  =  2(2)  is  even  ( m  =  2),  we  form  two  lists  £1  and  £2  by  merging  the  former 
lists  £|,  £2  and  £3,  £4,  respectively.  At  this  point  we  have 

£1 :  1,  2, 4, 9  £2:  3,5,6. 

Now  n  is  replaced  by  m  =  2  and  we  repeat  Step  3.  Since  n  —  2(1)  is  even 
(. m  —  1),  the  list  £[  is  formed  by  merging  the  former  £1  and  £2. 

£1:  1,2,  3,  4,  5,  6,  9 

At  this  point,  n  is  replaced  by  m  =  1  and  we  repeat  Step  3.  Since  n  =  1,  we  set 
F  =  1,  output  £|,  the  desired  sorted  list,  and  stop.  I 


What  is  the  complexity  of  the  Merge  Sort  Algorithm  in  terms  of  the  number 
of  comparisons  which  are  required?  As  with  binary  search,  it  is  helpful  first  to 
consider  the  case  that  the  given  list  has  length  n  =  2k  because  then  Step  3  is 
executed  exactly  k  times.  Also,  we  enter  Step  3  each  time  with  an  even  number 
of  lists  and  exit  with  an  even  number,  until  the  last  step. 

Let  us  first  ignore  the  at  most  three  comparisons  required  to  decide  if  n  =  1, 
n  =  2m,  or  n  =  2m  +  1  >  1  each  time  Step  3  is  executed  and  just  count 
comparisons  required  by  the  merging.  Initially,  we  enter  Step  3  with  2k  lists,  each 
of  length  1,  which  are  merged  in  pairs.  Each  merge  requires  one  comparison  for 
a  total  of  2k  1  =  2k  —  2i_1  comparisons  altogether.  We  pass  through  Step  3 
a  second  time,  entering  with  2k~l  lists  of  length  2  which  are  merged  in  pairs. 
Each  pair  is  merged  with  at  most  2  +  2—1  =4—1  comparisons,  for  a  total  of 
2*~2(4  —  1)  =  2k  —  2k~2.  The  third  time  we  execute  Step  3,  we  enter  with  2k~2 
lists  of  length  4  which  are  merged  in  pairs.  Each  pair  is  merged  with  at  most 
4  +  4—1  =8—1  comparisons,  for  a  total  of  2k~3(S  —l)  =  2k  —  2k~3 .  Continuing 
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this  line  of  reasoning,  we  see  that  pass  i  through  Step  3  requires  at  most  2k  —  2k~' 
comparisons  for  the  merging.  The  Ath  and  final  pass  requires  2k  —  1  comparisons, 
the  result  of  merging  two  lists  of  length  2k~] . 

In  total,  the  merging  in  the  algorithm  requires  at  most 

(1)  (2k  -  2k-')  +  (2k  -  2k~2)  +  ■  ■  •  +  (2k  -  4)  +  (2k  -  2)  +  ( 2k  -  1) 


comparisons.  There  are  k  terms  here.  Since  1  +  2  +  4  H - t-  2*_1  =  2k  —  \  (see 

Pause  4),  the  sum  in  equation  (1)  equals  k2k  —  (2*  —  1)  =  k2k  —  2k  +  1.  In  all, 
including  the  possible  three  checks  to  determine  whether  k  —  1,  k  =  2m  or  k  = 
2 in  +  1  >  1 ,  the  algorithm  requires  at  most  3 k  +  (k2k  —  2k  + 1 )  =  k2k  —  2k+2>k+\ 
comparisons. 

In  general,  for  a  list  of  n  elements,  find  k  such  that  2k~'[  <  n  <  2k.  By 
extending  the  list  to  one  of  length  2k  (for  example,  by  appending  additional 
elements  larger  than  any  number  in  the  list),  we  see  that  the  given  list  can  be 
sorted  by  sorting  the  larger  list  and  then  removing  the  additional  large  elements 
from  the  end  of  the  list.  By  the  preceding  analysis,  it  follows  that  the  given  list 
can  be  sorted  with  at  most  k 2k  —  2k  +  3k  +  1  comparisons.  Since  2k~]  <  n, 
k  <  1  +  log2«  and  2k  <  2 n.  So  the  number  of  comparisons  is  at  most 

k2k  -  2k  +  3k  +  1  <  (1  +  log2  n)2k  -2k  +  2k+\ 

=  2k  log2  n  +  3k  +  1 
<  2 n  log2  n  +  3  log2  n  +  4. 


Since  2 n  log2  n+3  log2  n+  4  ~  n  log2  n,  the  number  of  comparisons  is  0(n  logn). 
Referring  to  Table  8.1  of  Section  8.2,  we  note  that  this  is  a  substantial  improve¬ 
ment  on  0(n2),  the  complexity  of  the  bubble  sort. 

MIHBEESBI  Explain  why  1  +  2  +  4  + - f  2k~]  =  2k  -  1. 


Answers  to  Pauses 


3.  We  want  ordered  lists  with  the  property  that  after  three  comparisons  one  is 
empty.  Here  is  an  example: 


-C)  £2  £3  Comparisons  Needed 


1,3  2,4, 6, 8,9 

3  2,4, 6,8, 9 

3  4,6, 8,9 

4,6, 8,9 


Total  3 


1  1 

1,2  1 

1,2,3  1 

1,2, 3, 4, 6, 8, 9  0 


4.  The  terms  in  this  sum  are  those  of  a  geometric  sequence  with  a  =  1,  r  =  2, 
n  =  k.  The  sum  is  [See  formula  (8)  on  p.  167.] 
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EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  Show  the  sequence  of  steps  in  using  a  binary 

search  to  find  the  number  2  in  the  list  1,  2,  3, 4,  5, 
6,  7,  8,  9.  How  many  times  is  2  compared  with  an 
element  in  the  list?  How  many  times  would  it  be 
compared  with  an  element  in  the  list  if  we  employed 
a  linear  search? 

(b)  Repeat  (a)  if  we  are  searching  for  the  number  7. 

2.  (a)  Describe  a  ternary  search  algorithm,  which  searches 

an  ordered  list  for  a  given  element  by  successively 
dividing  the  list  into  thirds  and  determining  in  which 
third  the  element  must  lie. 

(b)  Show  that  the  complexity  of  the  algorithm  found  in 
(a)  (in  terms  of  comparisons)  is  O(log3n). 

3.  (a)  [BB]  Describe  an  algorithm  which,  upon  input  of 

k  numbers,  each  in  the  range  1-100  (inclusive), 
outputs  the  complement  of  this  set  with  respect  to 
U  =  {1,2,...  ,100}. 

(b)  Repeat  (a)  if  100  is  replaced  by  n.  Find  a  Big  Oh 
estimate  for  the  complexity  of  your  algorithm,  in 
terms  of  comparisons. 

4.  [BB]  Show  the  sequence  of  steps  involved  in  merging 
the  sorted  lists  1,  2,  3,  4,  5  and  2,  4,  6,  8,  10.  How  many 
comparisons  are  required? 

5.  Rewrite  the  Merging  Algorithm  using  a  flag  to  prevent 
the  program  from  exiting  during  Step  1 . 

6.  [BB]  Sort  the  list  3,  1,  7,  2,  5,  4  into  increasing  order 

(a)  with  a  bubble  sort; 

(b)  with  a  merge  sort. 

In  each  case,  how  many  comparisons  are  needed?  (For 
the  merge  sort,  ignore  comparisons  required  to  check 
the  size  and  parity  of  n  at  each  iteration  of  Step  3.) 

7.  Repeat  Exercise  6  for  the  list  10,  1 1,  15,  3, 18,  14,  7,  1. 

8.  [BB]  The  median  of  a  list  of  numbers  is  the  middle 
number  or  the  average  of  the  middle  two  numbers,  after 
the  numbers  have  been  listed  from  smallest  to  largest. 
Describe  an  algorithm  for  finding  the  median  of  an  input 
list  of  n  numbers.  Find  a  Big  Oh  estimate  of  the  com¬ 
plexity  of  your  algorithm. 

9.  Suppose  that  the  eight  elements  a,  b,  c,  d,  u,  v,  w.  x  are 
ordered 

d<a<u<c<x<b<v<w. 


(a)  [BB]  Show  the  steps  in  a  bubble  sort  applied  to  the 
list  a,  b,  c,  d,  u,  v,  w,  x.  How  many  comparisons 
are  required? 

(b)  Repeat  (a)  using  a  merge  sort. 

10.  [BB]  Find  an  example  of  two  ordered  lists  of  lengths  .v 
and  t  >  3  which  can  be  merged  with 

(a)  one  comparison 

(b)  t  comparisons 

(c)  exactly  s  +t  —  1  comparisons.  (For  this  part,  assume 
also  that  s  >  3.) 

11.  (a)  [BB]  Show  the  steps  involved  in  the  application  of 

a  bubble  sort  to  the  list  c,  a,  e,  b ,  d,  where  these 
letters  have  their  natural  alphabetical  order. 

(b)  Apply  the  same  sequence  of  interchanges  as 
required  in  part  (a)  to  the  list  1,  2,  3,  4,  5;  that  is, 
if  it  is  necessary  to  interchange  the  second  and  third 
elements  at  a  certain  stage  in  the  bubble  sort  of  (a), 
then  interchange  the  second  and  third  elements  at 
the  same  stage  in  the  “sort”  of  1 , 2,  3,  4,  5.  The  final 
sequence  is  2,  4,  1, 5,  3,  which  describes  the  order 
in  which  the  elements  of  c,  a,  e,  b,  d  must  be  taken 
to  list  them  in  order. 

(c)  Describe  an  algorithm  whose  input  is  a  list  ai, 

a 2 . a„  whose  natural  order  is  a,-, ,  a;2, . . .  ,  a,n 

and  whose  output  is  the  sequence  of  indices  i\, 
i 2, ...  ,  in  (in  this  order). 

12.  We  explained  in  the  text  why  the  Merging  Algorithm 
requires  at  most  s  + 1  —  1  comparisons  to  merge  ordered 
lists  of  lengths  s  and  t.  For  arbitrary  s  and  t,  do  there 
exist  ordered  lists  of  these  lengths  for  which  precisely 
s  +  t  —  1  comparisons  are  needed  to  merge?  Explain. 

13.  The  Binary  Search  Algorithm  we  have  presented  appears 
to  be  more  efficient  than  a  linear  search,  O(logn)  ver¬ 
sus  0{n),  but  the  binary  search  assumes  the  input  list 
is  ordered.  Suppose  we  modify  the  Binary  Search  Algo¬ 
rithm  so  that  it  accepts  an  unordered  list  as  input  by  first 
using  an  efficient  sorting  algorithm  and  then  the  binary 
search  as  described  in  this  section.  Is  this  new  algorithm 
still  more  efficient  than  a  linear  search?  Explain. 

14.  [BB]  What  is  the  fewest  number  of  comparisons 
required  to  merge  ordered  lists  of  lengths  s  and  tl 
Explain  your  answer. 
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15.  Ignoring  repeated  checks  as  to  whether  n  =  I ,  n  =  21, 
or  n  =  2£+ 1  >  1 ,  we  showed  in  the  text  that  the  number 
of  comparisons  in  any  merge  sort  of  2k  elements  is  at 
most  k2k  —  2k  +  1 .  Give  a  specific  example  of  a  list  of 
length  2k,  where  precisely  k2k  —  2*  +  1  comparisons  are 
required. 

16.  [BB]  In  order  to  output  the  distinct  items  of  a  given  list 
a\,aj, ...  ,an,  the  following  method  is  proposed.  For 
each  k,  search  the  elements  preceding  a k  and,  if  a *  is 
not  found,  output  a *.  Show  that  this  algorithm  can  be 
accomplished  with  O(k)gnl)  comparisons. 

17.  Modify  the  Bubble  Sort  Algorithm  to  find  an  algo¬ 
rithm  which  reverses  a  sequence;  that  is,  upon  input 


of  a\,a2,...  ,an,  the  algorithm  outputs  the  sequence 
a„,an- 1,...  ,  (i2,  a  \ .  How  many  swaps  are  required? 
Give  a  Big  Oh  estimate  of  complexity. 

18.  [BB]  It  is  suggested  that  one  way  of  searching  for  a 
number  x  in  a  given  unordered  list  is  first  to  sort  the 
list  using  a  merge  sort  and  then  to  use  a  binary  search 
algorithm.  Is  this  more  or  less  efficient  than  a  linear 
search? 

19.  Given  a  list  a\,a2, ...  ,a„,  each  item  of  which  is 
0  or  1,  it  is  desired  to  put  the  list  into  increasing 
order.  Would  you  use  one  of  the  sorting  methods  dis¬ 
cussed  in  this  section,  or  is  there  a  better  method? 
Explain. 


8.4  Enumeration  of  Permutations  and  Combinations 

A  truck  driver  for  Deluxe  Bakery  has  to  deliver  bread  to  many  supermarkets  scat¬ 
tered  throughout  a  large  metropolitan  area.  She  has  a  map  showing  the  precise 
locations  of  her  delivery  points.  She  knows  how  long  it  takes  to  drive  from  the  bak¬ 
ery  to  each  point  and  how  long  it  takes  to  drive  between  any  two  delivery  points. 
In  what  order  should  she  deliver  bread  in  order  to  minimize  total  traveling  time? 
This  problem,  known  as  the  Traveling  Salesman ’s  Problem,  will  be  encountered 
again.  Here,  it  serves  to  illustrate  a  situation  where  we  would  like  to  enumer¬ 
ate  permutations.  Assigning  the  supermarkets  the  numbers  1,2,...  ,  n,  the  truck 
driver  ideally  would  like  to  list  all  the  permutations  of  1,  2,  . . .  ,  n  and,  for  each 
permutation,  determine  the  time  required  for  the  route  which  visits  supermarkets 
in  the  specified  order. 

To  list  all  the  permutations  of  a  set  of  two  or  three  or  even  four  elements  is  not 
difficult.  To  enumerate  all  the  permutations  of  larger  sets  becomes  first  tedious, 
and  then  ridiculous,  at  least  by  hand,  because  n\  grows  so  very  rapidly  with  n: 
For  instance,  15!  is  roughly  1012. 

To  list  all  permutations  of  ten  elements,  say,  it  would  seem  sensible  to  enlist 
the  support  of  a  computer.  For  this,  of  course,  a  suitable  algorithm  is  required,  an 
algorithm  which,  upon  input  of  a  positive  integer  n,  outputs  a  complete  list  of  all 
the  permutations  of  1,  2, . . .  ,  n  without  omission  or  repetition.  We  are  confronted 
with  two  problems. 


•  How  should  the  permutations  be  ordered? 

•  Given  a  permutation,  how  is  the  next  one  determined? 

Lexicographic  order — the  way  words  are  ordered  in  a  dictionary — is  one  way 
to  order  permutations.  In  a  dictionary,  terrible  precedes  terrific  because,  reading 
from  left  to  right,  the  first  place  where  the  words  differ  is  at  the  sixth  letter,  and 
there  b  precedes  /  in  the  natural  ordering  of  the  letters  of  the  alphabet.  So  we 
list  the  permutation  beadc  before  bedca ,  but  after  bcaed.  In  a  similar  way,  if  the 
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8.4.1  PROPOSITION 


EXAMPLE  29 


EXAMPLE  30 


8.4.2  ALGORITHM  FOR 
ENUMERATING 
PERMUTATIONS 


symbols  are  numbers  instead  of  letters,  1357264  precedes  1357624  but  follows 
1347256. 

With  respect  to  lexicographic  ordering,  there  is  a  procedure  for  finding  the 
permutation  which  follows  any  given  permutation. 

When  the  permutations  of  1,2,...  , n  are  ordered  lexicographically,  the  permu¬ 
tation  which  follows  a  particular  permutation  n  is  obtained  by 

•  reading  the  digits  in  it  from  right  to  left, 

•  noting  the  first  consecutive  pair  xy  where  x  <  y, 

•  replacing  x  by  the  smallest  of  those  digits  to  its  right  which  are  larger  than 
x,  and  then 

•  writing  down  in  increasing  order  the  digits  not  yet  used. 

The  permutation  following  1374652  is  1375246  because,  reading  from  right  to 
left,  the  first  consecutive  pair  xy  with  x  <  y  is  46,  so  we  replace  4  by  5,  the 
smallest  digit  to  its  right  which  is  larger  than  4,  giving  an  initial  string  1375, 
which  is  completed  by  writing  down  the  unused  digits,  2,  4,  and  6,  in  increasing 
order.  ▲ 

The  permutations  of  1,2,  3,  4  in  lexicographic  order  are 

1234  1243  1324  1342  1423  1432  2134  2143 

2314  2341  2413  2431  3124  3142  3214  3241 

3412  3421  4123  4132  4213  4231  4312  4321.  ▲ 

To  enumerate  all  permutations  of  1,2,...  ,  n,  the  following  algorithm  can 
be  used. 

Given  a  natural  number  n,  to  enumerate  the  n !  permutations  of  1, 2,  . . .  ,  n,  pro¬ 
ceed  as  follows. 

Step  1.  Set  t  —  1.  Output  Perm(l)  =  123  . .  .n.  If  n  —  1,  stop. 

Step  2.  For  t  =  1  to  n!  —  1,  given  permutation  Permit)  =  tt\tt2  ■  ■ .  nn,  determine 
the  next  permutation  Perm(t  +  1)  as  follows: 

•  find  the  largest  j  such  that  tcj  <  try+i; 

•  let  m  =  min{7r/  |  i  >  j ,  jt,  >  ttj}; 

•  let  S  be  the  complement  of  the  set  [tt\  , . . .  ,  TTj-\ ,  m\  with  respect 
to  U  =  {1,2,...  ,  «}; 

•  sort  the  elements  of  S  in  increasing  order,  b\  <  b2  <  ■  ■  ■  <  bn-j\ 

•  output  Permit  +  1)  =  tt\  . . .  nj-\mb\  . . .  bn-j. 

With  n  =  3,  for  example,  the  permutations  of  1,  2,  3  are  generated  as  shown 
in  Table  8.2. 

Any  algorithm  which  involves  n\  steps  will  take  a  long  time  to  run,  even 
for  relatively  small  n.  For  example,  at  a  million  operations  per  second,  12!  oper¬ 
ations  require  almost  eight  minutes.  In  the  Exercises,  we  ask  you  to  show  that 
the  algorithm  just  described  can  be  implemented  with  complexity  0(n\n).  (See 
Exercise  3  at  the  end  of  this  section.) 
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Table  8.2  The  permutations  of  1,  2,  3  as  generated  by  Algorithm  8.4.2. 

t 

Permit) 

j 

m 

S 

1 

1  2  3 

7T 1 7T27T3 

2 

3 

{7T],m}c  =  {1,  3}c  =  {2} 

2 

1  3  2 

7T 1 7T27T3 

1 

2 

{ifir  =  {2}c  =  u,  3} 

3 

2  1  3 

7T\7T27T 3 

2 

3 

Dn,m}c  =  {2,3}c  =  {1} 

4 

2  3  1 

rr  1 7T27T3 

1 

3 

[m}c  =  {3}c  =  {1,  2} 

5 

3  1  2 

7T]7T27t2 

2 

2 

[tz\ ,  m}c  =  {3,  2}c  —  {1} 

6 

3  2  1 

Deluxe  Bakery  is  considering  the  introduction  of  bagels  to  its  traditional 
product  lines  of  bread  and  pastries  and  decides  to  select  6  of  the  supermarkets 
which  carry  its  products  for  a  trial.  Towards  this  effort,  it  would  be  helpful  for  the 
marketing  manager  to  review  a  list  of  all  possible  choices  in  order  to  choose  the 
most  useful  6.  What  is  required  then  is  a  listing  of  all  combinations  of  supermarkets 
taken  6  at  a  time. 

As  with  permutations,  combinations  will  be  listed  lexicographically;  how¬ 
ever,  this  time  we  must  be  careful  to  avoid  listing  both  142897  and  218479, 
for  instance,  since  these  represent  the  same  combination.  Suppose  we  agree  to 
list  elements  in  their  natural  (increasing)  order,  so  that  rather  than  142897  or 
218479,  we  will  write  124789.  Imagine  that  the  combination  uvwxyz  appears 
in  our  list.  Then  u<v<w<x<y<z  and  so,  if  these  symbols  come 
from  {1,2,...  ,72},  it  is  apparent,  for  instance,  that  y  cannot  be  72  and  x 
can  be  neither  71  nor  72.  The  largest  possible  value  for  z  is  72,  the  largest 
for  y  is  71,  the  largest  for  x  is  70,  the  largest  for  w  is  69.  the  largest  for 
v  is  68,  and  the  largest  value  for  u  is  67.  Also,  the  smallest  combination  is 
123456. 


8.4.3  PROPOSITION 


When  the  combinations  of  1,2,...  ,  n  taken  r  at  a  time  are  ordered  lexicograph¬ 
ically,  the  first  combination  is  1 23  ...  r  and  the  combination  which  immediately 
follows  a  given  one  is  obtained  by 


«  reading  the  combination  from  right  to  left  until  the  first  digit  which  can  be 
increased  is  found, 

•  adding  1  to  this  digit  (suppose  this  number  is  now  k), 

•  leaving  the  digits  before  k  as  they  were,  but  following  k  by  k  +  1,  k  +  2, . . . 
and  so  on  until  r  digits  in  all  have  been  written  down. 
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In  the  lexicographic  ordering  of  the  combinations  1,2,...  ,9  taken  six  at 
a  time,  what  combination  follows  134589?  Reading  from  right  to  left,  the  first 
digit  which  can  be  increased  is  5.  Increase  this  to  6  (so  that  the  combination 
begins  1346)  and  complete  the  combination  with  78,  giving  134678.  What  follows 
236789?  The  first  digit  which  can  be  increased  is  3.  Increase  this  to  4  (the  next 
combination  begins  24)  and  complete  with  5678,  giving  245678. 

In  the  lexicographic  ordering  of  the  combinations  of  1 , 2,  3,  4,  5,  6  taken  four 
at  a  time,  after  1346  come  1356,  1456,  2345,  2346,  2356,  and  so  on.  The  combi¬ 
nations  of  1,  2,  3, 4,  5  taken  three  at  a  time  are 

123,  124,  125,  134,  135,  145,  234,  235,  245,  345. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  With  a  table  like  Table  8.2,  show  the  steps 

involved  in  the  enumeration  of  the  first  eight  per¬ 
mutations  of  1, 2,  3,  4. 

(b)  Repeat  (a)  for  the  second  eight  permutations  of 

1,2,  3,4. 

(c)  Repeat  (a)  for  the  last  eight  permutations  of 

1,2,  3,4. 

2.  (a)  Use  the  procedure  outlined  in  this  section  to  list  the 

first  20  permutations  of  1,2,  3,  4,  5. 

(b)  [BB]  In  the  lexicographic  ordering  of  the  permuta¬ 
tions  of  1,2,  3,4,  5,  what  five  permutations  come 
after  42513? 

(c)  In  the  lexicographic  ordering  of  the  permutations  of 

1, 2,  3,  4,  5,  what  five  permutations  precede  42513? 

3.  (a)  [BB]  Show  that  the  complexity  of  Algorithm  8.4.2 

for  enumerating  permutations  is  0(n\n  logn),  in 
terms  of  comparisons. 

(b)  Improve  the  estimate  in  (a)  by  showing  how  to 
achieve  0{n\n). 

4.  Let  n'  =  Permit  +  1)  be  the  permutation  derived  from 
Perm(t)  =  n  as  in  Proposition  8.4.1. 

(a)  Prove  that  n  <  n' ,  where  -<  means  precedes  in 
lexicographic  order. 

(b)  Prove  that  if  a  is  a  permutation  satisfying  it  < 
it' ,  then  o  =  n .  Thus,  n'  is  the  immediate  successor 
of  it  relative  to  lexicographic  ordering  of  permuta¬ 
tions.  (Here  n  <  o  means  it  -<  a  or  7t  =  a .) 


5.  List,  in  lexicographic  order,  all  combinations  of 

1,2,  3,4.  5,6 

(a)  [BB]  taken  four  elements  at  a  time; 

(b)  taken  three  elements  at  a  time. 

6.  In  the  lexicographic  ordering  of  all  combinations  of 

1,2 . 9  taken  five  at  a  time,  list,  if  possible,  the  three 

combinations  which  immediately  precede  and  the  three 
which  immediately  follow 

(a)  [BB]  23469, 

(b)  13567, 

(c)  45789. 

7.  (a)  List,  in  lexicographic  order,  the  combinations  of 

1 , 2,  3,  4.  5,  6,  7  taken  two  at  a  time. 

(b)  [BB]  Use  Proposition  8.4.3  and  Algorithm  8.4.2  to 

describe  an  algorithm  for  enumerating  the  permuta¬ 
tions  of  1, 2 . n  taken  r  at  a  time,  not  necessarily 

in  lexicographic  order. 

(c)  Use  parts  (a)  and  (b)  to  list  all  permutations  of 

1, 2,  3,  4,  5,  6,  7  taken  two  at  a  time. 

8.  (a)  [BB]  List,  in  lexicographic  order,  all  combinations 

of  1,2 . 8  taken  six  elements  at  a  time. 

(b)  List,  in  lexicographic  order,  the  first  ten  and  the 
last  ten  combinations  of  1 , 2,  . . .  ,8  taken  three  at  a 
time. 

(c)  Use  the  list  in  (b)  and  the  method  proposed  in 

Exercise  7(b)  to  enumerate  the  first  ten  and  the  last 
ten  permutations  of  1 , 2 . 8  taken  three  at  a  time. 
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9.  [BB]  Prove  Proposition  8.4.3. 

10.  [BB]  Suppose  ai«2tf3  ■■■ar  appears  in  the  lexico¬ 
graphic  ordering  of  the  combinations  of  the  integers 
1,2,3,...  ,n  taken  r  at  a  time,  where  a\  <  (12  <  <23  < 
•  <  ar  . 

(a)  What  is  the  biggest  possible  value  for  arl  For  a\! 
For  aj  ? 


(b)  Under  what  conditions  is  this  the  first  combination 
in  the  list? 

(c)  When  is  it  the  last? 

11.  [BB]  Describe  an  algorithm  which  enumerates  all  com¬ 
binations  of  1 , 2, . . .  ,  n  taken  r  at  a  time. 

12.  Suppose  a  set  S  has  kn  elements  for  natural  numbers 
k  and  n.  Describe  an  algorithm  which  will  output  all  k 
element  subsets  of  S. 
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1.  Describe  how  Horner’s  algorithm  evaluates  f(x)  when 

(a)  f(x)  =  2a-3  —  Ax  +  1  and  x  =  —3 

(b)  f{x)  =  x4  —  2x3  +  x2  —  5x  +  6  and  x  =  2 

2.  Use  the  Russian  peasant  method  to  find 

(a)  149  x  712 

(b)  1018  x  72 

3.  Let  n  >  1  be  an  integer,  let  S  =  [1,2 ,...,«}  and 
let  7 Z  =  ((«] ,  b\),  (ci2,  bf). ...  ,  ( a, ,  bt)}  be  a  subset  of 
S  x  S.  Describe  algorithms  which  determine  whether  or 
not  the  relation  TZ  is 

(a)  reflexive; 

(b)  symmetric; 

(c)  antisymmetric; 

(d)  transitive; 

(e)  a  function. 

In  each  case,  find  a  reasonable  estimate  of  the  complex¬ 
ity  of  your  algorithm. 

4.  Suppose  we  want  an  algorithm  which,  for  an  input  of 
integers  a\.  ai, . . .  ,a„,  outputs  the  largest  and  second 
largest  integers.  It  is  proposed  to  sort  the  list  in  decreas¬ 
ing  order  and  to  output  the  first  two  integers  of  the  sorted 
list. 

(a)  In  terms  of  comparisons,  what  is  the  best  complexity 
function  for  this  algorithm? 

(b)  Is  there  a  better  way  to  proceed?  Explain  and,  if  the 
answer  is  yes,  describe  a  better  algorithm. 

5.  Describe  an  algorithm  which  upon  input  of  a  list 

ci\,a2 . a„  of  integers,  outputs  the  largest  and  the 

smallest  numbers  of  the  list. 


6.  Describe  an  algorithm  which  implements  the  floor  func¬ 
tion;  that  is,  upon  input  of  a  real  number  x,  the  output 
is  |*J- 

7.  (a)  Show  that  1*  +  2*  +  •  •  •  +  nk  =  C7(nt+1)  for  any 

k  >  1. 

(b)  Show  that  l2  +  22  4 - 1 -  n2  x  n3. 

8.  Show  that 

(a)  3 n3  -  5n2  4-  2n  +  1  x  n3, 

(b)  n 4  -  In3  +  3n2  -  5n  +  7  x  n4. 

Do  not  merely  quote  Proposition  8.2.7,  but  you  may  use 
ideas  from  its  proof. 

9.  (a)  (Requires  a  little  knowledge  of  calculus)  Show  that 

l  +  j  +  |  +  --  -  +  f=  C?(log«)- 
(b)  Show  that  the  Sieve  of  Eratosthenes,  used  to  find  all 
primes  less  than  or  equal  to  a  given  integer  n ,  can  be 
implemented  with  an  algorithm  which  is  0(n  logn). 
Count  comparisons  and  “crossing  out”  an  integer 
as  basic  operations.  (See  Section  4.3  and  especially 
paragraph  4.3.5.) 

10.  Show  the  sequence  of  steps  in  using  a  binary 
search  to  find  the  number  5  in  the  list 
— 8,  —  5,  —  1, 0,  1, 2,  4,  5,  8,  9,  12.  How  many  times 
is  5  compared  with  an  element  in  the  list?  How  many 
times  would  it  be  compared  with  an  element  in  the  list 
if  we  employed  a  linear  search? 

11.  Show  the  sequence  of  steps  involved  in  merging  the 
sorted  lists  -1,3,7,  10,  12,  15  and  -4,  -3,  -1,2.  How 
many  comparisons  are  required? 

12.  Sort  the  list  9,  —3,  1 . 0,  —4,  5,  3  into  increasing  order 

(a)  with  a  bubble  sort; 

(b)  with  a  merge  sort. 
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In  each  case,  how  many  comparisons  are  needed?  (For 
the  merge  sort,  ignore  comparisons  required  to  check 
the  size  and  parity  of  n  at  each  iteration  of 
Step  3.) 

13.  In  the  lexicographic  ordering  of  the  permutations  of 
1,2,  3,4,  5,6,  list  the  20  permutations  which  follow 
463152. 


14.  In  the  lexicographic  ordering  of  all  combinations  of 
1,2,...  ,7  taken  four  at  a  time,  list,  if  possible,  the 
three  combinations  which  immediately  precede  and  the 
three  which  immediately  follow 

(a)  3467, 

(b)  1236, 

(c)  2347. 


CHAPTER 
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Graphs 


9.1  A  Gentle  Introduction 

There  are  many  concrete,  practical  problems  that  can  be  simplified  and  solved  by 
looking  at  them  from  a  different  point  of  view.  The  Konigsberg  Bridge  Problem, 
which  we  will  soon  describe,  was  a  long-standing  problem  until  it  was  imagina¬ 
tively  solved  in  1736  by  the  great  Swiss  mathematician  Leonhard  Euler  ( 1 707— 
1783).  Beginning  his  scientific  career  shortly  after  the  death  of  Sir  Isaac  Newton, 
Euler  spent  the  last  17  years  of  his  life  blind,  but  still  very  active.  Some  of  his 
mathematical  contributions  were  to  the  theory  of  convergent  sequences  and  to  the 
calculus  of  variations.  Much  of  what  is  taught  today  about  quadratic  equations, 
conic  sections,  and  quadrics  in  Euclidean  space  is  just  as  Euler  himself  laid  out. 
Perhaps  less  known  is  that  Euler  was  a  superb  designer  of  algorithms;  he  had  the 
uncanny  ability  of  making  order  out  of  chaos,  of  seeing  simple  routes  through 
the  most  complicated  situations.  It  is  because  of  his  imaginative  solution  to  the 
Konigsberg  Bridge  Problem  that  Euler  is  generally  considered  to  be  the  father  of 
modern-day  graph  theory. 

In  the  eighteenth  century,  Konigsberg  was  the  capital  of  East  Prussia.1  The 
Pregel  River  flowed  through  town  and  split  into  two  branches  around  Kneiphof 
island,  which  is  labeled  A  in  Fig  9.1.  Seven  bridges  crossed  the  river,  provid¬ 
ing  links  among  the  four  land  masses  labeled  A,  B,  C,  D  in  the  figure.  Peo¬ 
ple  wondered  if  it  were  possible  to  start  on  one  of  the  land  masses,  walk  over 
each  of  the  seven  bridges  exactly  once,  and  return  to  the  starting  point  (without 
getting  wet). 

To  find  an  abstract  mathematical  model  of  a  concrete  problem  can  be  a  difficult 
task  requiring  both  ingenuity  and  experience.  The  primary  aim  of  this  chapter  is  to 
provide  the  reader  with  some  of  this  experience  by  presenting  several  real-world 


1  An  ice-free  port  on  the  southern  coast  of  the  Baltic  Sea,  Konigsberg  was  completely  destroyed  during 
World  War  II.  It  was  renamed  Kaliningrad  and  transferred  to  the  Soviet  Union  in  1945.  Today,  it  is  the 
westernmost  city  in  Russia. 
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Figure  9.1 


Figure  9.2 


problems  and  showing  how  they  can  be  formulated  in  mathematical  terms.  This 
process  of  translation  into  mathematics  forces  us  to  sift  through  all  the  details 
of  the  problem,  deciding  which  ones  are  important  and  which  are  extraneous. 
(Those  aspects  of  a  “real”  problem  which  make  it  interesting  are  sometimes  quite 
irrelevant  and  serve  primarily  to  create  confusion!)  Because  the  language  of  math¬ 
ematics  is  precise  and  without  ambiguity,  problems  which  seem  complicated  when 
expressed  in  ordinary  language  often  have  surprisingly  straightforward  mathemat¬ 
ical  translations. 


A  graphical  representation  of  the  Konigsberg  Bridge  Problem. 

For  the  Konigsberg  Bridge  Problem,  Euler’s  idea  was  to  realize  that  the  phys¬ 
ical  layout  of  land,  water,  and  bridges  could  be  modeled  by  the  graph  shown  in 
Fig  9.2.  The  land  masses  are  represented  by  small  circles  (or  vertices)  and  the 
bridges  by  lines  (or  edges)  which  can  be  straight  or  curved.  By  means  of  this 
graph,  the  physical  problem  is  transformed  into  this  mathematical  one:  Given  the 
graph  in  Fig  9.2,  is  it  possible  to  choose  a  vertex,  then  to  proceed  along  the  edges 
one  after  the  other  and  return  to  the  chosen  vertex  covering  every  edge  exactly 
once?  Euler  was  able  to  show  that  this  was  not  possible.  Can  you?  (We  shall 
return  to  this  problem  and  discuss  its  solution  in  Section  10.1,  in  the  context  of 
Eulerian  circuits.) 

The  Three  Houses — Three  Utilities  Problem  is  another  physical  situation 
which  can  be  modeled  by  means  of  a  graph.  There  are  three  houses,  each  of  which 
is  to  be  connected  to  each  of  three  utilities — water,  electricity,  and  telephone — by 
means  of  underground  pipes.  Is  it  possible  to  make  these  connections  without  any 
crossovers?  Figure  9.3  shows  how  to  describe  this  problem  with  a 
graph. 

The  houses  and  the  utilities  are  represented  by  vertices  and  the  pipes  are  the 
lines  drawn  between  the  vertices.  When  we  discuss  planar  graphs  in  Chapter  14, 
we  shall  see  that  the  answer  to  our  question  is  no.  Can  you  convince  yourself  of 
this  now? 
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Figure  9.3  The  Three  Houses — Three  Utilities  Problem. 

For  a  long  time,  there  has  been  on  the  market  a  popular  game  called  “Instant 
Insanity,”  which  consists  of  four  cubes,  each  of  whose  six  faces  is  colored  with 
one  of  four  colors:  red,  blue,  green,  white.  The  object  is  to  stack  the  cubes  in 
such  a  way  that  each  of  the  four  colors  appears  on  each  side  of  the  resulting 
column.  Since  there  are  over  40,000  possible  ways  to  stack  such  cubes,2  it  is  easy 
to  understand  how  this  game  got  its  name!  Certainly  there  ought  to  be  a  better 
way  to  solve  this  puzzle  than  trial  and  error.  To  illustrate  the  method  we  have  in 
mind,  we  first  need  a  way  to  picture  a  three-dimensional  cube  on  paper.  We  do 
so  as  indicated  in  Fig  9.4,  where  we  imagine  that  the  cube  has  been  opened  and 
flattened,  as  if  it  were  a  small  cardboard  box. 


Top 

L 

Left 

Side 

Front 

Right 

Side 

Back 

_ 

Bottom 

Figure  9.4  A  way  to  picture  a  cube. 

In  Fig  9.5,  we  depict  the  four  cubes  of  one  version  of  “Instant  Insanity”  in 
this  way  and  show  a  graph  which  records  the  pertinent  information  about  them. 
The  graph  has  four  vertices,  labeled  R,  B,  G,  and  W,  corresponding,  respec¬ 
tively,  to  the  four  colors  red,  blue,  green,  and  white.  There  is  an  edge  labeled 
3  joining  vertices  W  and  G  because  cube  3  has  a  white  face  and  a  green  face 
on  opposite  sides  of  the  cube.  Cube  3  also  has  a  white  and  red  pair  of  oppo¬ 
site  faces,  so  vertices  W  and  R  are  joined  with  an  edge  labeled  3.  For  a  similar 
reason,  there  is  an  edge  labeled  3  joining  vertices  B  and  R.  The  circle  labeled 
1  at  vertex  R  corresponds  to  the  fact  that  cube  1  has  a  pair  of  opposite  red 
faces. 

Once  again,  a  concrete  physical  problem  has  been  represented  by  a  graph. 
You  do  not  have  to  visit  Konigsberg  to  simulate  the  Konigsberg  Bridge  Problem; 
you  do  not  need  to  build  any  houses  to  simulate  the  Three  Houses — Three  Utilities 


2The  exact  number  is  j(64  x  43)  =  41,472. 
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Figure  9.5 


Figure  9.6 
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Four  colored  cubes  and  a  graphical  representation. 

Problem;  you  do  not  even  have  to  possess  the  actual  cubes  of  “Instant  Insanity” 
in  order  to  play. 

The  graph  in  Fig  9.5  describes  the  four  cubes  of  "Instant  Insanity.”  Could  it 
also  be  used  to  describe  a  stacking? 


Cube  1  in  Fig  9.6  has  its  blue  face  at  the  front  and  white  face  at  the  back. 
This  orientation  can  be  shown  in  the  graph  by  emphasizing  the  edge  labeled  1 
joining  vertices  B  and  W.  Similarly,  the  green-white,  red-white,  and  white-blue 
front-back  faces  can  be  shown  by  emphasizing  three  other  edges  in  the  graph. 
The  stacking  shown  on  the  left  of  Fig  9.6  determines  the  subgraph  on  the  right. 
This  stacking  does  not  solve  the  puzzle,  however,  because  there  are  too  many 
white  faces  at  the  back;  moreover,  any  rotation  of  any  of  the  cubes  through  180° 
always  results  in  more  than  one  white  face  at  the  front  or  the  back.  This  is  more 
easily  seen  by  observing  that  in  the  subgraph,  there  are  too  many  edges  joined  to 
vertex  W.  On  the  other  hand,  there  are  exactly  two  edges  in  the  subgraph  (1  and 
4)  joined  to  vertex  B,  corresponding  to  the  fact  that  there  is  just  one  blue  face  on 
the  front  (part  of  the  B-W  pair  of  faces  of  cube  1)  and  just  one  blue  face  at  the 
back  (part  of  the  W-B  pair  of  faces  of  cube  4). 

When  the  cubes  are  stacked  with  their  fronts  and  backs  correct  (that  is, 
with  four  different  colors  on  the  front  and  four  different  colors  on  the  back),  the 
corresponding  subgraph  will 

•  contain  all  four  vertices  R,  B,  W,  G; 

•  consist  of  four  edges,  one  from  each  cube; 

•  have  exactly  two  edges  or  one  circle  meeting  at  each  vertex. 
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Figure  9.7 


Figure  9.8 


Figure  9.9 


Figure  9.7  shows  two  subgraphs,  each  of  which  corresponds  to  a  stacking  of 
cubes  in  which  the  fronts  and  backs  are  correct. 


Two  acceptable  subgraphs  which  are  not  edge  disjoint. 

Draw  an  arrangement  of  fronts  and  backs  (like  that  on  the  left  in  Fig  9.6),  which 
is  represented  by  the  subgraph  on  the  left  in  Fig  9.7.  All  four  colors  should  appear 
on  the  front  and  on  the  back. 

As  anyone  who  has  tried  this  puzzle  knows,  getting  the  front  and  back  of  the 
column  correct  is  easy.  It  is  next  to  get  the  sides  correct  that  provides  the  fun(?). 
Graphically,  we  require  a  second  subgraph  of  the  type  described  previously  to 
represent  a  correct  stacking  of  the  sides.  Moreover,  since  a  given  edge  cannot 
represent  both  front-back  and  side-side  at  the  same  time,  the  second  subgraph 
must  be  edge  disjoint  from  the  first:  No  edge  can  appear  in  both  subgraphs.  The 
subgraphs  shown  in  Fig  9.7,  for  instance,  are  not  edge  disjoint;  they  have  edges  3 
and  4  in  common.  On  the  other  hand,  the  subgraphs  shown  in  Fig  9.8  are  indeed 
edge  disjoint  and  so  provide  us  with  a  solution  to  “Instant  Insanity,”  as  shown  in 
Fig  9.9. 


Two  acceptable  edge  disjoint  subgraphs. 
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A  solution  to  the  game  of  "Instant  Insanity.” 
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9.1.1  REMARKS 


While  our  game  of  “Instant  Insanity”  had  a  unique  solution,  if  we  try  to  make 
other  games  by  assigning  colors  to  the  faces  of  four  cubes  in  other  ways,  it  is  not 
hard  to  find  games  where  there  are  several  winning  configurations  or  where  there 
is  no  solution  at  all.  Also,  whereas  the  subgraphs  for  our  game  had  certain  special 
properties,  such  as  connectedness 3  and  a  lack  of  edge  crossovers,  in  general, 
be  on  the  lookout  for  other  types  of  subgraphs,  three  of  which  are  depicted  in 
Fig  9.10. 


Figure  9.10  Possible  subgraphs  for  other  cube  games. 


Newspapers  and  magazines  often  contain  mathematical  teasers  whose  solu¬ 
tions  can  be  helped  by  drawing  simple  graphs.  We  close  this  section  with  an 
example  of  one  such  puzzle  and  include  another  in  the  exercises  for  this 
section. 


PROBLEM  1.  You  and  your  buddy  return  home  after  a  semester  at  college  and 
are  greeted  at  the  airport  by  your  mothers  and  your  buddy’s  two  sisters.  Not 
uncharacteristically,  there  is  a  certain  amount  of  hugging!  Later,  the  other  five 
people  tell  you  the  number  of  hugs  they  got  and,  curiously,  these  numbers  are  all 
different.  Assume  that  you  and  your  buddy  did  not  hug  each  other,  your  mothers 
did  not  hug  each  other,  and  your  buddy’s  sisters  did  not  hug  each  other.  Assume 
also  that  the  same  two  people  hugged  at  most  once.  How  many  people  did  you 
hug?  How  many  people  hugged  your  buddy? 

Solution.  The  conditions  on  who  did  not  hug  whom  dictate  that  no  person 
hugged  more  than  four  people.  Since  the  other  five  hugged  different  numbers 
of  people,  this  set  of  numbers  must  be  (0,  1,  2,  3,  4). 

Next,  it  is  important  to  realize  that  the 
person  who  hugged  four  others  could  not  be 
your  buddy.  Why?  The  graph  to  the  right 
should  help.  The  four  people  other  than  you 
and  your  buddy  are  labeled  A\,  A 2,  B\,  Bi, 
and  an  edge  indicates  a  hug.  Think  of  A 1  and 
A2  as  sisters  (or  mothers),  and  B\  and  #2  as 
mothers  (or  sisters). 

Remembering  that  you  and  your  buddy  did  not  hug,  the  graph  shows  that  if 
your  buddy  had  four  hugs,  it  would  be  impossible  for  one  of  the  group  Buddy, 
A\,  A 2,  B\,  B2  to  have  reported  no  hugs.  Thus,  your  buddy  could  not  have  had 
four  hugs. 


You  A 1  B 1 


Buddy  A  2  B2 


1 A  graph  is  connected  if  any  two  vertices  are  joined  by  a  sequence  of  edges.  We  shall  introduce  the 
notion  of  connectedness  formally  in  Section  10.1. 
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So  the  person  who  got  four  hugs  is  among  the  group  A\,  A2,  B\,  Bj-  Suppose 
A  i  got  four  hugs.  (The  argument  which  follows  works  in  precisely  the  same  way 
if  we  assume  A2  or  B\  or  B2  was  the  one  with  four  hugs.)  Since  A\  did  not  hug 
A2,  the  hugging  involving  A\  is  as  depicted  in  Fig  9.11(a). 


Figure  9.11 


(a) 


(b) 


Since  one  of  the  group  A],  A2,  B 1,  B2  received  no  hugs,  it  is  apparent  that 
this  person  was  A2.  Consider  again  Fig  9.11(a).  Since  you  didn’t  hug  your  buddy 
and  Si  didn’t  hug  Bo,  the  only  possible  hugs  not  pictured  there  are  between  you 
or  your  buddy  and  S|  or  S2. 

Somebody  received  three  hugs.  Who  could  that  be?  If  it  were  your  buddy, 
he  would  have  had  to  hug  both  B\  and  S2,  leaving  nobody  with  only  one  hug. 
We  conclude  that  either  B\  or  Bo  received  exactly  three  hugs.  There  is  no  loss 
of  generality  in  assuming  it  was  B2.  Thus,  S2  hugged  Ai,  you  and  your  buddy. 
Your  buddy  now  hugged  at  least  two  people  (A|  and  S2),  so  it  follows  that  B\ 
must  be  the  person  who  had  one  hug.  The  final  situation  is  shown  in  Fig  9.1 1(b). 
You  and  your  buddy  each  hugged  two  people.  I 


Answers  to  Pauses 


1.  There  are  several  possibilities  for  the 
fronts  and  backs  of  cubes  which 
correspond  to  the  subgraph  shown  in 
Fig  9.7.  Here  is  one. 


FRONT  BACK 


B 

Cube  4 

R 

G 

Cube  3 

W 

R 

Cube  2 

G 

W 

Cube  1 

B 

EXERCISES 


The  symbol  IBB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  (Fictitious)  A  recently  discovered  map  of  the  town 
of  Konigsberg  shows  that  there  was  a  ferry  operating 
between  the  banks  labeled  C  and  D  in  Fig  9.1.  Draw 
a  graph  in  which  the  vertices  are  land  masses  and  an 
edge  between  two  vertices  corresponds  to  a  way  to  move 
between  corresponding  land  masses. 

2.  [BB]  Draw  a  configuration  of  two  houses  and  two  util¬ 
ities,  each  house  connected  to  each  utility,  but  with 
no  crossovers. 


3.  One  of  the  owners  of  the  houses  in  the  Three  Houses — 
Three  Utilities  Problem  does  not  want  a  telephone.  Is 
it  now  possible  for  the  houses  to  be  connected  to  util¬ 
ities  without  crossovers?  Draw  a  graph  depicting  the 
situation. 

4.  Find  solutions,  where  possible,  for  the  cube  games  pic¬ 
tured  in  Fig  9.12. 
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Figure  9.12  Continued 

5.  Suppose  the  vertices  of  a  graph  represent  cities  in  a  cer¬ 
tain  region  and  an  edge  joining  two  vertices  indicates 
that  there  is  a  direct  (nonstop)  flight  between  those  two 
cities.  Geographers  define  the  beta  index  of  a  graph  as 
the  ratio  of  the  number  of  edges  to  the  number  of  ver¬ 
tices,  and  view  this  number  as  a  measure  of  connectiv¬ 
ity  of  the  region.  Highly  developed  countries  have  high 
beta  indices;  poorly  developed  countries  have  low  beta 
indices.  Find  the  beta  index  of  each  of  the  following 
graphs. 


n3  i>5  v-j 


6.  [BB]  In  the  graph  shown,  the  vertices  represent  the 
rooms  of  a  one-story  house  and  an  edge  between  ver¬ 
tices  means  that  the  corresponding  rooms  have  a  wall 
in  common.  Draw  a  possible  floor  plan  for  this 
house. 


1  3 


2  5 


7.  You  and  a  friend  meet  three  other  couples  at  a  party 
and  several  handshakes  take  place.  Nobody  shakes  hands 
with  himself  or  herself,  there  are  no  handshakes  within 
couples,  and  no  one  shakes  hands  with  the  same  per¬ 
son  more  than  once.  The  numbers  of  hands  shaken  by 
the  other  seven  people  (excluding  you)  are  all  different. 
How  many  hands  did  you  shake?  How  many  hands  did 
your  partner  shake?  Use  a  graph  to  aid  your  solution. 

8.  (a)  A  graph  has  six  vertices  every  two  of  which  are 

joined  by  an  edge.  Each  edge  is  colored  red  or  white. 
Show  that  the  graph  contains  a  monochromatic  tri¬ 
angle. 

(b)  Is  the  result  of  (a)  true  for  a  graph  with  five  vertices? 
Explain. 

9.  [BB]  A  graph  has  six  vertices  every  two  of  which  are 
joined  by  an  edge.  Each  vertex  is  colored  red  or  white. 
Show  that  the  graph  contains  at  least  two  monochro¬ 
matic  triangles. 

10.  A  graph  has  six  vertices,  every  two  of  which  are  joined 
by  an  edge.  Each  edge  is  colored  red  or  white.  Show 
that  the  graph  contains  at  least  two  monochromatic  tri¬ 
angles. 
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9.2  Definitions  and  Basic  Properties 


9.2.1  DEFINITIONS  AND 
NOTATION 


Having  used  the  term  “graph”  quite  a  bit  already,  it  is  time  now  to  define 
the  word  properly  and  to  introduce  some  of  the  basic  terminology  of  graph 
theory. 

A  graph  is  a  pair  (V,  £)  of  sets,  V  nonempty  and  each  element  of  £  a  set  of 
two  distinct  elements  of  V.  The  elements  of  V  are  called  vertices',  the  elements 
of  £  are  called  edges.  Thus,  if  e  is  an  edge,  then  e  =  { v,  w },  where  v  and  w 
are  different  elements  of  V  called  the  end  vertices  or  ends  of  e.  (Colloquially,  we 
often  say  that  the  edge  e  =  {  v,  w  }  joins  vertices  v  and  w.)  We  usually  abandon 
set  notation  and  refer  to  the  edge  vw\  this  is,  of  course,  the  same  as  the  edge 
wv.  The  vertices  v  and  w  are  said  to  be  incident  with  the  edge  v w ;  the  edge 
vw  is  incident  with  each  vertex.  Two  vertices  are  adjacent  if  they  are  the  end 
vertices  of  an  edge;  two  edges  are  adjacent  if  they  have  a  vertex  in  common. 
The  number  of  edges  incident  with  a  vertex  v  is  called  the  degree  of  that  vertex 
and  is  denoted  deg  v.  If  deg  v  is  an  even  number,  then  v  is  said  to  be  an  even 
vertex ;  if  deg  v  is  an  odd  number,  vertex  v  is  odd.  A  vertex  of  degree  0  is  said 
to  be  isolated.  In  this  text,  graphs  will  always  be  finite ,  meaning  that  the  set  of 
vertices  and,  hence,  also  the  set  of  edges,  are  finite  sets.  Finally,  when  we  say 
that  Q(V.  £)  is  a  graph,  we  mean  that  Q  is  a  graph  with  vertex  set  V  and  edge 
set  £. 


Usually  we  draw  a  picture  of  a  graph  rather  than  presenting  it  formally  as 
sets  of  vertices  and  edges.  For  instance,  the  graph  Q  with  vertices 

V  =  {Ui,  V2,  V3,  V4,  V5,  U6) 

and  edges 

£  =  {V1V4,  V[  V(,,  V2V5,  V4V5,  D5D6} 

can  be  described  as  shown  in  Fig  9.13. 


Vl  V2  V3 


Figure  9.13  A  picture  of  Q(V,  £),  where  V  =  {uj,  V2,  V3, 14,  U5,  ug)  and  £  —  {iqtq,  iq  v^,  vjv^. 
v4v5,  v5u6}. 


Many  of  the  graphs  presented  in  Section  9. 1  are  not  graphs  at  all  according  to 
our  definition!  Many  of  them  have  multiple  edges,  that  is,  several  edges  incident 
with  the  same  two  vertices.  Some  of  them  have  a  loop  at  a  vertex,  that  is,  an 
edge  which  is  incident  with  only  one  vertex.  Since  most  of  the  graphs  of  interest 
to  us  will  have  neither  multiple  edges  nor  loops,  we  have  opted  for  the  definition 
of  “graph”  presented  here.  Nevertheless,  it  is  convenient  to  have  a  term  for  more 


Figure  9.14 


9.2.2  DEFINITION 


EXAMPLE  2 
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general  types  of  graphs  which  do  arise  from  time  to  time.  A  pseudograph  is  like 
a  graph,  but  it  may  contain  loops  and/or  multiple  edges. 

V\  V4 


Figure  9.14  illustrates  a  pseudograph  with  four  vertices  and  five  edges.  Ver¬ 
tices  vi  and  t>2  are  incident  with  edges  e\  and  ey,  thus,  e\  and  es  are  multiple 
edges.  Edge  e 4  is  incident  with  vertices  V3  and  14.  Edge  eg  is  a  loop  because  it  is 
incident  only  with  vertex  14.  Vertices  14  and  14  are  adjacent  while  v\  and  14  are 
not.  Edges  e\  and  e\  are  adjacent  while  e\  and  e\  are  not.  Vertex  v\  has  degree 
2,  so  i>i  is  an  even  vertex.  Counting  the  loop  incident  with  14  twice  (as  it  enters 
and  leaves),  14  is  also  an  even  vertex,  with  degree  4.  Vertices  14  and  14  are  odd: 
deg  14  =  3  and  deg  14  =  1 . 

It  is  unfortunate  that  there  is  some  lack  of  standardization  of  terminology 
in  graph  theory,  especially  in  a  subject  which  has  more  than  its  share  of  tech¬ 
nical  terms.  Many  words  have  almost  “obvious”  meanings,  which  are  the  same 
from  book  to  book,  but  other  terms  are  used  differently  by  different  authors.  It 
is  consequently  vital  when  perusing  books  or  articles  on  graph  theory  never  to 
assume  that  you  know  the  meanings  of  the  graph  theoretical  terms  employed. 
Always  check  the  author’s  definitions  carefully.  For  convenience,  the  definitions 
which  we  employ  in  this  book  are  summarized  in  a  glossary  at  the  end.  Read¬ 
ers  may  have  also  noticed  on  the  inside  covers  a  description  of  symbols  and 
notation. 

A  graph  Q\  is  a  subgraph  of  another  graph  Q  if  and  only  if  the  vertex  and  edge 
sets  of  Q\  are,  respectively,  subsets  of  the  vertex  and  edge  sets  of  Q. 

Each  of  the  three  graphs  Q\,  Q2,  and  Qj,  shown  on  the  right  of  Fig  9.15  is  a 
subgraph  of  the  graph  Q  on  the  left  in  this  figure.  (The  subgraphs  do  not  have  to 
be  drawn  the  same  way  they  appear  in  the  presentation  of  Q.)  A 


Figure  9.15 


A  graph  and  three  subgraphs. 


Graphs 

EXAMPLE  3 


Figure  9.16 


Figure  9.17 


9.2.3  DEFINITION 


9.2.4  DEFINITIONS 


Figure  9.16  depicts  a  graph  Q  and  two  subgraphs  Q\  and  Qi.  The  last  graph  in 
this  figure,  6/3,  is  not  a  subgraph  of  Q  because  the  two  vertices  of  degree  3  in  Q 
are  not  adjacent.  A 


A  graph  Q,  two  subgraphs,  Q \  and  Q2,  and  a  graph  t/3  which  is  not  a  subgraph  of  Q. 

A  useful  tool  in  graph  theory  is  the  deletion  of  an  edge  or  a  vertex  from  a 
graph.  If  e  is  an  edge  in  a  graph  Q ,  we  shall  abuse  notation  and  write  Q  \  { e } 
to  denote  that  subgraph  of  Q  =  Q{V,  £ )  which  has  the  same  vertex  set  as  Q  but 
whose  edge  set  is  £  \  { e }.  For  example,  in  Fig  9.16,  the  subgraph  Q \  is  Q  \  {e}. 
Similarly,  Q  \  { v }  denotes  the  graph  Q  with  the  vertex  v  removed.  The  deletion 
of  a  vertex  requires  some  care,  since  when  a  vertex  is  removed  from  a  graph,  all 
edges  incident  with  that  vertex  must  also  be  removed.  Again  with  reference  to 
Fig  9.16,  subgraph  Q  \  {u}  is  Q2 ,  not  £3. 

Each  of  the  graphs  pictured  in  Fig  9.17  has  as  many  edges  as  possible 
(multiple  edges  and  loops  are  not  permitted  in  a  “graph”).  Such  graphs  have 
a  name. 


K.  1  K.2  K.  3  1C  4  1C  5 

The  first  five  complete  graphs. 


For  any  positive  integer  n ,  the  complete  graph  on  n  vertices,  denoted  /C„,  is  that 
graph  with  n  vertices  every  two  of  which  are  adjacent. 

The  graphs  in  Fig  9. 1 8  are  not  complete,  though  they  have  other  interesting 
properties.  In  each  of  them,  no  two  top  vertices  are  adjacent  and  no  two  bottom 
vertices  are  adjacent  (they  are  examples  of  bipartite  graphs),  and  in  the  two 
rightmost  graphs  every  top  vertex  is  adjacent  to  every  bottom  vertex  (these  two 
are  complete  bipartite  graphs). 

A  bipartite  graph  is  one  whose  vertices  can  be  partitioned  into  two  (disjoint)  sets 
Vi  and  V2,  called  bipartition  sets ,  in  such  a  way  that  every  edge  joins  a  vertex 
in  V|  and  a  vertex  in  V2 ■  (In  particular,  there  are  no  edges  within  V\  nor  within 
V2.)  A  complete  bipartite  graph  is  a  bipartite  graph  in  which  every  vertex  in  V] 
is  joined  to  every  vertex  in  Vt.  The  complete  bipartite  graph  on  bipartition  sets 
of  m  vertices  and  n  vertices,  respectively,  is  denoted  ICm  n. 
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^2,3  £3,3 


Figure  9.18  Three  bipartite  graphs,  two  of  which  are  complete  bipartite. 

Figure  9.18  shows  typical  pictures  of  K. 2,3  and  /C33.  The  one  vertex  graph 
1C  1  shown  in  Fig  9.17  is  also  bipartite  since  bipartition  sets  are  not  required  to 
be  nonempty.  It  is  helpful  to  note  that  a  graph  is  bipartite  if  and  only  if  its 
vertices  can  be  colored  with  two  colors  such  that  every  edge  has  ends  of  different 
colors. 


EXAMPLE  4  Consider  the  graph  on  the  left  of  Fig  9.19.  Coloring  vertex  1  red,  vertex  2  white, 
and  continuing  to  alternate  these  two  colors  through  the  vertices  3, . . .  ,8  gives 
every  edge  different  colored  ends.  So  the  graph  is  bipartite,  the  bipartition  sets 
being  the  red  vertices  and  the  white  vertices.  By  grouping  these  bipartition  sets, 
the  graph  can  be  redrawn  so  that  it  more  obviously  appears  bipartite,  as  shown 
on  the  right  of  Fig  9.19.  A 


Figure  9.19  A  graph  and  a  way  to  show  it  is  bipartite. 


In  the  graph  shown  in  Fig  9.19,  what  are  the  vertex  sets  {1,  3,  5,  7}  and  {2,  4,  6,  8} 
called?  Is  the  graph  complete?  Is  it  complete  bipartite? 

It  is  not  hard  to  show  that  a  bipartite  graph  can  contain  no  triangles ,4  (See 
Exercise  21.)  For  instance,  graph  Q 3  in  Fig  9.16  is  not  bipartite  because  it  contains 
a  triangle  (in  fact,  two  triangles). 

Look  at  the  graphs  accompanying  Exercise  5  in  Section  9.1.  Are  either  of  these 
bipartite?  Explain. 

Two  very  useful  properties  of  graphs  are  described  in  the  proposition  and 
corollary  which  follow.  With  one  proviso,  the  proofs  we  present  are  valid  gen¬ 
erally,  so  we  state  the  propositions  for  pseudographs.  The  proviso  is  that  a  loop 
at  a  vertex  shall  add  two  to  the  degree  of  that  vertex.  [With  reference  to  the 
pseudograph  in  Fig  9.14,  recall  that  deg  113  =  4.] 


4 A  triangle  in  a  graph  is  a  set  of  three  vertices  with  an  edge  joining  each  pair. 
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9.2.5  PROPOSITION 


(Euler)  The  sum  of  the  degrees  of  the  vertices  of  a  pseudograph  is  an  even  num¬ 
ber  equal  to  twice  the  number  of  edges.  In  symbols,  if  Q(V,  £)  is  a  pseudograph, 
then 

^  deg  v  =  2\£\. 

tieV 


Proof  Adding  the  degrees  of  all  the  vertices  involves  counting  one  for  each  edge  incident 
with  each  vertex.  How  many  times  does  an  edge  get  counted?  If  it  is  not  a  loop, 
it  is  incident  with  two  different  vertices  and  so  gets  counted  twice,  once  at  each 
vertex.  On  the  other  hand,  a  loop  at  a  vertex  is  also  counted  twice,  by  convention, 
in  the  degree  of  that  vertex.  I 


EXAMPLE  5  The  graph  in  Fig  9.19  has  eight  vertices  each  of  degree  3.  Since 

^  deg  v  —  8(3)  =  24  =  2\£\, 

veV 

it  must  have  12  edges,  and  it  does.  ▲ 

EXAMPLE  6  The  pseudograph  in  Fig  9. 14  has  vertices  of  degrees  4,  3,  2,  1.  Since  4+ 3 +2+1  = 
10,  the  pseudograph  must  have  five  edges,  and  it  does.  (Note  that  a  loop  is  one 
edge,  but  it  adds  two  to  the  degree.)  ▲ 


How  many  edges  does  1C 3,6  contain?  This  complete  bipartite  graph  has  six 
vertices  of  degree  3  and  three  of  degree  6.  Since 

deg  v  =  6(3)  +  3(6)  =  36  =  2\£\, 

ueV 

1C 3,6  has  18  edges. 


9.2.6  DEFINITION 


Suppose  that  d\ ,  da, . . .  ,  dn  are  the  degrees  of  the  vertices  of  a  graph  (or  pseudo¬ 
graph)  Q,  ordered  so  that  d\  >  di  >  •  ■  •  >  dn.  Then  d\,  di, . . .  ,  dn  is  called  the 
degree  sequence  of  Q. 


EXAMPLES  7 


•  The  degree  sequence  of  the  pseudograph  in  Fig  9.14  is  4,  3,  2,  1. 

•  1C 2,3  has  degree  sequence  3,  3,  2,  2,  2.  (See  Fig  9.18.) 


Why  can  there  not  exist  a  graph  whose  degree  sequence  is  5,  4,  4,  3,  2,  1? 


9.2.7  COROLLARY 


The  number  of  odd  vertices  in  a  pseudograph  is  even. 


▲ 


Proof  By  Proposition  9.2.5,  ^ugVdegi!  =  2|£|  is  an  even  number.  Since 

deg  v  =  ^  deg  v  +  ^  deg  v 

veV  veV  veV 

v  even  v  odd 

and  the  first  sum  on  the  right,  being  a  sum  of  even  numbers,  is  even,  so  also  the 
second  sum 

deg  v 

veV 
v  odd 
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must  be  even.  Since  the  sum  of  an  odd  number  of  odd  numbers  is  odd,  the  number 
of  terms  in  the  sum  here,  that  is,  the  number  of  odd  vertices,  must  be  even.  I 

The  pseudograph  in  Fig  9.14,  for  instance,  has  two  odd  vertices,  iq  and  1)4. 
The  complete  bipartite  graph  £3,6  has  six  odd  vertices  (each  of  degree  3). 

hind  the  degree  of  each  vertex  of  the  pseudograph  Q  shown  in  Fig  9.20.  What  is 
the  degree  sequence  for  Ql  Verify  that  the  sum  of  the  degrees  of  the  vertices  is  an 
even  number.  Which  vertices  are  even?  Which  are  odd?  Verify  that  the  number 
of  odd  vertices  is  even. 


V]  v4  v6 


Figure  9.20 


Answers  to  Pauses 


2.  The  given  vertex  sets  are  the  bipartition  sets  of  the  graph.  The  graph  is 
not  complete  since,  for  example,  vertices  1  and  3  are  not  incident  with  an 
edge;  it  isn’t  a  complete  bipartite  graph  either  because,  for  example,  ver¬ 
tices  1  and  6,  which  lie  in  different  bipartition  sets,  are  not  incident  with  an 
edge. 

3.  Neither  graph  is  bipartite.  The  graph  in  (a)  contains  the  triangle  141414. 
While  the  graph  in  (b)  does  not  contain  a  triangle,  it  contains  the  5-cycle 
1414141414112,  which,  like  a  triangle,  causes  problems.  Try  coloring  the  ver¬ 
tices  of  the  graph  in  (b)  with  two  colors — say  red  and  white — so  that  the  ends 
of  every  edge  have  different  colors.  If  V2  were  colored  white,  then  14  and  14 
would  have  to  be  red,  so  14  and  vj  would  have  to  be  white.  Thus,  the  edge 
V(,vj  would  have  ends  of  the  same  color. 

4.  The  sum  of  the  degrees  is  19,  which  is  not  an  even  number. 

5.  Vertices  tq,  to,  tq,  14,  14,  14  have  degrees  2,  4,  3,  2,  1,  and  0,  respectively. 
The  degree  sequence  of  Q  is  4,  3,  2,  2,  1,  0.  The  sum  of  the  degrees  is 
4  +  34-2  +  2+  1+0  =  12.  The  even  vertices  are  tq ,  iq,  14,  and  14;  the  odd 
vertices  are  1)3  and  14.  There  are  two  odd  vertices. 


EXERCISES 


The  symbol  [BBj  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Draw  a  graph  with  five  vertices  iq,  iq,  tq,  14,  14 
such  that  deg  tq  =  3,  tq  is  an  odd  vertex,  deg  iq  =  2, 
and  V4  and  iq  are  adjacent. 

2.  Draw  all  possible  graphs  with  three  vertices  iq,  iq,  tq. 
How  many  edges  are  there  in  each  graph?  What  is  the 


degree  sequence  of  each  graph?  Does  this  question  make 
sense  for  pseudographs?  Explain. 

3.  [BB]  Give  an  example  of  a  graph  such  that  every  vertex 
is  adjacent  to  two  vertices  and  every  edge  is  adjacent  to 
two  edges. 
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4.  (a)  How  many  vertices  and  how  many  edges  does  the 
pseudograph  contain?  What  is  the  degree  sequence 
of  this  pseudograph? 


(b)  Verify  Proposition  9.2.5  and  Corollary  9.2.7  for  this 
pseudograph. 

5.  [BB]  Draw  a  graph  with  five  vertices  and  as  many  edges 
as  possible.  How  many  edges  does  your  graph  contain? 
What  is  the  name  of  this  graph  and  how  is  it  denoted? 

6.  (a)  What  is  the  maximum  degree  of  a  vertex  in  a  graph 

with  n  vertices? 

(b)  What  is  the  maximum  number  of  edges  in  a  graph 
with  n  vertices? 

(c)  Given  a  natural  number  n,  does  there  exist  a  graph 
with  n  vertices  and  the  maximum  possible  number 
of  edges? 

7.  Draw  Kn,  K. 3,4,  and  £2,6- 

8.  Draw  a  graph  with  64  vertices  representing  the  squares 
of  a  chessboard.  Connect  two  vertices  with  an  edge  if 
you  can  move  legally  between  the  corresponding  squares 
with  a  single  move  of  a  knight.  [The  moves  of  a  knight 
are  L-shaped,  two  squares  vertically  (or  horizontally) 
followed  by  one  square  horizontally  (respectively,  ver¬ 
tically).] 

(a)  Explain  why  this  graph  is  bipartite. 

(b)  What  are  the  degrees  of  the  vertices? 

9.  Consider  again  the  graphs  accompanying  Exercise  5  of 
Section  9.1,  which  we  reproduce  here. 

vi  v4  v6 


1>3  l>5  VI 

(a)  [BB]  For  the  graph  on  the  left, 

i.  Make  a  table  which  shows  the  least  number  of 
edges  joining  each  pair  of  vertices  in  this  graph. 
(Such  a  table  displays  the  least  number  of  stops 
required  on  air  trips  between  cities  in  the  region 
depicted  by  the  graph.) 


ii.  Add  up  the  numbers  in  each  column  of  the  table. 
Divide  each  column  total  by  the  degree  of  the 
corresponding  vertex.  These  ratios  are  called 
accessibility  indices  since  they  measure  the  rel¬ 
ative  accessibility  of  the  cities  (by  air).  Which 
city  is  the  most  accessible?  Which  is  the  least 
accessible? 

iii.  Suppose  a  direct  flight  joining  cities  iq  and  1)3 
is  introduced.  What  is  the  new  beta  index  of  the 
graph?  What  are  the  new  accessibility  indices? 
Which  city  is  most  accessible  now?  Which  city 
is  now  least  accessible? 

iv.  Repeat  part  (iii),  assuming  a  flight  is  introduced 
between  cities  V2  and  ng  instead  of  between  v\ 
and  1)3. 

(b)  Repeat  the  preceding  questions  for  the  graph  on  the 
right. 

10.  [BB]  Verify  Proposition  9.2.5  and  Corollary  9.2.7  for 
the  complete  graph  A?„.  What  is  the  beta  index  of  /C„? 
(See  Exercise  5,  Section  9.1.) 

11.  Verify  Proposition  9.2.5  and  Corollary  9.2.7  for  the 

complete  bipartite  graph  What  is  the  beta  index 

°f  rl  ? 

12.  [BB]  At  most  social  functions,  there  is  a  lot  of  hand¬ 
shaking.  Prove  that  the  number  of  people  who  shake  the 
hands  of  an  odd  number  of  people  is  always  even. 

13.  Which  of  the  graphs  is  a  subgraph  of  the  graph  in 
Fig  9.19? 


(i)  (ii)  (iii)  (iv)  (v) 

14.  For  each  pair  of  graphs  shown,  discover  whether  or  not 
the  graph  on  the  left  is  a  subgraph  of  the  one  on  the  right. 
If  it  is  not,  explain  why  not.  If  it  is,  label  the  vertices 
of  the  subgraph,  then  use  the  same  symbols  to  label  the 
corresponding  vertices  of  the  graph  on  the  right. 


9.2  Definitions  and  Basic  P 


15.  For  each  of  the  graphs  shown,  draw  pictures  of  the  sub¬ 
graphs  Q  \  (e),  g  \  {u},  and  Q  \  («). 


v 


u 


16.  (a)  [BB]  What  are  the  degrees  of  the  vertices  in  the 
pseudograph? 


(b)  [BB]  Can  there  exist  a  graph  with  four  vertices  of 
degrees  1,2,  3,  and  4? 

17.  For  each  of  the  following  sequences,  determine  if  there 
exists  a  graph  whose  degree  sequence  is  the  one  speci¬ 
fied.  In  each  case,  either  draw  a  graph,  or  explain  why 
no  graph  exists. 

(a)  [BB]  4,  4,  4,  3,  2 

(b)  100,99,98,...  ,3,  2,  2,  2 

(c)  [BB]  5,  5,  4,  3,  2,  1 

(d)  1,  1,  1,  1,  1,  1 

(e)  5,  4,  3,  2,  1 

(f)  5,  4,  3,  2,  1,  1 


(g)  6,  6,  4,  2,  2,  2,  1,  1 

18.  Does  there  exist  a  graph  with  five  ve 
tex  incident  with  at  least  one  edge  l 
adjacent?  Explain. 

19.  (a)  [BB]  A  graph  has  five  vertices  of 

vertices  of  degree  2.  How  many  eo 
(b)  A  graph  has  degree  sequence  5,  5 
How  many  edges  does  it  have? 

20.  Determine  whether  or  not  each  of  the  g 
In  each  case,  give  the  bipartition  sets  o 
graph  is  not  bipartite. 


21.  [BB]  Prove  that  a  graph  which  contains  t 
be  bipartite. 

22.  (a)  Must  a  subgraph  of  a  bipartite  grapl 
(b)  Would  your  answer  to  (a)  change  if,  i 

of  a  bipartite  graph,  bipartition  sets  w 
be  nonempty? 

Explain  your  answers. 

23.  [BB]  (Requires  calculus)  Prove  that  the  nu 
in  a  bipartite  graph  with  n  vertices  is  at  n 

24.  How  many  complete  bipartite  graphs  have 

25.  Let  V  =  (1,2,3 . n }. 

(a)  [BB]  How  many  graphs  are  there  with  v 

(b)  How  many  of  the  graphs  in  (a)  contain 
123? 

(c)  [BB]  What  is  the  total  number  of  triangle: 
graphs  with  vertex  set  V? 

(d)  On  average,  how  many  triangles  does  a  grai 
labeled  vertices  contain? 
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26.  Suppose  a  graph  has  nine  vertices  each  of  degree  5  or 
6.  Prove  that  at  least  five  vertices  have  degree  6  or  at 
least  six  vertices  have  degree  5. 

27.  [BB]  What  is  the  largest  possible  number  of  vertices 
in  a  graph  with  35  edges,  all  vertices  having  degree  at 
least  3? 

28.  Let  m  and  M  denote  the  minimum  and  the  maximum 
degrees  of  the  vertices  of  a  graph  Q  with  vertex  set  V 
and  edge  set  £.  Show  that 

2|£| 

m  <  -  <  M . 

~  |V|  ~ 

29.  [BB]  Suppose  all  vertices  in  a  graph  have  odd  degree  k. 
Show  that  the  total  number  of  edges  in  Q  is  a  multiple 
of  k. 

30.  [BB]  Prove  that  in  any  graph  with  more  than  one  vertex, 
there  must  exist  two  vertices  of  the  same  degree.  [Hint: 
Pigeon-Hole  Principle.] 


31.  Show  that  a  set  of  nonnegative  integers  [d\ ,  dj,  ■  ■ .  ,  dn\ 
is  the  set  of  degrees  of  some  pseudograph  if  and  only  if 
YTi=\  di  is  even. 

32.  Suppose  that  d\,  d2,  ■  • .  ,d„  are  the  degrees  of  the  ver¬ 
tices  in  some  graph.  Show  that  for  any  t  <  n, 

t  n 

Y]  d,  <  t(t  —  1)  +  Y]  min{f,  di}. 

1  =  1  i  =t-\- 1 

Remark'.  The  given  condition  (holding  for  all  t  <  n) 
and  the  requirement  that  ^"=1  di  is  even  are  sufficient 
as  well  as  necessary  for  the  existence  of  a  graph  with  a 
prescribed  set  d\ , . . .  ,  d„  of  degrees.5 

33.  Can  there  exist  a  graph  with  13  vertices,  31  edges,  three 
vertices  of  degree  1,  and  seven  vertices  of  degree  4? 
Explain. 


9.3  Isomorphism 

It  is  important  to  know  when  two  graphs  are  essentially  the  same  and  when 
they  are  essentially  different.  When  we  say  graphs  are  “essentially  the  same,”  we 
mean  that  they  differ  only  in  the  way  they  are  labeled  or  drawn.  There  should 
be  a  one-to-one  correspondence  between  the  vertices  of  the  graphs  and  a  one- 
to-one  correspondence  between  their  edges  such  that  corresponding  vertices  are 
incident  with  corresponding  edges.  The  proper  word  for  “essentially  the  same”  is 
isomorphic. 

There  is  a  distinction  between  a  graph  and  its  picture.  A  graph  is  a  set  V  and  a 
set  £  of  unordered  pairs  of  elements  of  V.  A  picture  of  it  consists  of  dots  and  lines 
which  can  be  drawn  and  arranged  in  many  different  ways.  The  bipartite  graphs 
pictured  in  Fig  9.19,  for  instance,  are  isomorphic:  The  two  pictures  represent  the 
same  graph  £),  where 

V  =  {1,2,  3,  4.  5,  6,  7,  8} 

and 

£  =  {12,  14,  18,  23,  27,  34,  36,  45,  56,  58,  67,  78}. 

EXAMPLE  8  Graphs  Q2  and  G3  in  Fig  9.21  each  consist  of  two  edges  incident  with  a  common 
vertex.  They  are  drawn  differently,  but  the  graphs  are  the  same:  G2  and  G3  are 
isomorphic.  The  picture  of  Q\  indicates  that  this  graph  has  only  one  edge:  Q\  is 
different  from  Q2  in  an  essential  way;  it  is  isomorphic  to  neither  Q2  nor  Q3.  A 

5P.  Erdos  and  T.  Gallai,  “Graphs  with  Prescribed  Degrees  of  Vertices,"  Matematikai  Lapok  11  (1960), 
264-274. 
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ePb — e 
03 


Figure  9.21  Q2  and  C/3  are  isomorphic,  but  neither  is  isomorphic  to  Q\ . 


EXAMPLE  9  Figure  9  .22  illustrates  two  graphs  which  are  not  isomorphic:  Each  graph  consists 
of  four  vertices  and  four  edges,  but  Q2  contains  a  vertex  of  degree  1,  while  Q\ 
has  no  such  vertex.  ▲ 


a - o 

0 - o 

01  02 

Figure  9.22  Two  nonisomorphic  graphs. 


EXAMPLE  10  The  graphs  in  Fig  9.23  are  isomorphic  and  they  have  been  labeled  so  as  to  show 
corresponding  vertices.  Either  picture  represents  a  graph  with  four  vertices  A,  B, 
C,  D,  and  four  edges  AB,  BC,  CD,  and  DA.  A 

AD  AD 

a - e 


0 - & 

B  C  C  B 

Figure  9.23  Two  graphs  labeled  so  as  to  show  that  they  are  isomorphic. 


9.3.1  DEFINITION 


Show  that  the  two  graphs  to  the  right  are 
isomorphic  by  assigning  the  labels  A,  B, 

C,  and  D  to  appropriate  vertices  of  the 
graph  on  the  right.  (It  might  help  to  think 
of  the  edges  of  02  as  pieces  of  string  knot¬ 
ted  at  the  vertices.  How  could  02  be  rear¬ 
ranged  to  look  like  0i?  Once  you  see  this, 
it  will  be  easy  to  do  the  required  labeling.) 

Here  is  the  precise  definition  of  the  term  “isomorphic.” 

Given  graphs  Q\  —  Q\(V\ ,  £\ )  and  02  =  02  (V2,  £2),  we  say  that  Q\  is  isomorphic 
to  02  and  write  Q\  =  02  if  there  is  a  one-to-one  function  y  from  Vi  onto  V2  such 
that 


•  if  vw  is  an  edge  in  £\,  then  <p(v)(p(w)  is  an  edge  is  £2,  and 

•  every  edge  in  £2  has  the  form  <p(v)<p(w)  for  some  edge  vw  €  £\ . 

We  call  (p  an  isomorphism  from  Q\  to  02  and.  abusing  notation,  say  that  (p :  Q\ 
02  is  an  isomorphism. 
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The  definition  of  isomorphism  is  symmetric:  If  Q\  is  isomorphic  to  Q2 ,  then  Qi 
is  isomorphic  to  Q\.  In  fact,  if  ip :  Q\  -»  Q2  is  an  isomorphism,  then  ip~x  :  Q2  — >■  Q\ 
is  an  isomorphism.  Thus,  there  is  no  ambiguity  if  we  simply  say  that  two  graphs 
“are  isomorphic.” 

We  often  say  that  graphs  are  isomorphic  if  and  only  if  there  is  a  bijection 
between  their  vertex  sets  which  “preserves  incidence  relations.”  By  this,  we  mean 
that  a  vertex  v  is  incident  with  an  edge  e  in  Q\  if  and  only  if  (p{v )  is  incident  with 
<p(e)  in  Q2.  An  isomorphism  (p  simply  relabels  vertices  without  changing  any  of 
the  incidence  relations.  If  <p( v)  =  x ,  think  of  .r  as  the  new  label  for  v. 

EXAMPLE  11  Consider  the  two  graphs  shown  at  the 
right.  We  encountered  these  earlier  in 
Fig  9.23.  Remembering  our  previous  dis¬ 
cussion,  we  see  that  if  vertex  u  of  Q2  is 
relabeled  A,  if  vertex  v  is  relabeled  C,  if 
w  is  relabeled  D,  and  if  x  is  relabeled  B, 
then  the  pictures  represent  precisely  the 
same  graph.  Having  seen  how  to  relabel 
the  vertices  of  Q2  with  the  labels  of  Q\ ,  it 
is  easy  to  write  down  the  isomorphism  Q2 

(p(u)  =  A,  <p(v)  =  C,  <p(w)  =  D,  (p{x)  —  B.  A 

■Hi  Use  your  solution  to  Pause  6  to  find  an  isomorphism  (p:  Q2  ->  Q\  for  the  graphs 
Q\,  Q2  shown  in  Fig  9.24. 

A  B 


Figure  9.24  D  C 

The  notion  of  isomorphism  is  exceedingly  important  in  mathematics.  If  the 
definition  looks  complicated,  the  idea  is  very  simple.  Isomorphic  objects,  from 
a  mathematical  standpoint,  are  the  same;  they  differ  only  in  appearance.  This 
idea  is  not  new:  ^  and  0.5  look  different,  but  they  represent  the  same  real 
number. 

We  have  remarked  that  the  definition  of  isomorphism  is 
symmetric:  Q\  =  Q2  if  and  only  if  Q2  =  Gi¬ 
lt  is  also 

reflexive:  Q  =  Q  for  any  graph  Q 
(because  the  identity  map  Q  Q  is  an  isomorphism)  and 
transitive:  If  Q\  =  Q2  and  Q2  =  Q3,  then  Q\  =  Q3 
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(because,  if  <p\ :  Gi  -»  G2  and  (p2‘-  Gi  Gi  are  isomorphisms,  then  so  is  the 
composition  <p2°ipi-  Gi  —r  Gi)-  Thus,  isomorphism  is  an  equivalence  relation  on 
the  set  of  all  graphs. 

The  set  of  all  graphs  is  therefore  partitioned  into  disjoint  equivalence  classes 
called  isomorphism  classes.  Any  two  graphs  in  the  same  equivalence  class  are 
isomorphic;  two  graphs  in  different  equivalence  classes  are  not  isomorphic.  The 
graphs  pictured  in  Pause  6  belong  to  the  same  equivalence  class;  the  graphs  pic¬ 
tured  in  Fig  9.22  belong  to  different  equivalence  classes.  When  we  casually  remark 
that  two  graphs  are  “different,”  we  really  mean  “in  different  isomorphism  classes.” 

Usually,  it  is  very  difficult  to  prove  that  two  graphs  are  isomorphic.  In  princi¬ 
ple,  we  have  to  list  all  the  one-to-one  onto  functions  between  vertex  sets  and,  in 
each  case,  check  whether  or  not  the  function  preserves  incidence  relations.  On  the 
other  hand,  it  is  often  easy  to  prove  that  graphs  are  not  isomorphic.  For  instance, 
the  graphs  G\  and  Gi  in  Fig  9.21  cannot  possibly  be  isomorphic  because  Gi  is 
connected  in  the  sense  that  there  is  a  sequence  of  adjacent  edges  between  any 
two  vertices,  while  G  i  is  not  connected. 

Since  an  isomorphism  is  a  one-to-one  onto  function  between  vertex  sets, 
isomorphic  graphs  have  the  same  numbers  of  vertices.  Many  other  properties  are 
shared  by  isomorphic  graphs. 

Suppose  cp:  G\  — >  Gi  is  an  isomorphism  from  a  graph  G i  to  another  graph 
Gi-  If  v  is  a  vertex  of  degree  k  in  G\  and  if  iq ,  V2, . . .  ,  v*  are  the  vertices  adjacent 
to  v,  then,  in  Q2,  <p(v)  is  adjacent  to  the  k  vertices  <p(v  1),  <p(v 2),  ....  <p(vk),  but 
to  no  other  vertex.  Thus,  the  degree  of  cp(v)  is  also  k.  It  follows  that  isomorphic 
graphs  have  the  same  degree  sequences  and  hence  also  the  same  numbers  of 
edges,  since  the  number  of  edges  in  a  graph  is  one  half  the  sum  of  the  vertex 
degrees  (Proposition  9.2.5). 


9.3.2  PROPOSITION 


If  G 1  and  Gi  are  isomorphic  graphs,  then  G\  and  G2  have  the 

•  same  number  of  vertices, 

•  same  number  of  edges,  and 

•  same  degree  sequences. 


The  graphs  Q \  and  Gi  shown  in  Fig  9. 1 5  are  not  isomorphic  because  they  have 
different  numbers  of  vertices.  The  graphs  shown  in  Fig  9.22  are  not  isomorphic 
because  Gi  contains  a  vertex  of  degree  1  while  G 1  does  not. 

Take  care  not  to  misinterpret  Proposition  9.3.2.  The  proposition  asserts  three 
implications  of  the  form 


“Gi  =  Gi  -*■ 

which  must  not  be  confused  with  double  implications  of  the  form 

“Gi  =  Gi  ■  ■  ■  ” 


Proposition  9.3.2  says  that  if  two  graphs  are  isomorphic,  then  they  must  have 
certain  properties.  It  does  not  say  that  two  graphs  with  the  properties  listed  are 
isomorphic.  In  fact,  it  is  quite  possible  for  two  graphs  which  are  not  isomorphic 
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to  have  the  same  numbers  of  vertices  and  edges  and  the  same  degree  sequences. 
See  Exercises  7  and  8. 


Answers  to  Pauses 


6.  To  the  right,  we  show  one  of  several 
ways  in  which  the  vertices  of  Qo  can  be 
labeled  so  that  it  becomes  clear  that  the 
graph  represented  by  each  picture  is  the 
same. 


7.  <p(u)  =  B,  cp(v)  =  C,  <p(w)  =  A,  <p(x)  =  D.  The  isomorphism  tp  just  relabels 
vertices  of  Qj- 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  For  each  of  the  ten  pairs  of  graphs  which  can  be 
obtained  from  those  shown,  either  label  the  graphs  so 
as  to  exhibit  an  isomorphism  or  explain  why  the  graphs 
are  not  isomorphic. 


(i)  (ii)  (iii) 


(iv)  (v) 


2.  (a)  Draw  a  graph  isomorphic  to  the  one  shown  on  the 
left,  but  with  no  crossover  of  edges. 


(b)  Same  as  (a)  for  the  graph  on  the  right. 

3.  (a)  [BB]  Draw  all  nonisomorphic  graphs  on  n  =  3  ver¬ 

tices.  Give  the  degree  sequence  of  each. 

(b)  Repeat  part  (a)  for  n  =  4. 

4.  For  each  pair  of  graphs  shown, 

•  if  the  graphs  are  not  isomorphic,  explain  why  not; 


•  if  the  graphs  are  isomorphic,  exhibit  an  isomorphism 
from  one  to  the  other  and  relabel  the  graph  on  the 
right  so  as  to  show  this  isomorphism. 
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5.  (a)  [BB]  Is  the  graph  on  the  left  isomorphic  to  £3,4?  8.  Consider  the  following  three  graphs. 


(b)  Is  the  graph  on  the  right  isomorphic  to  /C4  4? 
Explain. 

6.  [BB]  Explain  why  any  graph  is  isomorphic  to  a  subgraph 
of  some  complete  graph. 


9.3.3  DEFINITION 


Suppose  U| . vn  is  a  set  of  n  vertices  in  a  graph  such 

that  v,  and  u,+i  are  adjacent  for  1  <  i  <  n  —  1  and  v„ 
and  i>i  are  also  adjacent.  Then  the  set  of  these  n  vertices 
and  the  n  edges  V]V2,  U2U3, . . .  v„-ivn,  v„v\  is  called  an  n- 
cycle.  A  3-cycle  is  often  called  a  triangle  and  a  4-cycle  a 
quadrilateral. 

7.  (a)  [BB]  Prove  that  two  graphs  which  are  isomorphic 
must  contain  the  same  number  of  triangles. 

(b)  Prove  that,  for  any  n  >  4,  two  isomorphic  graphs 
must  contain  the  same  number  of  w-cycles. 

(c)  How  many  edges  are  there  in  the  graphs  Q\  and  Qi'l 
How  many  vertices?  What  is  the  degree  sequence 
of  each  graph?  Are  the  graphs  isomorphic?  Explain. 
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(a)  How  many  vertices  and  how  many  edges  are  there 
in  each  graph?  What  is  the  degree  sequence  of  each 
graph?  How  many  triangles  are  in  each  graph? 

(b)  For  each  pair  of  graphs,  either  exhibit  an  isomor¬ 
phism  between  vertex  sets  or  explain  why  the  graphs 
are  not  isomorphic. 

9.  Show  that  the  following  graphs  are  not  isomorphic. 


G  H  1  8 


10.  (a)  [BB]  Suppose  that  graphs  Q  and  H  have  the  same 
numbers  of  vertices  and  the  same  numbers  of  edges, 
and  suppose  that  the  degree  of  every  vertex  in  Q  and 
in  7 i  is  2.  Are  Q  and  7i  necessarily  isomorphic? 
Explain. 

(b)  Suppose  that  graphs  Q  and  H  have  the  same  number 
of  vertices  and  the  same  number  of  edges.  Suppose 
that  the  degree  sequences  of  Q  and  H  are  the  same 
and  that  neither  graph  contains  a  triangle.  Are  Q  and 
H  necessarily  isomorphic?  Explain. 


REVIEW  EXERCISES  FOR  CHAPTER  9 


1.  In  the  Konigsberg  Bridge  Problem,  a  tragic  fire  destroys 
the  bridge  from  B  to  C  and  also  one  of  the  bridges  from 
A  to  D.  (See  Fig  9.1.)  Draw  a  graph  representing  the 
new  situation.  Show  that  it  is  now  possible  for  someone 
to  start  on  land  mass  B  and  walk  over  each  of  the  bridges 
exactly  once,  returning  to  land  mass  B  again. 


2.  (a)  Draw  a  configuration  of  four  houses  and  two  utili¬ 
ties,  each  house  connected  to  each  utility,  but  with 
no  crossovers. 

(b)  Let  n  be  any  positive  integer.  Motivated  by  2(a), 
suggest  a  general  result  concerning  n  houses  and  2 
utilities.  Draw  a  graph  supporting  your  answer. 
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Figure  9.25  Cubes  for  Exercise  3. 


3.  Find  solutions,  where  possible,  for  the  cube  games  pic¬ 
tured  in  Fig  9.25. 

4.  (a)  Draw  a  graph  with  six  vertices  at  least  three  of 

which  are  odd  and  at  least  two  of  which  are  even. 

(b)  Draw  a  graph  with  six  vertices  at  most  three  of 
which  are  odd  and  at  least  two  of  which  are  even. 

(c)  Is  it  possible  to  find  a  graph  which  satisfies  the 
conditions  in  both  4(a)  and  4(b)  simultaneously? 
Explain  your  answer. 

5.  (a)  Why  is  it  not  possible  for  a  graph  to  have  degree 

sequence  6,  6,  5,  5,  4,  4,  4,  4,  3? 

(b)  Why  is  it  not  possible  for  a  graph  to  have  degree 
sequence  8,  8,  7,  6,  5,  4,  3,  2,  1? 

6.  Let  Q  be  a  graph  and  let  H  be  a  subgraph  of  Q.  Assume 
Ti  contains  at  least  three  vertices. 


(a)  Is  it  possible  for  Q  to  be  bipartite  and  for  H  to  be  a 
complete  graph? 

(b)  Is  it  possible  for  Q  to  be  a  complete  graph  and  for 
H.  to  be  bipartite? 

Explain  your  answers. 

7.  Suppose  a  graph  has  49  vertices,  each  of  degree  4  or  5. 
Prove  that  at  least  25  vertices  have  degree  4  or  at  least 
26  vertices  have  degree  5. 

8.  Suppose  Q  is  a  graph  with  n  vertices,  n  edges  and  no 
vertices  of  degree  0  or  I .  Prove  that  every  vertex  of  Q 
has  degree  2. 

9.  For  each  pair  of  graphs  shown, 

•  if  the  graphs  are  not  isomorphic,  explain  why  not; 

.  if  the  graphs  are  isomorphic,  exhibit  an  isomorphism 
from  one  to  the  other. 
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10.  Determine  whether  or  not  the  two  graphs  are  isomorphic. 
(The  one  on  the  left  is  the  "Petersen”  graph,  which  we 
will  encounter  again  in  Chapter  10.) 


11.  For  each  of  the  following  cases,  either  explain  why 
the  two  graphs  are  isomorphic  or  explain  why  they  are 
not. 

(a)  Ka  and  £3,8 

(b)  fC.\  i  and  £3,8 

(c)  1C 4.6  and  /C2, 1 2 

(d)  KI4.6  and  /Cs  5 

(e)  /C4.6  and  £4,4 

12.  George  is  examining  three  graphs  Q\,  Qi,  and  t/3-  He 
gives  correct  arguments  showing  that  Q\  is  not  isomor¬ 
phic  to  Qi  and  that  Gi  is  not  isomorphic  to  Gi-  Can  he 
conclude  that  Q\  is  not  isomorphic  to  £3?  Explain. 

13.  Answer  Exercise  12  again,  assuming  that  George’s  cor¬ 
rect  arguments  show  that  Q\  is  not  isomorphic  to  Gi 
while  Gi  is  isomorphic  to  Qj. 

14.  Prove  that  K-a,b  =  tCc,d  if  and  only  if  {a,  b}  =  (c,  d }. 


*  *  m 


CHAPTER 


10 

Paths  and  Circuits 


10.1  Eulerian  Circuits 

Many  real  problems,  when  translated  to  questions  about  graphs,  inquire  about 
the  possibility  of  walking  through  a  graph  in  a  particular  way.  Although  our 
primary  interest  is  in  graphs,  the  definitions  and  results  of  this  section  are  stated 
for  pseudographs,  since  they  apply  equally,  and  with  few  additional  complications, 
in  the  more  general  setting. 


10.1.1  DEFINITIONS 


A  walk  in  a  pseudograph  is  an  alternating  sequence  of  vertices  and  edges,  begin¬ 
ning  and  ending  with  a  vertex,  in  which  each  vertex  (except  the  last)  is  incident 
with  the  edge  which  follows  and  the  last  edge  is  incident  with  the  edge  which 
precedes  it.  The  length  of  a  walk  is  the  number  of  edges  in  it.  A  walk  is  closed 
if  the  first  vertex  is  the  same  as  the  last  and  otherwise  open.  A  trail  is  a  walk  in 
which  all  edges  are  distinct;  a  path  is  a  walk  in  which  all  vertices  are  distinct.  A 
closed  trail  is  called  a  circuit.  A  circuit  in  which  the  first  vertex  appears  exactly 
twice  (at  the  beginning  and  the  end)  and  in  which  no  other  vertex  appears  more 
than  once  is  a  cycle.  An  n -cycle  is  a  cycle  with  n  vertices.  It  is  even  if  n  is  even 
and  odd  if  n  is  odd. 


There  are  a  lot  of  words  here  and,  as  always,  we  cannot  emphasize  too  strongly 
the  importance  of  coming  to  grips  with  their  meanings.  When  thinking  about  the 
concepts  of  path  and  trail,  it  is  perhaps  helpful  to  note  that  a  path  is  necessarily  a  trail: 
If  all  the  vertices  of  a  walk  are  different,  then  all  the  edges  must  be  different  too.  As 
we  show  in  the  next  example,  the  converse  is  not  true:  A  trail  need  not  be  a  path. 

EXAMPLE  1  In  the  graph  shown  in  Fig  10.1,  ABCEFCBD  is  a  walk  of  length  7  which  is 
neither  a  trail  nor  a  path;  ABCEFCD  is  a  trail,  but  not  a  path;  ABCEFCDB A  is 
a  closed  walk  which  is  not  a  circuit;  BCEFCDB  is  a  circuit  which  is  not  a  cycle; 
and  BCDB  is  a  3-cycle  and  hence  an  odd  cycle.  The  closed  walk  CEFCBDC 
is  not  a  cycle  because  the  first  and  last  vertex  appears  a  third  time.  A 
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Verify  each  of  the  assertions  just  made. 


While,  strictly  speaking,  a  walk  should  be  specified  by  an  alternating  sequence 
of  vertices  and  edges,  it  is  often  enough  to  specify  only  the  vertices,  as  long  as 
consecutive  vertices  are  adjacent,  as  we  have  just  done,  or  only  the  edges,  as  long 
as  consecutive  edges  are  adjacent.1  An  important  type  of  circuit,  known  as  an 
Eulerian  circuit ,  is  one  which  passes  through  every  edge  of  a  pseudograph. 


10.1.2  DEFINITION 


An  Eulerian  circuit  in  a  pseudograph  is  a  circuit  which  contains  every  vertex 
and  every  edge.  An  Eulerian  pseudograph  is  a  pseudograph  which  contains  an 
Eulerian  circuit. 


Note  the  reference  to  vertices  in  the  definition  of  Eulerian  circuit,  which  is 
crucial  to  the  theory.  See  Exercise  5. 

EXAMPLE  2  In  the  graph  of  Fig  10.1,  the  circuit  ABC E FC DA  is  not  Eulerian  because  it  does 
not  contain  the  edge  BD.  As  we  shall  soon  see.  this  graph  possesses  no  Eulerian 
circuit;  it  is  not  an  Eulerian  graph.  ▲ 

EXAMPLE  3  Figure  10.2  further  illustrates  the  difference  between  a  circuit  and  an  Eule¬ 
rian  circuit.  The  circuit  ABC DE FG H FA  is  not  Eulerian  since,  while  it 
encompasses  all  vertices,  it  omits  four  edges.  The  graph  is  Eulerian,  however: 
ABC D E FG H F AD B E A  is  an  Eulerian  circuit.  A 


Figure  10.2  An  Eulerian  graph. 


Eulerian  circuits  are  named,  of  course,  after  Leonhard  Euler,  the  solver  of 
the  Konigsberg  Bridge  Problem,  and  their  study  is  motivated  by  that  problem. 
In  Section  9.1,  we  saw  that  to  follow  the  desired  route  over  the  bridges  of 
Konigsberg,  you  have  to  choose  a  vertex  in  the  pseudograph  of  Fig  9.2  and  find 
a  walk  which  includes  all  the  edges  exactly  once  and  leads  back  to  the  chosen 
vertex.  With  our  present  terminology,  the  Konigsberg  Bridge  Problem  asks  if  the 
pseudograph  is  Eulerian. 


1  In  a  graph,  there  is  never  a  problem  specifying  just  vertices  or  just  edges. 
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Figure  10.3  A  graph  which  is  not  Eulerian. 


In  Theorem  10.1.4,  we  present  the  remarkably  simple  test  which  Euler  found 
for  the  presence  of  an  Eulerian  circuit.  First,  since  an  Eulerian  circuit  provides  a 
walk  between  every  pair  of  vertices,  any  Eulerian  pseudograph  must  be  connected, 
in  the  following  sense. 


10.1.3  DEFINITION 


A  pseudograph  is  connected  if  and  only  if  there  exists  a  walk  between  any  two 
vertices. 


As  we  ask  you  to  show  in  Exercise  16,  if  a  graph  is  connected,  then  there  is 
actually  a  path  between  any  two  vertices,  not  just  a  walk. 

An  Eulerian  graph  must  be  more  than  just  connected,  however,  as  the  graph 
in  Fig  10.3  illustrates.  The  basic  difficulty  with  this  graph  is  that  there  is  only 
one  edge  incident  with  A.  Any  circuit  that  begins  at  A  cannot  return  to  A  without 
using  this  edge  again,  and  any  circuit  which  begins  at  a  vertex  other  than  A  and 
attempts  to  include  all  edges,  after  using  the  edge  BA,  has  to  repeat  it  en  route 
back  to  the  starting  vertex.  The  degrees  of  the  vertices  play  a  role  in  determining 
whether  or  not  a  graph  is  Eulerian.  It  is  not  hard  to  see  that  the  degrees  of  the 
vertices  of  an  Eulerian  graph  must  be  even. 

In  essence,  we  have  already  given  the  argument.  In  walking  along  an  Eulerian 
circuit,  every  time  we  meet  a  vertex  (other  than  the  one  where  we  started),  either 
we  leave  on  a  loop  and  return  immediately,  never  traversing  that  loop  again,  or 
we  leave  on  an  edge  different  from  that  by  which  we  entered  and  traverse  neither 
edge  again.  So  the  edges  (other  than  loops)  incident  with  any  vertex  in  the  middle 
of  the  circuit  can  be  paired.  So  also  can  the  edges  incident  with  the  first  (and  last) 
vertex  since  the  edge  by  which  we  left  it  at  the  beginning  can  be  paired  with  the 
edge  by  which  we  returned  at  the  end.  Thus,  an  Eulerian  graph  must  not  only  be 
connected,  but  also  have  vertices  of  even  degree.  Conversely,  a  connected  graph 
all  of  whose  vertices  are  even  must  be  Eulerian.  To  see  why,  it  will  be  helpful 
to  examine  again  the  graph  of  Fig  10.2  (which  is  connected  and  has  only  even 
vertices)  and  to  try  to  construct  an  Eulerian  circuit  with  a  strategy  that  might 
apply  more  generally. 

We  attempt  to  find  an  Eulerian  circuit  starting  at  A.  To  begin,  we  find  some 
circuit  which  starts  and  ends  at  A,  for  instance,  the  circuit  C\ :  ABC DE FA. 
This  circuit  obviously  is  not  Eulerian  because  it  misses  lots  of  edges.  If  we  delete 
the  edges  of  C\  from  the  graph  as  well  as  vertex  C,  which  is  isolated  after  the 
edges  of  C\  have  been  removed,  we  are  left  with  the  graph  Q\  on  the  left  of 
Fig  10.4. 

This  graph  Q\  has  a  circuit  C,  FGH ,  which  is  connected  to  C\  at  vertex 
F .  Thus,  it  can  be  used  to  enlarge  C\  as  follows:  Start  at  A,  follow  C\  as  far 
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H  A  B  A  B 


Figure  10.4  G  E  D  ED 

as  F,  then  pass  around  C  and  complete  C\  to  A.  We  obtain  a  second  circuit 
C2,  ABCDEFGHF A,  in  the  original  graph,  which  is  larger  than  C\,  but  still 
not  Eulerian  because  not  all  edges  are  yet  included.  As  before,  if  the  edges  of 
C2  are  deleted  (from  the  original  graph),  together  with  vertices  F,  G,  and  H 
which  have  become  isolated,  we  are  left  with  the  graph  on  the  right  of  Fig  1 0.4. 
This  graph  contains  a  circuit  ADBEA,  which  makes  contact  with  C2  at  A.  Piec¬ 
ing  together  these  circuits  (follow  C2  to  A,  then  ADBEA),  we  obtain  a  cir¬ 
cuit  C3  which  contains  all  the  edges  of  the  original  graph,  so  C 3  is  Eulerian. 
With  the  hindsight  of  this  example,  we  are  in  a  position  to  prove  an  important 
theorem. 


10.1.4  THEOREM 


A  pseudograph  (with  at  least  two  vertices)  is  Eulerian  if  and  only  if  it  is  connected 
and  every  vertex  is  even. 


Proof  (— >)  We  have  already  shown  that  an  Eulerian  pseudograph  must  be  connected 
with  each  vertex  even. 

(4-)  For  the  converse,  suppose  that  Q  is  a  connected  pseudograph  with  all 
vertices  of  even  degree.  We  must  prove  that  Q  has  an  Eulerian  circuit.  Let  v 
be  any  vertex  of  Q.  If  there  are  any  loops  incident  with  v,  follow  these  first, 
one  after  the  other  without  repetition.  Then,  since  we  are  assuming  that  Q  has 
at  least  two  vertices  and  since  Q  is  connected,  there  must  be  an  edge  vv\  (with 
iq  7^  v)  incident  with  v.  If  there  are  loops  incident  with  iq,  follow  these  one 
after  the  other  without  repetition.  Then,  since  degui  is  even  and  bigger  than  0, 
there  must  be  an  edge  v\ vj  different  from  vv\ .  Thus  we  have  a  trail  from  v  to  V2 
which  we  continue  if  possible.  Each  time  we  arrive  at  a  vertex  not  encountered 
before,  follow  all  the  loops  without  repetition.  Since  the  degree  of  each  vertex 
is  even,  we  can  leave  any  vertex  different  from  v  on  an  edge  not  yet  covered. 
Remembering  that  pseudographs  in  this  book  are  always  finite,  we  see  that  the 
process  just  described  cannot  continue  indefinitely;  eventually,  we  must  return  to 
v,  having  traced  a  circuit  C\.  Notice  that  every  vertex  in  C\  is  even  since  we 
entered  and  left  on  different  edges  each  time  it  was  encountered.  At  this  point, 
it  may  happen  that  every  edge  has  been  covered;  in  other  words,  that  C\  is  an 
Eulerian  circuit,  in  which  case  we  are  done.  If  C\  is  not  Eulerian,  as  in  the 
preceding  example,  we  delete  from  Q  all  the  edges  of  C\  and  all  the  vertices  of  Q 
which  are  left  isolated  (that  is,  acquire  degree  0)  by  this  procedure.  All  vertices 
of  the  remaining  graph  Q\  are  even  (since  both  Q  and  C\  have  only  even  vertices) 
and  of  positive  degree.  Also,  Q\  and  C\  have  a  vertex  u  in  common,  because  Q 
is  connected.  (See  Exercise  18.)  Starting  at  u,  and  proceeding  in  Q\  as  we  did 
in  Q ,  we  construct  a  circuit  C  in  Q\  which  returns  to  u.  Now  combine  C  and  C\ 
by  starting  at  v,  moving  along  C\  to  u,  then  through  C  back  to  u,  and  then  back 
to  v  on  the  remaining  edges  of  C\.  We  obtain  a  circuit  C2  in  Q  which  contains 
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Figure  10.5 


10.1.5  THEOREM 


EXAMPLE  4 


more  edges  than  C\ .  If  it  contains  all  the  edges  of  Q,  it  is  Eulerian;  otherwise,  we 
repeat  the  process,  obtaining  a  sequence  of  larger  and  larger  circuits.  Since  our 
pseudograph  is  finite,  the  process  must  eventually  stop,  and  it  stops  only  with  a 
circuit  through  all  edges  and  vertices,  that  is,  with  an  Eulerian  circuit.  I 

Why  is  the  graph  shown  in  Fig  10.1  not  Eulerian? 

Not  only  does  Theorem  10.1.4  give  criteria  for  a  pseudograph  to  be  Eulerian, 
but  its  proof  gives  an  algorithm  for  finding  an  Eulerian  circuit  when  one  exists. 
We  have  given  one  example.  Here  is  another. 

A  power  company’s  wires  in  a  certain  region  follow  the  routes  indicated  in 
Fig  10.5.  The  vertices  represent  poles  and  the  edges  wires.  After  a  severe  storm, 
all  the  wires  and  poles  must  be  inspected.  Show  that  there  is  a  round  trip  beginning 
at  A,  which  allows  a  person  to  inspect  each  wire  exactly  once.  Find  such  a  trip. 


A  B  C  D  E  F 
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Sometimes,  instead  of  finding  an  Eulerian  circuit  in  a  pseudograph,  we  want 
to  find  an  Eulerian  trail  between  two  vertices,  a  trail  which  passes  through  every 
vertex  and  includes  every  edge.  It  is  not  hard  to  classify  the  pseudographs  in 
which  such  a  trail  is  possible,  for  adding  one  additional  edge  between  the  two 
vertices,  the  enlarged  pseudograph  is  Eulerian.  To  see  this,  note  that  if  the  two 
vertices  are  u  and  v,  then  following  an  Eulerian  trail  from  u  to  v  and  going  back 
to  u  along  the  extra  edge  defines  an  Eulerian  circuit.  By  Theorem  10.1.4,  the 
enlarged  pseudograph  must  be  connected  with  all  vertices  even.  So  the  original 
pseudograph  must  have  been  connected  with  all  vertices  even  except  u  and  v 
which  are  necessarily  odd.  (With  the  extra  edge,  they  were  even.)  On  the  other 
hand,  given  a  connected  pseudograph  with  all  vertices  except  u  and  v  even,  then 
certainly  there  is  an  Eulerian  trail  from  u  to  v,  for  adding  an  extra  edge  between 
u  and  v  produces  an  Eulerian  pseudograph.  Since  an  Eulerian  circuit  can  begin  at 
any  vertex,  imagine  the  one  which  begins  by  going  from  v  to  u  along  the  added 
edge.  Removing  this  added  edge  from  the  circuit  then  gives  an  Eulerian  trail  from 
u  to  v.  We  have  established  the  following  theorem. 

A  pseudograph  Q  possesses  an  Eulerian  trail  between  two  (different)  vertices  u 
and  v  if  and  only  if  Q  is  connected  and  all  vertices  except  u  and  v  are  even. 

Consider  the  graph  in  Fig  10.6.  Vertices  A  and  B  have  degree  3,  J  and  K  have 
degree  2,  and  all  the  others  have  degree  4.  Since  the  graph  is  connected  and  A 
and  B  are  its  only  odd  vertices,  there  exists  an  Eulerian  trail  from  A  to  B.  One 
Eulerian  trail  is  AGH  F  DG  J  HI  F  ELIKLBC  ADCEB .  ▲ 
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Answers  to  Pauses 


1.  ABCEFCBD  is  not  a  trail  (and  hence  not  a  path)  because  BC  is  a  repeated 
edge;  ABCEFCD  is  a  trail  because  all  edges  are  distinct,  but  not  a  path 
because  C  is  a  repeated  vertex;  ABCEFCDB A  is  a  closed  walk  because 
its  first  and  last  vertices  are  the  same,  but  not  a  circuit  because  edge  AB  is 
repeated;  BCEFCDB  is  a  circuit  because  its  edges  are  distinct  and  it  begins 
and  ends  at  vertex  B,  but  it  is  not  a  cycle  because  its  second  and  fifth  vertices 
are  the  same.  BCDB  is  an  odd  cycle  because  it  contains  three  edges.  (Note 
that  the  circuit  CBDCEFC  is  not  a  cycle:  Its  first  and  fourth  vertices  are 
the  same,  but  the  fourth  vertex  is  not  last.) 

2.  The  graph  in  Fig  10.1  is  not  Eulerian  because  not  all  vertices  are  even;  for 
example,  deg  B  =  3. 

3.  The  desired  round  trip  is  just  an  Eulerian  circuit  in  the  graph  of  Fig  10.5. 
Such  a  circuit  exists  because  the  graph  is  connected  and  each  vertex  has 
even  degree;  vertices  A,  F,  G,  L,  N,  and  O  have  degree  2;  the  rest  have 
degree  4.  In  the  search  for  an  Eulerian  circuit,  an  obvious  circuit  with  which 
to  begin  is  ABC  DE  F  LK  J I HGA.  Joining  this  to  the  circuit  BMC  I M  FI  B 
at  B  gives  A-BMC I M  H  B-C  DE  F  LK  J I HG  A.  Joining  this  to  the  circuit 
DJOKEND  at  D  gives  the  following  routing  for  the  desired  inspection: 
A-BMC  I M  H  B-C-DJOKEND-EFLKJIHGA. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  Find  a  connected  graph  with  as  few  vertices 
as  possible  which  has  precisely  two  vertices  of  odd 
degree. 

(b)  Find  a  connected  graph  with  as  few  vertices  as 
possible  which  has  precisely  two  vertices  of  even 
degree. 


(a)  [BB] 


2.  [BB]  Answer  the  Konigsberg  Bridge  Problem  and  ^ 
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3.  In  each  case,  explain  why  the  graph  is  Eulerian  and  find 
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4.  In  each  of  the  pseudographs  shown,  either  describe  an 
Eulerian  circuit  by  numbering  the  edges  or  explain  why 
no  Eulerian  circuit  exists. 


5.  Suppose  we  modify  the  definition  of  Eulerian  circuit  by 
omitting  the  reference  to  vertices.  Thus  we  propose  that 
an  Eulerian  circuit  be  a  circuit  which  contains  every 
edge  of  a  graph.  Does  Theorem  10.1.4  remain  true? 
Explain. 

6.  (a)  [BB]  Is  there  an  Eulerian  trail  from  A  to  B1  If  yes, 

find  one;  if  not,  explain  why  not. 


(b)  Same  question  for  A  to  C. 

7.  [BB]  (Fictitious)  A  recently  discovered  map  of  the  old 
town  of  Konigsberg  shows  that  there  was  a  ferry 
operating  between  the  areas  labeled  C  and  D  in 
Fig  9.1. 

(a)  Is  it  possible  to  start  on  some  land  area,  cross  over 
each  bridge  exactly  once,  take  the  ferry  exactly 


once,  and  return  to  the  starting  point?  Explain  your 

answer. 

(b)  Is  it  possible  to  start  on  some  land  mass,  walk  over 
each  bridge  exactly  once,  take  the  ferry  exactly 
once,  and  finish  on  some  land  mass  (possibly  dif¬ 
ferent  from  the  starting  point)?  Explain. 

8.  Euler’s  original  article  about  the  Konigsberg  Bridge 
Problem,  which  is  dated  1736,  presents  a  second  simi¬ 
lar  problem  with  two  islands,  four  rivers  flowing  around 
them,  and  15  bridges  connecting  various  land  masses, 
as  shown  below. 

(a)  Is  it  possible  to  tour  the  region  starting  and  finishing 
in  the  same  area  having  walked  over  every  bridge 
exactly  once?  Either  describe  such  a  tour  or  explain 
why  none  is  possible. 

(b)  Is  it  possible  to  tour  the  region  (with  perhaps  dif¬ 
ferent  starting  and  stopping  points)  having  walked 
over  every  bridge  exactly  once?  Either  describe  such 
a  tour  or  explain  why  none  is  possible. 


9.  [BB]  In  Exercise  10  of  Section  9.3,  you  were  asked 
whether  graphs  Q  and  H  with  the  same  numbers  of  ver¬ 
tices  and  edges  and  with  every  vertex  in  each  graph 
of  degree  2  need  be  isomorphic.  Answer  this  question 
again,  assuming  in  addition  that  the  graphs  are  con¬ 
nected. 

10.  Suppose  Q i  and  Gi  are  Eulerian  graphs  with  no  vertices 
in  common.  Let  iq  be  a  vertex  in  Q\  and  let  i>2  be  a 
vertex  in  Qi.  Join  iq  and  t>2  with  a  single  edge.  What 
can  be  said  about  the  resulting  graph  and  why? 

11.  (a)  [BB]  For  which  values  of  n  >  1,  if  any,  is  /C„ 

Eulerian? 

(b)  [BB]  For  which  values  of  n  >  1,  if  any,  does  IC„ 

possess  an  Eulerian  trail?  Explain. 

12.  (a)  Find  a  necessary  and  sufficient  condition  on  natural 

numbers  m  and  n  in  order  for  lCmM  to  be  Eulerian. 

Prove  your  answer. 
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(b)  Find  a  necessary  and  sufficient  condition  on  natu¬ 
ral  numbers  m  and  n  in  order  for  ICmn  to  have  an 
Eulerian  trail.  Assume  m  <  n.  Prove  your  answer. 

13.  Does  there  exist  any  sort  of  route  in  and  around  the  figure 
which  crosses  every  edge  exactly  once?  Explain  your 
answer. 
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5 - € 

© 

14.  [BB]  Prove  that  any  circuit  in  a  graph  must  contain  a 
cycle  and  that  any  circuit  which  is  not  a  cycle  contains 
at  least  two  cycles. 

15.  [BB]  Answer  true  or  false  and  explain:  Any  closed  walk 
in  a  graph  contains  a  cycle. 

16.  Let  u  and  v  be  distinct  vertices  in  a  graph  Q.  Prove  that 
there  is  a  walk  from  u  to  v  if  and  only  if  there  is  a  path 
from  u  to  u. 

17.  [BB]  For  vertices  u  and  v  in  a  graph  Q,  define  u  ~  v  if 
u  =  v  or  there  exists  a  walk  from  u  to  u.  Prove  that  ~ 
defines  an  equivalence  relation  on  the  vertices  of  Q. 

18.  Complete  some  details  in  the  proof  of  Theorem  10.1.4 
by  establishing  the  following.  Suppose  Ci(V|,£i)  is  a 
circuit  in  a  connected  graph  Q  which  does  not  contain 
all  the  edges  of  Q.  Let  Q\  be  that  subgraph  of  Q  whose 
edge  set  is  £  \£|  and  whose  vertex  set  is  V  less  those 
vertices  of  Vi  which  become  isolated  after  the  removal 
of  the  edges  in  £\.  Prove  that  G\  and  C\  have  a  vertex 
in  common. 

19.  Suppose  Gi  and  Gi  are  isomorphic  graphs.  Prove  that 
either  both  Q \  and  Q2  are  connected  or  else  neither  is 
connected. 

20.  [BB]  A  graph  Q  has  20  vertices.  Any  two  distinct  ver¬ 
tices  x  and  y  have  the  property  that  deg  x  +  deg  y  >19. 
Prove  that  Q  is  connected. 


21.  (a)  [BB]  Let  G  be  a  connected  graph  with  n  >  1  ver¬ 

tices  none  of  which  has  degree  1 .  Prove  that  Q  has 
at  least  n  edges. 

(b)  Let  Q  be  a  connected  graph  with  n  vertices.  Prove 
that  Q  has  at  least  n  —  1  edges. 

22.  Let  G  be  a  graph  with  n  vertices  and  m  edges,  where 
m  >  ^(n  —  I  )(n  —  2). 

(a)  Show  that  Q  does  not  have  a  vertex  of  degree  0. 

(b)  Show  that  G  is  connected. 


10.1.6  DEFINITION 


A  component  of  a  graph  is  a  maximal  connected  subgraph, 
that  is,  a  connected  subgraph  which  is  properly  contained 
in  no  other  connected  subgraph  which  has  more  vertices  or 
more  edges. 

A  graph  with  just  one  component  is  connected.  Fig  10.7 
shows  graphs  with  two  and  with  three  components. 


Figure  10.7  Q\  has  two  components;  G2  has  three  com¬ 
ponents. 

23.  Let  Q  be  a  graph  all  of  whose  vertices  have  even  degree. 
How  can  the  Eulerian  circuit  algorithm  described  in  The¬ 
orem  10.1.4  be  modified  to  determine  the  number  of 
components  in  <7? 

24.  [BB]  Prove  that  isomorphic  graphs  have  the  same  num¬ 
ber  of  components. 

25.  Prove  that  a  graph  is  bipartite  if  and  only  if  it  contains 
no  odd  cycles. 


10.2  Hamiltonian  Cycles 

An  Eulerian  circuit  contains  every  edge  of  a  graph  exactly  once.  In  this  section, 
we  discuss  circuits  which  contain  each  vertex  of  a  graph  exactly  once.  Unlike  in 
Section  10.1,  the  definitions  and  results  of  this  section  apply  only  to  graphs,  not 
to  pseudographs. 


10.2.1  DEFINITION 


A  Hamiltonian  cycle  in  a  graph  is  a  cycle  which  contains  every  vertex  of  the 
graph.  A  Hamiltonian  graph  is  one  with  a  Hamiltonian  cycle. 
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EXAMPLE  5 


Figure  10.8 


Some  authors  define  a  Hamiltonian  circuit  as  a  circuit  in  which  every  ver¬ 
tex  except  the  first  and  last  appears  exactly  once.  A  circuit  with  no  repeated 
vertices  (except  the  first  and  the  last)  is  a  cycle,  so  a  Hamiltonian  circuit  is 
a  cycle.  Thus  the  terms  Hamiltonian  circuit  and  Hamiltonian  cycle  are 
synonymous. 

Graph  Gi  in  Fig  10.8  is  Hamiltonian:  The  cycle  ABCDEA,  for  instance,  is  Hamil¬ 
tonian.  On  the  other  hand,  graph  Q2  is  not  Hamiltonian,  but  how  can  we  convince 
ourselves  of  this  fact? 

Suppose  that  Q2  has  a  cycle  H.  which  contains  every  vertex.  Then  EL  will 
contain  A,  which  we  note  is  a  vertex  of  degree  2.  Since  we  cannot  enter  and 
leave  A  on  the  same  edge  (edges  of  a  cycle  are  distinct),  it  follows  that  both 
edges  incident  with  A  have  to  be  part  of  7 i.  In  particular,  edge  C  A  is  in  'H.  The 
same  argument  applied  to  B  shows  that  CB  is  part  of  Ti  and,  similarly,  C  D  and 
CE  are  in  Ti:  All  four  edges  incident  with  C  are  part  of  'H.  This  situation  is 
impossible,  however:  Since  7i  is  a  cycle,  vertex  C  can  appear  only  once  unless 
the  cycle  begins  and  ends  at  C.  In  either  case,  since  ? i  cannot  use  the  same  edge 
twice,  exactly  two  edges  incident  with  C  can  be  part  of  7i.  ▲ 


D 

Gi 


Q\  is  Hamiltonian;  Q2  is  not. 


While  it  is  possible  to  decide  precisely  which  graphs  possess  Eulerian  circuits 
(Theorem  10.1.4),  it  is  noteworthy  that  there  has  not  yet  been  found  a  way  to 
classify  Hamiltonian  graphs;  in  other  words,  there  is  no  known  theorem  of  the 
sort,  “ G  is  Hamiltonian  if  and  only  if  . . .  ”.  There  are,  however,  some  properties  of 
cycles  which  are  helpful  in  trying  to  find  Hamiltonian  cycles  and  which  sometimes 
allow  us  to  conclude  that  a  particular  graph  is  not  Hamiltonian. 

Suppose  7i  is  a  cycle  in  a  graph  Q. 

1,  For  each  vertex  v  of  7f,  precisely  two  edges  incident  with  v  are  in  H\  hence, 
if  El  is  a  Hamiltonian  cycle  of  G  and  a  vertex  v  in  Q  has  degree  2,  then  both 
edges  incident  with  v  must  be  part  of  H. 

2.  The  only  cycle  contained  in  H  is  7i  itself.  (We  say  that  H  contains  no  proper 
cycles.) 

We  have,  in  essence,  already  explained  why  each  vertex  of  a  cycle  is  incident 
with  exactly  two  edges:  Any  vertex  (except  the  first  and  last)  appears  exactly  once 
in  a  cycle.  If  the  cycle  Ti.  is  Hamiltonian,  then  every  vertex  is  in  H  so  both  edges 
incident  with  any  vertex  of  degree  2  must  be  in  El.  This  establishes  Property  1. 
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Figure  10.9 


10.2.3  THE  PETERSEN 
GRAPH 


Figure  10.10 


Property  2  asserts  that  if  C  is  a  cycle  contained  in  another  cycle  TL,  then  C  =  TL. 
We  prove  this  by  contradiction. 

Suppose  C  is  a  cycle  contained  in  TL  and  C  TL.  Then  there  is  a  vertex  y  in 
TL  which  is  not  in  C.  Let  x  be  any  vertex  in  C.  Since  TL  contains  both  x  and  y, 
there  is  a  path  using  edges  of  TL  from  x  to  y.  Thus,  TL  contains  some  edge  vw 
where  vertex  v  is  in  C  but  w  is  not.  (See  Fig  10.9.)  So  TL  contains  the  two  edges 
of  C  which  are  incident  with  v  together  with  the  edge  vw.  Altogether,  there  are 
three  edges  incident  with  v  which  must  be  part  of  TL.  This  contradicts  Property  1. 
Thus  TL  is  not  a  cycle. 


V 


Answer  true  or  false  and  explain:  The  graph 
shown  at  the  right  is  not  Hamiltonian  because  it 
contains  the  cycle  GHIG. 


A 


As  a  deeper  application  of  the  properties  of  Hamiltonian  graphs  described  in 
10.2.2,  we  introduce  a  famous  graph  named  after  the  Danish  mathematician  Julius 
Petersen  (1839-1910).2  We  show  that  this  graph,  the  one  in  Fig  10.10,  is  not 
Hamiltonian. 


not  Hamiltonian. 


Suppose  TL  is  a  Hamiltonian  cycle.  Then  TL  must  contain  at  least  one  of 
the  five  edges  connecting  the  outer  to  the  inner  vertices.  Since  the  graph  is 
symmetric,  there  is  no  loss  of  generality  in  assuming  that  AF  is  part  of  TL.  (Refer 
to  Fig  10. 1 1.)  By  Property  1,  precisely  one  of  the  two  edges  FH  and  FI  is  in  Ti. 
Again,  by  symmetry,  we  may  assume  FH  is  part  of  the  cycle  while  FI  is  not. 

Since  FI  is  not  in  TL ,  but  two  edges  incident  with  I  must  be  in  TL  (Property  1), 
IG  and  ID  are  in  TL.  Now  precisely  one  of  the  edges  GB,  GJ  is  in  TL. 

Suppose  first  that  GB  is  in  and  GJ  is  out.  Because  precisely  two  edges 
incident  with  J  are  in  TL  and  JG  is  not,  both  J  H  and  J E  are  part  of  TL.  Thus, 


2J.  Petersen,  “Die  Theorie  der  regularen  Graphen."  Acta  Mathematica  15  (1891),  193-220. 
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CH  is  out  and  both  BC  and  CD  are  in  Ti.  At  this  point,  however,  Ti  contains 
the  proper  cycle  BCDIGB,  a  contradiction.  We  conclude  that  GB  cannot  be  part 
of  Ti  and  hence  that  GJ  is.  An  argument  similar  to  the  one  just  given  now  leads 
again  to  the  false  conclusion  that  Ti  contains  a  proper  cycle. 


Give  the  details  of  this  argument. 


Figure  10.12  No  Hamiltonian  cycle  contains  1234. 


Look  at  the  graph  pictured  in  Fig  10.12.  Show  that  no  Hamiltonian  cycle  can 
contain  edges  1,  2,  3,  4. 

Hamiltonian  graphs  take  their  name  from  Sir  William  Rowan  Hamilton  (1805- 
1865),  a  contemporary  and  personal  friend  of  William  Wordsworth  and  Samuel 
Taylor  Coleridge  and  indeed  a  man  of  many  talents.  By  the  age  of  13,  he  had 
mastered  one  language  for  each  year  of  his  life,  including  Latin,  Greek,  Hebrew, 
Chinese,  and  Sanskrit!  At  17,  he  had  a  firm  grasp  of  calculus.  He  studied  astron¬ 
omy  at  Trinity  College,  Dublin,  and  later  made  important  contributions  to  the 
study  of  optics.  He  is  perhaps  best  known  within  the  sphere  of  mathematics  as 
the  inventor  of  the  quaternions ,  the  first  noncommutative  field  to  be  discovered. 
Hamilton’s  quaternions  is  an  algebraic  structure  like  the  real  numbers  in  which  one 
can  add,  subtract,  multiply,  and  divide  (by  any  nonzero  number),  but  in  which  mul¬ 
tiplication  is  not  commutative:  There  are  quaternions  a  and  b  for  which  ab  fi  ba. 
The  reader  may  have  encountered  noncommutative  systems  before;  for  example, 
the  set  of  all  n  x  n  matrices  over  the  real  numbers  is  not  commutative.  A  basic 
difference  between  matrices  and  quaternions,  however,  and  the  thing  that  made 
Hamilton’s  discovery  so  remarkable,  is  the  lack  of  divisibility  in  matrices.  There 
are  many  nonzero  matrices  which  are  not  invertible;  on  the  other  hand,  every 
nonzero  quaternion  has  an  inverse. 
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Figure  10.13 


10.2.4  THEOREM 


Proof 


Hamilton  also  invented  a  game  which  made  use  of  a  wooden  regular  dodec¬ 
ahedron ,  that  is,  a  solid  with  12  congruent  faces,  each  of  which  is  a  regular 
pentagon.  The  vertices  of  the  dodecahedron  were  labeled  with  the  names  of  20 
cities  of  the  world,  and  the  object  of  the  game  was  to  find  a  route  “around  the 
world,”  along  the  edges  of  the  solid,  which  passed  through  each  city  exactly  once 
and  led  back  to  the  city  where  the  tour  started. 

Imagine  that  the  pentagon  on  which  the  dodecahedron  sits  is  stretched  so  that 
the  solid  collapses  until  it  is  flat.  The  result  is  a  graph  in  the  plane.  (See  Fig  10.13.) 
Hamilton’s  world  tour  is  possible  if  and  only  if  the  graph  contains  what  we  now 
call  a  Hamiltonian  cycle.  The  graph  in  fact  is  Hamiltonian;  a  Hamiltonian  cycle 
is  marked  with  the  heavy  lines  in  Fig  10.13. 


A  dodecahedron  and  its  associated  Hamiltonian  graph. 


There  are  some  graphs  in  which  a  Hamiltonian  cycle  always  exists  (the  com¬ 
plete  graph  on  n  vertices,  for  instance).  Assigning  the  vertices  of  K.n  the  labels 
V],V2, ...  .  v„,  then  v\V2  ■  ■  ■  v„v\  is  a  cycle  because  we  are  assured  of  an  edge 
between  each  pair  of  vertices.  The  cycle  clearly  passes  through  each  vertex  (except 
the  first  and  the  last)  exactly  once,  so  it  is  Hamiltonian.  It  would  seem  that  a  graph 
with  “lots”  of  edges  should  have  a  good  chance  of  being  Hamiltonian.  The  fol¬ 
lowing  theorem,  published  by  G.  A.  Dirac  in  1952,  provides  further  evidence  in 
support  of  this  idea. 

(Dirac3)  If  a  graph  Q  has  n  >  3  vertices  and  every  vertex  has  degree  at  least  |, 
then  Q  is  Hamiltonian.4 

Among  all  the  possible  paths  in  Q,  suppose  that  V:  v\V2  ■  ■  ■  v,  is  longest  in  the 
sense  that  it  uses  the  most  vertices.  Thus,  there  is  no  walk  in  Q  which  uses  more 
than  t  vertices  without  repeating  some  vertex.  If  some  vertex  w  adjacent  to  v\  is 
not  in  V,  then  the  walk  wv\ V2  ■  ■  •  vt  does  not  have  repeated  vertices  and  is  longer 
than  V.  Since  this  is  contrary  to  the  way  V  was  chosen,  every  vertex  adjacent 
to  v\  is  in  V.  (Similarly,  every  vertex  adjacent  to  vt  is  in  V.)  Since  deg  vi  >  ", 
'  2=  2  -f  1,  the  “+1”  counting  v\  itself.  Since  n  >  3  and  t  is  an  integer,  we 
conclude  that  t  >  3  also. 


^G.  A.  Dirac,  “Some  Theorems  on  Abstract  Graphs,”  Proceedings  London  Mathematical  Society  2 
(1952),  69-81. 

4There  is  a  stronger  version  of  this  theorem,  due  to  Oystein  Ore,  which  says  that  a  graph  with  n  >  3 
vertices  is  Hamiltonian  as  long  as  the  sum  of  the  degrees  of  any  two  nonadjacent  vertices  is  at  least  n.  See 
Exercise  14. 
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Claim:  There  is  a  pair  of  vertices  vk,  vk+i  in  P  (1  <  k  <  t)  such  that  v\  is 
adjacent  to  vk+\  and  v,  is  adjacent  to  vk,  as  suggested  in  Fig  10.14. 


vk+ 1 
vk 


If  this  were  not  the  case,  then  each  of  the  vertices  of  P  adjacent  to  v\  would 
determine  a  vertex  not  adjacent  to  v,  (namely,  its  predecessor  in  P).  Since  the 
vertices  V2,  ■  ■  ■  ,  v,  are  all  different,  there  would  be  in  Q  at  least  |  vertices  not 
adjacent  to  vt.  These  vertices,  together  with  the  vertices  adjacent  to  vt,  account  for 
at  least  n  vertices  in  Q.  Including  v,  itself,  we  have  found  more  than  n  vertices, 
which  cannot  be.  This  establishes  the  validity  of  our  claim,  from  which  it  follows 
that  Q  contains  the  cycle 


C:  vivk+\Vk+2  ■  ■  ■  v,vkVk-\  ■■■vi. 

We  show  that  C  contains  all  vertices  of  Q  and  hence  is  the  desired  Hamiltonian 
cycle.  Remember  that  C  contains  at  least  |  +  1  vertices,  so  there  are  less  than 
7  vertices  not  in  C.  Hence,  any  vertex  w  which  is  not  in  C  must  be  adjacent  to 
some  vertex  vs  of  C.  Then  w,  vs,  and  the  remaining  vertices  of  C  in  sequence 
would  define  a  path  longer  than  V,  contradicting  the  definition  of  V.  I 


Answers  to  Pauses 


Show  that  Dirac’s  Theorem  is  false  if  |  is  replaced  by  in  its  statement.  [Hint: 
Examine  those  graphs  presented  in  this  section  which  are  not  Hamiltonian.] 

4.  False!  The  graph  is  Hamiltonian:  AFEDI HGCBA  is  a  Hamiltonian  cycle. 
It  is  not  the  entire  graph  which  must  not  contain  a  proper  cycle,  but  any 

Hamiltonian  cycle. 

5.  We  are  assuming  that  both  AF  and  F H  are  in  a  Hamiltonian  path  7i.  As 
before,  FI  is  not  in  H ;  therefore,  both  IG  and  ID  are.  Since  edge  G.I  is  also 
in  'H  while  GB  is  out,  both  BA  and  BC  are  in  because  two  edges  adjacent  to 
B  are  part  of  Ft  and  BG  is  not  in  H,  as  shown  on  the  left.  Using  Property  2, 
we  see  that  CH  cannot  be  part  of  otherwise  H  contains  the  proper  cycle 
ABC H FA.  So  both  CD  and  HJ  are  in.  Now  H  contains  the  proper  cycle 
ABCDIGJ H FA,  a  contradiction. 
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6.  If  a  Hamiltonian  cycle  H  were  to  contain  edges  2  and  3  (see  the  graph  on  the 
right),  then  the  third  edge  at  the  center  could  not  be  part  of  7i  (Property  1), 
so  the  edges  labeled  5  and  6  would  have  to  belong  to  H  (Property  1  again). 
Now  edges  1,  2,  3,  4,  5  and  6  form  a  proper  cycle  within  H.  This  contradicts 
Property  2. 

7.  The  graph  Qi  in  Fig  10.8,  which  is  not  Hamiltonian,  has  n  =  5  vertices  each 
of  degree  at  least  2  = 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found,  in  the  Back  of  the  Book. 


1.  [BB]  Is  the  graph  Hamiltonian?  Is  it  Eulerian? 
Explain  your  answers. 


2.  [BB;  (b),(d)]  Determine  whether  or  not  each  of  the 
graphs  of  Exercise  4  of  Section  10.1  is  Hamiltonian.  In 
each  case,  either  label  the  edges  with  numbers  so  as  to 
indicate  a  Hamiltonian  cycle  or  explain  why  no  such 
cycle  exists. 


10.2.5  DEFINITION 


A  Hamiltonian  path  in  a  graph  is  a  path  which  passes 
through  every  vertex  exactly  once. 

3.  Determine  whether  or  not  each  of  the  graphs  shown  is 
Hamiltonian.  Determine  also  whether  or  not  each  graph 
has  a  Hamiltonian  path.  In  each  case,  either  number  the 
vertices  so  as  to  indicate  a  Hamiltonian  cycle  or  path  or 
explain  why  no  such  cycle  or  path  exists. 


4.  (a)  [BB]  Is  the  graph  Hamiltonian? 

(b)  Is  there  a  Hamiltonian  path? 

(c)  [BB]  Is  it  Eulerian? 

(d)  Is  there  an  Eulerian  trail? 

Explain  your  answers. 


E 


5.  (a)  (The  Knight’s  Tour)  Is  it  possible  for  a  knight  to 
tour  a  chessboard  visiting  every  square  exactly  once 
and  returning  to  its  initial  square?  (See  Exercise  8 
of  Section  9.2.) 

(b)  Is  the  sort  of  tour  described  in  5(a)  possible  on  a 
7x7  “chessboard”? 

(For  a  complete  classification  of  those  m  x  n  boards  on 
which  knight’s  tours  are  possible,  the  reader  is  directed 
to  the  interesting  article  by  Allen  J.  Schwenk  which 
appeared  in  the  December  1991  issue  of  Mathematics 
Magazine.) 
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6.  Is  the  graph  of  Fig  10. 12  Hamiltonian?  Display  a  Hamil¬ 
tonian  cycle  or  explain  clearly  why  no  such  cycle  exists. 

7.  The  figure  shows  the  floor  plan  of  a  single-story  house 
with  various  doorways  between  rooms  and  other  door¬ 
ways  leading  outside. 


O 


A 

SQ 

— 
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G 

(a)  [BB]  Is  it  possible  to  start  outside,  then  to  enter  the 
house  and  walk  through  every  room  exactly  once 
(without  leaving  the  house),  and  finally  to  return 
outside?  If  yes,  exhibit  the  route  on  a  copy  of  the 
floor  plan. 

(b)  Is  there  a  route  which  starts  outside  and  leads 
through  every  doorway  in  the  house  exactly  once? 
(You  are  allowed  to  return  outside  or  to  reenter 
rooms  as  often  as  you  want.) 

8.  [BB]  In  a  group  of  2 n  people,  each  person  has  at  least 
n  friends.  Prove  that  the  group  can  be  seated  in  a  circle, 
each  person  next  to  a  friend. 

9.  (a)  [BB]  How  many  edges  must  a  Hamiltonian  cycle  in 

Kn  contain? 

(b)  How  many  Hamiltonian  cycles  does  K,„  have? 
(Begin  all  cycles  at  the  same  vertex.) 

(c)  [BB]  What  is  the  maximum  number  of  edge  dis¬ 
joint  Hamiltonian  cycles  in  /C,,?5  (Cycles  are  edge 
disjoint  if  no  two  of  them  have  an  edge  in  common.) 

(d)  Find  all  the  Hamiltonian  cycles  in  Kn  for  n  = 
1, 2,  3,  4,  5.  In  each  case,  exhibit  a  maximum  num¬ 
ber  which  are  edge  disjoint. 

10,  [BB]  Draw  a  picture  of  a  cube.  By  imagining  that  the 
bottom  square  is  stretched  until  it  is  flat,  draw  a  graph 
of  the  flattened  cube.  Is  this  graph  Hamiltonian?  If  so, 
draw  a  Hamiltonian  cycle.  If  not,  explain  why  not. 

11.  The  picture  on  the  left  is  that  of  an  icosahedron,  a  solid 
object  whose  faces  consist  of  20  congruent  equilateral 
triangles.  By  stretching  the  base  triangle  and  flatten¬ 
ing,  the  icosahedron  determines  a  graph  in  the  plane 
(as  shown  on  the  right  side  of  the  figure).  Find  a  Hamil¬ 
tonian  cycle  in  this  graph. 


12.  Make  a  model  of  a  dodecahedron  (Fig  10.13)  and  trace 
a  Hamiltonian  path  along  its  edges.  Do  the  same  for 
the  icosahedron.  ( Suggestion :  Look  for  ideas  in  geom¬ 
etry  texts,  in  their  discussions  of  the  Platonic  solids. 
One  such  book  is  the  wonderful  work  of  H.  S.  M.  Cox- 
eter,  Introduction  to  Geometry,  Wiley,  New  York, 
1961.) 

13.  (a)  [BB]  Suppose  Q  is  a  graph  with  n  vertices,  each  of 

which  has  degree  d  >  !i^-.  Prove  that  Q  contains 
a  Hamiltonian  path.  [Hint:  Add  an  extra  vertex  to 
Q  which  is  adjacent  to  every  vertex  and  use  Dirac’s 
Theorem.] 

(b)  Does  the  graph  shown  on  the  left  have  a  Hamil¬ 
tonian  path?  If  so,  find  it.  If  it  doesn’t  have  one, 
explain  why  not. 


(c)  Repeat  the  previous  question  for  the  graph  on  the 
right. 

(d)  Does  the  converse  of  (a)  hold;  that  is,  if  a  graph  has 
a  Hamiltonian  path,  must  the  degree  of  every  vertex 
be  at  least  -yf  ?  Explain  your  answer.  What  about 
the  converse  of  Dirac's  Theorem? 

(e)  [BB]  Give  an  example  of  a  graph  which  has  a  Hamil¬ 
tonian  path  but  no  Hamiltonian  cycle. 

14.  (Ore’s  Theorem)  Suppose  Q  is  a  graph  with  n  >  3  ver¬ 
tices  and  that  the  sum  of  the  degrees  of  any  two  nonad- 
jacent  vertices  is  at  least  n.  Prove  that  Q  is  Hamiltonian 
by  starting  with  a  path  V :  v\ vi  •  ■  •  vt  of  greatest  length, 
as  in  the  proof  of  Dirac’s  Theorem,  and  then  considering 
separately  the  cases  where 


5 


In  fact,  any  complete  graph  actually  has  this  maximum  number  of  edge  disjoint  cycles. 
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(a)  [BB]  v\  and  v,  are  adjacent,  and 

(b)  iq  and  v,  are  not  adjacent. 

15.  [BB]  Suppose  Q  is  a  graph  with  n  >2  vertices  such  that 
the  sum  of  the  degrees  of  any  two  nonadjacent  vertices 
is  at  least  n  —  1.  Prove  that  Q  has  a  Hamiltonian  path. 

16.  Answer  true  or  false  and  in  each  case  either  give  a  proof 
or  provide  a  counterexample. 

(a)  A  Hamiltonian  graph  contains  no  proper  cycles. 

(b)  Every  vertex  in  a  Hamiltonian  graph  has  degree  2. 

(c)  [BB]  Every  Eulerian  graph  is  Hamiltonian. 

(d)  Every  Hamiltonian  graph  is  Eulerian. 

17.  Let  Q  be  a  graph  with  at  least  three  vertices. 


(a)  [BB]  If  there  is  a  Hamiltonian  path  between  any  two 
vertices  of  Q,  need  Q  contain  a  Hamiltonian  cycle? 
Explain. 

(b)  If,  at  every  vertex  v  in  Q,  there  is  a  Hamiltonian 
path  which  starts  at  n,  need  Q  contain  a  Hamiltonian 
cycle?  Explain. 

(c)  Is  it  possible  for  there  to  exist  an  Eulerian  trail 
between  any  two  vertices  of  Q2  If  so,  need  Q  contain 
an  Eulerian  circuit?  Explain. 

18.  A  connected  graph  Q  has  1 1  vertices  and  53  edges.  Show 
that  Q  is  Hamiltonian  but  not  Eulerian. 

19.  Find  a  necessary  and  sufficient  condition  on  m  and  n  in 
order  for  /Cm,„  to  be  Hamiltonian.  Prove  your  answer. 


10.3  The  Adjacency  Matrix 

Graphs  occur  with  increasing  frequency  in  modern-day  problems.  While  theo¬ 
retically,  any  problem  associated  with  a  finite  graph  is  solvable,  in  practice,  the 
number  of  cases  to  consider  is  often  so  large  and  the  time  needed  to  deal  with 
each  case  so  great  that  an  exhaustive  search  of  all  possibilities  is  impossible. 
Accordingly,  the  discovery  of  new  graph-based  algorithms  and  ways  to  improve 
efficiency  are  flourishing  areas  of  mathematical  research  today.  In  order  to  write 
an  algorithm  which  requires  the  input  of  a  graph,  we  first  must  decide  how  to  code 
the  pertinent  information  which  describes  a  graph.  For  this  purpose,  the  adjacency 
matrix  is  commonly  used. 


10.3.1  DEFINITION 


Let  Q  be  a  graph  with  n  vertices  labeled  tq,  V2,  ■ . .  ,  vn.  For  each  i  and  j  with 
1  <  0  j  <  n,  define 


II  if  Vi  v  j  is  an  edge 
0  if  ViVj  is  not  an  edge. 

The  adjacency  matrix  of  Q  is  the  n  x  n  matrix  A  =  [«,•_/]  whose  (i,  j)  entry  is  a,;. 


EXAMPLE  6 


Figure  10.15  shows  a  graph  Q  and  its  adjacency 


matrix  A. 


A  = 


0 

0 

0 

1 

0 


0  0  1 
0  0  1 
0  0  0 
1  0  0 
1  0  1 


0  - 
1 
0 
1 
0 


Figure  10.15 


A  graph  and  its  adjacency  matrix. 
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EXAMPLE  7 


The  adjacency  matrix  of  the  complete  bipartite  graph  on  the  sets  Vi  =  {vi,  V2) 
and  V2  =  {W3,  14,  u5}  is 


"00  1  1  1" 
0  0  111 
110  0  0 
110  0  0 
110  0  0 


▲ 


Since  the  adjacency  matrix  records  all  the  incidence  relations  in  a  graph,  it  is 
not  surprising  that  it  can  be  used  to  give  a  lot  of  information  about  the  graph.  We 
list  here  some  of  the  basic  properties  of  an  adjacency  matrix.  Some  are  obvious, 
others  not  quite  so,  and  perhaps  the  most  curious  of  all,  due  to  Gustav  Kirchhoff, 
we  leave  to  Chapter  1 2. 

Let  Q  be  a  graph  with  vertices  v\,V2, ...  ,v„  and  let  A  =  [a,j  |  be  the  adjacency 
matrix  of  Q. 


1.  The  diagonal  entries  of  A  are  all  0;  that  is,  an  =  0  for  z  =  1, . . .  ,  n.  This  fol¬ 
lows  because  an  edge  from  vertex  v,  to  n,  is  a  loop  and  loops  are  not  allowed 
in  graphs.  (Actually,  we  can  define  the  adjacency  matrix  of  a  pseudograph 
and  show  that  it  has  most  of  the  properties  given  here,  but  we  shall  restrict 
our  attention  to  graphs.) 

2.  The  adjacency  matrix  is  symmetric,  that  is,  =  ajt  for  all  i,  j. 

Conversely,  given  any  symmetric  matrix  A  which  contains  only  0’s  and  l’s 
and  only  0’s  on  its  diagonal,  there  exists  a  graph  Q  whose  adjacency  matrix  is 
A.  Thus,  there  is  a  one-to-one  correspondence  between  graphs  and  symmetric 
0,  1  matrices  with  only  0’s  on  the  diagonal. 

The  next  three  properties  are  less  obvious  than  the  first  two.  We  will  discuss 
them  in  some  detail  after  a  couple  of  Pauses. 

3.  deg  Vi  is  the  number  of  l’s  in  row  i ;  this  is  also  the  number  of  1  ’s  in  column  i 
since  row  i  and  column  i  are  the  same,  by  symmetry. 

4.  The  (/,  j)  entry  of  A2  is  the  number  of  different  walks  from  v,  to  vj  which 
include  two  edges;  thus,  the  degree  of  v/  is  the  / th  entry  on  the  main  diagonal 
of  A2. 


5.  In  general,  for  any  n  >  1,  the  (i,  j )  entry  of  An  is  the  number  of  walks  from 
Vj  to  Vj  which  include  n  edges. 


Can  the  4  x  4  identity  matrix  I  = 


1  0  0 
0  1  0 
0  0  1 
0  0  0 


a  graph? 

Find  a  graph  whose  adjacency  matrix  is  A  = 


0 

0 

0 

1 


0 

0 

0 

1 


be  the  adjacency  matrix  of 


001- 
0  1  0 
1  0  0 
0  0  0 
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EXAMPLE  8 


10.3.3  THEOREM 


EXAMPLE  9 


As  mentioned,  the  powers  of  an  adjacency  matrix  A  evidently  have  special 
significance.  In  order  to  understand  why  the  (7,  j)  entry  of  A1  is  the  number  of 
walks  of  length  2  between  u,  and  vj  in  the  graph  corresponding  to  A,  remember 
that  this  entry  is  the  dot  product  of  row  i  and  column  j  of  A.  Since  A  contains 
only  0’s  and  l’s,  this  dot  product  is  just  the  number  of  coordinates  in  which 
row  i  and  column  j  each  have  a  1.  For  example,  if  row  i  of  A  were  the  vector 
[0,  0,  1,  0,  1,  1,  0]  and  column  j  the  vector  [1 ,  1,  1,  0,  0,  1,0],  then  the  dot  product 
(row  ;)  •  (column  j)  =  2,  corresponding  to  the  two  coordinates  (third  and  sixth) 
where  each  vector  has  a  1.  How  does  it  happen  that  a  row  and  a  column  have  a 
1  in  the  same  coordinate?  In  our  example,  row  i  and  column  j  each  have  third 
coordinate  1 ;  this  corresponds  to  the  fact  that  there  is  an  edge  in  the  graph  between 
Vi  and  U3  and  an  edge  between  vj  and  113.  There  is  a  walk  in  the  graph  from  17  to 
Vj  which  uses  two  edges,  namely,  VjVjV  j-  The  dot  product  of  row  i  and  column  j 
is  therefore  the  number  of  walks  of  length  2  from  17  to  vj . 

In  general,  there  is  one  walk  of  length  2  from  a  vertex  to  itself  for  each 
edge  incident  with  that  vertex.  Thus,  the  degree  of  vertex  17  is  the  diagonal  entry 
an  of  the  square  of  the  adjacency  matrix,  as  asserted  in  Property  4.  The  rest  of 
Property  4  and  Property  5  can  be  justified  with  similar  arguments. 


Referring  to  Fig  10.15,  A2 


1  10  0  1" 
12  0  11 
0  0  0  0  0 
0  10  3  1 
110  12 


The  (4,  2)  entry  of  A 2  is  1  corresponding  to  the  fact  that  there  is  precisely  one 
walk  of  length  2  between  14  and  V2  in  the  associated  graph:  U4U5V2.  The  (4,  4) 
entry  of  A2  is  3;  there  are  three  walks  of  length  2  from  14  back  to  14,  one  for 
each  edge  incident  with  14.  ▲ 


The  third  power  of  the  matrix  A  in  Fig  10.15  is  A3 


"01031" 

12  0  4  3 
0  0  0  0  0 
3  4  0  2  4 

13  0  4  2 


The  (4,  5)  entry  of  A3  is  4.  Thus,  there  are  four  walks  of  length  3  from  V4  to  v=, 
in  the  graph.  What  are  they? 


Since  the  adjacency  matrix  of  a  graph  records  the  number  of  vertices  and  the 
adjacencies  between  them,  the  following  theorem  is  straightforward. 


Two  graphs  are  isomorphic  if  and  only  if  their  vertices  can  be  labeled  in  such  a 
way  that  the  corresponding  adjacency  matrices  are  equal. 


The  graphs  in  Fig  10.16  have  been  labeled  so  that  each  has  the  adjacency  matrix 

"  0  1  0  1  " 

10  10 
0  10  1 
10  10 


Figure  10.16 


Figure  10.17 
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(These  are  the  graphs  which  appeared  in  Fig  9.23,  with  the  labels  A,  B,  C,  D 
replaced  by  v\,  V2,  U3,  t’4,  respectively.)  As  we  earlier  noted,  these  graphs  are 
isomorphic.  ▲ 

U)  v4 

© - 0 

o - © 

v2  V3 

0i 

Two  graphs  labeled  so  as  to  show  that  they  are  isomorphic. 

Suppose  we  are  presented  with  two  graphs  which  are  already  labeled.  Can 
we  tell  from  the  adjacency  matrices  whether  or  not  the  graphs  are  isomorphic? 
Consider  again  the  graphs  of  Fig  10.16,  but  labeled  as  in  Fig  10.17. 


t'l  V4  M]  M3 


0 I  02 

Two  isomorphic  graphs. 


The  adjacency  matrices  Ai  and  A2  of  Q\  and  02,  respectively,  are 


-  0 

1 

0 

1  - 

"  0 

0 

1 

1  " 

1 

0 

1 

1 

0 

and  Ai  = 

0 

0 

1 

1 

0 

0 

1 

1 

1 

0 

0 

_  1 

0 

1 

0  . 

_  1 

1 

0 

0  _ 

Although  these  graphs  are  isomorphic,  their  matrices  are  not  equal  simply  because 
of  the  way  the  graphs  were  presented  to  us,  their  labels  already  in  place.  Observe, 
however,  that  the  following  reassignment  of  labels  to  Q\  defines  an  isomorphism 
01  — ”  02- 


(1) 


m  - 

*  U 1 

v2  - 

*  U4 

V3  - 

■»  M2 

V4  - 

v  M  3 

We  use  this  isomorphism  to  obtain  a  permutation  matrix  P,  that  is,  a  matrix 
whose  rows  are  the  rows  of  the  identity  matrix,  but  not  necessarily  in  their  natural 
order.  Precisely,  let  P  be  that  4x4  matrix  with  row  1  of  the  4  x  4  identity 
matrix  as  its  first  row,  row  2  of  the  identity  as  its  fourth  row,  row  3  of  the 
identity  as  its  second  row,  and  row  4  of  the  identity  as  its  third  row.  In  other 
words. 
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-  1  0  0  0  ' 
0  0  10 
0  0  0  1 
0  10  0 


Notice  how  the  isomorphism  in  (1)  determines  P.  If  the  isomorphism  maps  Vj  to 
uj,  we  put  row  i  of  the  identity  matrix  into  row  j  of  P.  Just  as  the  relabeling  of 
the  vertices  of  Q\  given  by  (1)  transforms  Q\  into  (?2,  the  matrix  P  transforms 
the  adjacency  matrix  of  Q j  into  the  adjacency  matrix  of  Qi  in  the  sense  that 
PA\PT  =  A2.  The  notation  PT  means  the  transpose  of  the  matrix  P,  that  is,  the 
matrix  obtained  from  P  by  interchanging  rows  and  columns. 


PA\PT 


1  0  0  0  " 
0  0  10 
0  0  0  1 
0  10  0. 

0  10  1" 
0  10  1 
10  10 
10  10. 

0  0  1  1  - 
0  0  11 
110  0 
110  0 


-0101 
10  10 
0  10  1 
.10  10 

"10  0  0 
0  0  0  1 
0  10  0 
0  0  10 


=  a2 


-  1  0  0  0  " 

0  0  0  1 

0  10  0 
_  0  0  1  0  _ 


Graph  Q \  shown  at  the  right  is  the  same  as  Q  in 
Fig  10.15,  except  that  the  labels  v\,  V2,  V3,  V4,  v$ 
have  been  replaced  by  u\ ,  M4,  M3,  «2,  t/5,  respec¬ 
tively.  Find  the  adjacency  matrix  A\  of  Q\.  Then 
find  a  permutation  matrix  such  that  PAP1  =  A \. 


M3 

o 


The  proof  of  the  following  theorem,  which  we  have  been  illustrating,  is  not 
especially  interesting  and  will  not  be  included.  The  key  idea  is  that  if  Q  is  a 
graph  with  adjacency  matrix  A,  then  an  isomorphism  from  Q  to  another  graph 
just  amounts  to  renumbering  the  vertices  of  Q.  Renumbering  vertices  corresponds 
to  permuting  the  rows  (equivalently,  computing  PA)  and  permuting  the  columns 
of  A  (equivalently,  computing  APT),  hence,  changing  A  to  the  matrix  PAP1 . 
See  the  Exercises  for  more  details. 


10.3.4  THEOREM 


Answers  to  Pauses 


Labeled  graphs  Q\  and  Q2,  with  adjacency  matrices  A\  and  A2,  respectively,  are 
isomorphic  if  and  only  if  A2  =  PA)  PT  for  some  permutation  matrix  P. 

8.  No,  by  Property  1. 


9.  Here  is  a  graph  with  adjacency  matrix  A  = 


-0 

0 

0 

1 


0  0 
0  1 
1  0 
0  0 


r 

0 

0 

0 
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10.  There  are  four  walks  of  length  3  from  14  to  us :  v4 U5 u? u_s ,  tq v  1 14 U5 ,  V4V5V4V5, 
and  V4V2V4VS. 

'010001  rioooo" 

1  0  0  1  1  0  0  0  10 

11.  A\  =  00000  />  =  00100 

0  1  0  0  1  0  1  0  0  0 

_oioioJ  Looooi_ 

The  equation  PAPT  =  A|  corresponds  to  the  reassignment  of  labels  to  the 
graph  Q,  whose  adjacency  matrix  is  A.  The  matrix  P  is  therefore  the  5  x  5 
matrix  where  row  1  of  the  identity  is  in  row  1,  where  row  2  of  the  identity  is 
in  row  4,  where  row  3  of  the  identity  is  in  row  3,  where  row  4  of  the  identity 
is  in  row  2,  and  row  5  of  the  identity  is  in  row  5. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Find  the  adjacency  matrices  of  the  graphs  in  Figs  10.1 
[BB]  and  10.3. 

2.  What  is  the  adjacency  matrix  of  /C„?  Label  the  vertices 
of  K.m  n  so  that  the  adjacency  matrix  has  an  especially 
nice  form. 

3.  (a)  [BB]  Let  A  be  the  adjacency  matrix  of  the  graph  Q2 

shown  in  Fig  10.8.  Determine  the  (3,  5)  entry  of  A3 
by  inspection  of  the  graph,  that  is,  without  writing 
down  A  explicitly.  Determine  the  (2,  2)  entry  of  A3 
by  similar  means. 

(b)  Repeat  part  (a)  for  the  graph  G\  of  Fig  10.8. 

4.  [BB]  What  is  the  significance  of  the  total  number  of  1’s 
in  the  adjacency  matrix  of  a  graph? 

5.  Let  A  be  the  adjacency  matrix  of  a  graph  Q  whose  vertex 
set  is  [v\ ,  . . .  ,  iv,}-  Prove  that  the  /th  entry  on  the  diago¬ 
nal  of  A3  equals  twice  the  number  of  different  triangles 
which  contain  vertex  v, . 

6.  Suppose  that  Q  is  a  graph  with  adjacency  matrix  A. 

(a)  [BB]  Show  that  the  number  of  walks  of  length  2  in 
G  is  the  sum  of  the  entries  of  the  matrix  A2. 

(b)  Let  dj  denote  the  degree  of  the  vertex  i>,-  in  CL  Show 
that  the  sum  of  the  entries  of  A2  is  also  £*/?. 

7.  Find  the  adjacency  matrices  A\  and  A 2  of  the  graphs 
G 1  and  G2  shown.  Find  a  permutation  matrix  P  such 
that  A2  =  PA\Pt,  thus  proving  that  G 1  and  G2  are 
isomorphic.  [Hint:  See  Pause  6  of  Section  9.3.] 


14  V2  u  1  u  2 


V4  U3  «3 

Q\  Gi 

8.  (a)  [BB]  Find  the  adjacency  matrices  A 1  and  A2  of  the 
graphs  G\  and  G2  shown. 

v\  t>4  u  1  u  3 


v2  v3  u2  u4  w5 

G,  G2 

(b)  [BB]  Explain  why  the  function  <p :  G\  — ►  Gi  defined 
by 

tp{v\)  =  m4,  y»(n2)  =  «5,  tp( U3)  =  u  1, 

(p(v4)  =  M3,  <p(v 5)  =  M2 

is  an  isomorphism. 

(c)  [BB]  Find  a  permutation  matrix  P  which  corre¬ 
sponds  to  the  isomorphism  in  (b)  such  that 
PA\PT  =  A2. 

9.  Repeat  Exercise  8  for  the  graphs  Q\  and  G2  shown.  For 
<p,  take  the  function  G\  — >  G2  defined  by 

tp (Vl)  =  114,  <p{V2)  =  Ml,  <P(V 3)  =  «5, 
tp(v4)  =  «6,  tp(v5)  =  U 3,  <P(V6)  =  It 2- 
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M  = 


v5  u4 

Gi 


U  4 


<?2 


«3 


10.  Repeat  Exercise  8  for  the  graphs  G\  and  Gi  shown.  For 
cp,  take  the  function  Q \  — >  G2  defined  by 

<0(ni)  =  «1,  <P( Vl)  =  M2.  ^(«3>  =  M6,  ¥>(114)  =  Mg, 

=  M4,  (p(v 6)  =  m3,  <p(v7)  =  m7,  <pOg)  =  u5. 
vi 


U1 


M4 


02 


and  let  P  = 


0 


0  I 
0  0  1 
1  0  0 


Thus  P  is  a  permutation  matrix  whose  rows  are  those 
of  the  3  x  3  identity  matrix  in  the  order  2,  3,  1. 

(a)  [BB]  Compute  PA  and  compare  with  A. 

(b)  Compute  AP 7  and  compare  with  A. 

(c)  Compute  PAP1  and  compare  with  A. 


(b)  A,  = 


A2  = 


(c)  Ai  = 


0 


A2  = 


0  10  0 
10  11 
0  10  0 
0  10  0 


1 


0 


0  1 
1  0 


0 

1 

0  I 

0  0 


0  0  0  11 
0  0  10  0 
0 


0  1 
1  0 
1  0  0 
0  1  1 
1 
1 

0  10  0  1 
0 
0 

1 

0 


1  0 
1  0  1 
1  0 
0  0 
0  0  11 
0  0  0  0 


0 


0 


0 


1  0 
1  0  1 
0  1  0 
I  0  1 


0 


0  10  11 


1  0  0 
0  1  0 
0  1 

I  0 

0  1 


0 

I 

0 

0 

1 

1 

0 

I 

0 


14.  [BB]  Let  A  be  the  adjacency  matrix  of  a  bipartite  graph. 
Prove  that  the  diagonal  entries  of  A37  are  all  equal  to  0. 


12.  Let  A 


tation  matrix 


b 

c 

d 

H 

r 

s 

y 

z 

w 

P 

Y 

S 

-  0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

_  0 

1 

0 

0 

and  let  P  be  the  permu- 


Find  PA,  APT,  and  PAP1  without  calculation  but  with 
explanations. 


13.  For  each  pair  of  matrices  A 1 ,  A2  shown,  decide  whether 
or  not  there  is  a  permutation  matrix  P  with  A2  = 
P A\PT.  Either  find  P  or  explain  why  no  such  P  exists. 


r  0 


(a)  [BB]  A,  = 


1 

0 

0 


I 

0 

1 

1 


0  0  - 
I  I 
0  1 
1  0 


15.  Let  A  be  the  adjacency  matrix  of  a  graph  Q. 

(a)  [BB]  Find  a  necessary  and  sufficient  condition  for 
the  matrix  A2  to  be  the  adjacency  matrix  of  some 
graph. 

(b)  Find  a  necessary  and  sufficient  condition  for  A3  to 
be  an  adjacency  matrix. 

16.  Let  A  be  the  adjacency  matrix  of  a  graph  Q  with  at  least 
two  vertices.  Prove  that  Q  is  connected  if  and  only  if,  for 

some  natural  number n,  the  matrix  B  =  A+A2F - FA" 

has  no  zero  entries. 

17.  Suppose  A 1  and  A2  are  the  adjacency  matrices  of  iso¬ 
morphic  graphs  Q\  and  G2,  respectively.  Show  that  A 1 
and  A 2  have  the  same  characteristic  polynomial.  [Hint: 
First  show  that  if  P  is  a  permutation  matrix,  then  PT  = 
P-'.I 

18.  [BB]  Discuss  other  ways,  besides  the  adjacency  matrix, 
that  a  graph  could  be  stored  in  a  computer. 
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10.4  Shortest  Path  Algorithms 

In  this  section,  we  consider  graphs  whose  edges  have  numbers  attached  to  them. 
Typically,  the  number  associated  with  an  edge  is  a  unit  of  time,  distance,  cost,  or 
“capacity”  in  some  sense. 


10.4.1  DEFINITION 


A  weighted  graph  is  a  graph  Q  in  which  each  edge,  e,  is  assigned  a  nonnegative 
real  number,  w(e),  called  the  weight  of  e.  The  weight  of  a  subgraph  of  Q  (often 
a  path  or  a  trail)  is  the  sum  of  the  weights  of  the  edges  of  the  subgraph. 


One  famous  problem  concerning  weighted  graphs  is  known  as  the  Traveling 
Salesman’s  Problem. 


The  Traveling  Salesman's  Problem 

On  a  typical  business  trip,  a  traveling  salesman  visits  various  towns  and  cities.  If 
he  wants  to  avoid  having  to  pass  through  the  same  community  twice,  he  needs 
a  Hamiltonian  cycle  through  the  map  of  towns  and  air  routes.  This  map  can  be 
thought  of  as  a  graph  in  an  obvious  way.  Assigning  to  each  edge  a  weight  equal  to 
the  distance  between  the  cities  at  the  ends,  the  traveling  salesman’s  graph  becomes  a 
weighted  graph.  Among  all  Hamiltonian  cycles  (assuming  there  are  any),  what  our 
salesman  would  like  to  find  is  one  whose  weight  is  a  minimum,  in  order  to  economize 
on  flying  time  and  expenses.  The  problem  of  finding  a  minimum  Hamiltonian  cycle 
in  a  weighted  (Hamiltonian)  graph  is  called  the  Traveling  Salesman’s  Problem.  It 
can  be  solved  by  laboriously  calculating  all  Hamiltonian  cycles  and  then  selecting 
the  most  economical.  At  the  time  of  writing,  it  is  unknown  whether  or  not  a  more 
efficient  algorithm  is  possible.  In  fact,  the  Traveling  Salesman’s  Problem  is  many 
people’s  favorite  example  of  an  NP-complete  problem:  It  is  an  open  question  as  to 
whether  or  not  there  exists  an  efficient  (polynomial  time)  algorithm  for  its  solution. 


Our  aim  in  the  rest  of  this  section  is  to  discuss  a  certain  problem  concerning 
weighted  graphs  for  which,  unlike  the  Traveling  Salesman’s  Problem,  complete 
and  efficient  solutions  exist.  A  shortest  path  between  two  vertices  in  a  weighted 
graph  is  a  path  of  least  weight.  (In  an  unweighted  graph,  a  shortest  path  means 
one  with  the  fewest  number  of  edges.)  Numerous  algorithms  for  finding  shortest 
paths  have  been  discovered.  We  will  present  two  of  them  here  and  show  how 
they  apply  to  the  graph  of  Fig  10.18,  which  we  might  view  as  a  map,  the  vertices 
representing  towns,  the  edges,  roads,  and  the  weights  of  the  edges,  distances. 


A 


E 


Figure  10.18 


F 
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Figure  10.19 


The  first  algorithm  we  present  is  due  to  Holland’s  Edsger  Dijkstra  (1930-), 
one  of  today’s  leading  computer  scientists  and  discoverer  of  a  number  of  important 
graph  algorithms.  Dijkstra’ s  algorithm6  finds  the  shortest  path  from  a  specified 
vertex  A  to  another  specified  vertex  E.  If  continued  indefinitely,  it  gives  shortest 
paths  from  A  to  all  other  vertices  in  the  graph.  If  no  path  exists,  then  this  is 
identified  by  the  algorithm.  It  proceeds  by  progressively  assigning  to  each  vertex 
v  in  the  graph  an  ordered  pair  (x,  d),  where  d  is  the  shortest  distance  from  A  to  v 
and  xv  is  the  last  edge  on  the  shortest  path.  Thus,  if  E  is  eventually  labeled  ( w,t ), 
the  shortest  path  from  A  to  E  is  t  units  long  and  the  last  edge  of  the  shortest  path 
is  wE.  The  first  coordinate  of  the  label  for  w  determines  the  second  last  edge 
of  the  shortest  path,  and  by  continuing  to  work  backward,  the  entire  path  can  be 
found.  If  vertex  E  never  gets  labeled,  there  is  no  path  from  A  to  E\  the  graph  is 
not  connected. 

To  find  a  shortest  path  from  vertex  A  to  vertex  £  in  a  weighted  graph,  carry  out 
the  following  procedure. 

Step  1.  Assign  to  A  the  label  (— ,  0). 

Step  2.  Until  E  is  labeled  or  no  further  labels  can  be  assigned,  do  the  following. 

(a)  For  each  labeled  vertex  u(x,  d)  and  for  each  unlabeled  vertex  v  adjacent 
to  u,  compute  d  +  w(e),  where  e  =  uv. 

(b)  For  each  labeled  vertex  u  and  adjacent  unlabeled  vertex  v  giving  min¬ 
imum  d'  =  d  +  w(e),  assign  to  v  the  label  ( u ,  d').  If  a  vertex  can  be 
labeled  ( x ,  d')  for  various  vertices  x,  make  any  choice. 


Dijkstra’s  shortest  path  algorithm  applied  to  a  weighted  graph. 


Here  is  how  Dijkstra’s  algorithm  works  for  the  graph  in  Fig  10.18.  (Refer 
to  Fig  10.19.)  First,  give  A  the  label  (— ,  0).  There  are  three  edges  incident  with 
A  with  weights  7,  5,  and  8.  Since  d  =  0,  vertex  H  gives  the  smallest  value  of 
d  +  w{e),  so  H  acquires  the  label  (A,  5).  Now  we  repeat  Step  2  for  the  two 
vertices  labeled  so  far. 

There  are  two  unlabeled  vertices  adjacent  to  the  already  labeled  vertex  A.  The 
numbers  d+w(e)  are  0+7  =  7  and  0+8  =  8.  There  are  also  two  unlabeled  vertices 
adjacent  to  the  other  labeled  vertex,  77,  and  for  these,  the  numbers  d  +  w(e)  are 


6E.  W.  Dijkstra,  “A  Note  on  Two  Problems  in  Connection  with  Graphs,”  Numerische  Mathematik  1 
(1959),  269-271. 
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5  +  4  =  9  and  5  +  5  =  10.  The  smallest  d  +  w{e)  is  7,  corresponding  to  the 
labeled  vertex  u  =  A  and  the  unlabeled  v  =  B.  Thus,  B  is  labeled  (A,  7).  Again 
we  repeat  Step  2. 

Now  there  are  three  labeled  vertices.  There  is  one  unlabeled  vertex  adjacent 
to  the  labeled  vertex  A,  and  for  this,  d  +  w(e)  =  0  +  8  =  8.  There  is  also  just 
one  unlabeled  vertex  adjacent  to  the  labeled  77,  and  here,  d  +  w(e)  =  5  +  5  =  10. 
There  are  two  unlabeled  vertices  adjacent  to  the  third  labeled  vertex  B\  for  C, 
d  +  w{e)  =  7  +  8  =  15  and  for  /,  d  +  w(e)  =  10.  The  smallest  d  +  w(e)  is  8, 
corresponding  to  edge  AG.  So  G  acquires  the  label  (A,  8). 

We  repeat  Step  2.  There  are  four  labeled  vertices,  A,  B,  77 ,  and  G,  and  the 
algorithm  requires  that  we  look  at  all  unlabeled  vertices  adjacent  to  each  of  these. 
Since  all  vertices  adjacent  to  A  and  to  H  have  already  been  labeled,  we  have, 
in  fact,  only  to  look  at  B  and  G.  There  are  two  unlabeled  vertices  adjacent  to 
B.  For  C,  d  +  w(e)  =  7  +  8  =  15;  for  7,  d  +  w{e)  =  7  +  3  =  10.  There  are 
two  unlabeled  vertices  adjacent  to  G.  For  I ,  d  +  w(e)  =  8  +  2  =  10;  for  F, 
d  +  w(e)  =  8  +  7  =  15.  The  minimum  d  +  w(e)  occurs  with  7  and  either  edge  Bl 
or  GI.  We  can  therefore  assign  to  7  either  the  label  ( B ,  10)  or  the  label  (G,  10); 
we  opt  for  ( B ,  10). 

Repeating  Step  2,  we  have  only  to  look  at  vertices  B,  G  and  7.  The  only 
unlabeled  vertex  adjacent  to  B  is  C,  for  which  d  +  w{e)  =  7  +  8  =  15.  The  only 
unlabeled  vertex  adjacent  to  G  is  F,  for  which  d  +  w(e)  =  8  +  7  =  15.  There  are 
two  unlabeled  vertices  adjacent  to  7.  For  C,  d  +  w(e)  =  10  +  5=15  and  for  J, 
d  +  w(e)  =  10  +  7  =  17.  Vertices  C  and  F  which  tie  for  the  minimum  d  +  u>(e) 
are  each  labeled.  Vertex  F  is  labeled  (G,  15)  and  C  can  be  labeled  either  (B,  15) 
or  (/,  15);  we  choose  ( B .  15). 

Continuing  in  this  way,  the  vertices  of  the  graph  acquire  the  labels  shown 
in  Fig  10.19.  The  shortest  route  from  A  to  E  has  weight  21.  A  shortest  path  is 
AGFE,  as  we  see  by  working  backward  from  E.  Since  E  was  labeled  last,  the 
algorithm  has  actually  found  the  length  of  a  shortest  route  from  A  to  any  vertex. 
For  instance,  ABIJ  is  a  shortest  path  to  J,  of  weight  17. 

What  is  the  complexity  of  Dijkstra’ s  algorithm?  We  take  as  our  basic  operation 
an  addition  or  comparison  (which  we  weight  equally).  Each  time  we  iterate  Step  2, 
one  or  more  new  vertices  acquire  labels;  the  worst  case  occurs  if  only  one  new 
vertex  is  labeled  each  time.  Assume,  therefore,  that  at  the  kth  iteration,  k  vertices 
have  been  labeled.  Each  of  these  vertices  is  conceivably  adjacent  to  n—k  unlabeled 
vertices.  For  each  such  vertex  there  is  one  addition.  After  at  most  k(n—k)  additions 
in  all,  we  must  find  the  minimum  of  at  most  k(n—k )  numbers,  a  process  requiring 
k{n  —  k)  —  1  comparisons.7  At  worst,  therefore,  Dijkstra’s  algorithm  requires 

f(n)  =  2 kin  -  k)  -  1  ]  =  —  -  -n  +  1 

l  j  3 

operations,  a  function  which  is  O(n-).  Note  that  /(«)  is  also  an  upper  bound  for 
the  total  number  of  operations  required  if  the  algorithm  runs  until  all  vertices  are 
labeled.  [This  was  our  assumption  in  computing  /(/?).] 


^Finding  the  minimum  of  t  numbers  requires  t  —  I  comparisons.  (See  Exercise  4  of  Section  8.2.) 
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10.4.3  DIJKSTRA'S 
ALGORITHM 
(IMPROVED) 


With  a  few  minor  modifications,  the  efficiency  of  Dijkstra’s  algorithm  can  be 
substantially  increased.  We  present  an  improved  version  here  and  apply  it,  as  with 
the  former,  to  determine  the  weight  of  a  shortest  path  from  A  to  E  in  the  graph 
of  Fig  10.18. 

In  the  new  version,  the  starting  point  A  is  assigned  a  permanent  label  of 
0,  while  all  other  vertices  initially  are  assigned  temporary  labels  of  oo.  At  each 
iteration,  the  temporary  labels  are  decreased  or  left  unchanged,  and  one  additional 
vertex  is  assigned  a  permanent  label,  namely,  the  shortest  distance  from  A  to  that 
particular  vertex.  This  procedure  continues  until  the  required  terminal  point  E 
acquires  a  permanent  label,  or  until  some  iteration  results  in  no  temporary  labels 
(including  that  of  E)  being  changed.  In  the  latter  case,  we  can  conclude  that  there 
is  no  path  from  A  to  E. 

To  find  the  length  of  a  shortest  path  from  vertex  A  to  vertex  E  in  a  weighted 
graph,  proceed  as  follows. 


Step  1.  Set  v\  =  A  and  assign  to  this  vertex  the  permanent  label  0.  Assign  every 
other  vertex  a  temporary  label  of  oo,  where  oo  is  a  symbol  which,  by  definition, 
is  deemed  to  be  larger  than  any  real  number. 

Step  2.  Until  E  has  been  assigned  a  permanent  label  or  no  temporary  labels  are 
changed  in  (a)  or  (b),  do  the  following. 

(a)  Take  the  vertex  v,  which  most  recently  acquired  a  permanent  label,  say 
d.  For  each  vertex  v  which  is  adjacent  to  v,  and  has  not  yet  received 
a  permanent  label,  if  d  +  w(v,v )  <  f,  the  current  temporary  label  of  v, 
change  the  temporary  label  of  v  to  d  +  w(vjv). 

(b)  Take  a  vertex  v  which  has  a  temporary  label  smallest  among  all  temporary 
labels  in  the  graph.  Set  v,-+ \  =  v  and  make  its  temporary  label  permanent. 
If  there  are  several  vertices  v  which  tie  for  smallest  temporary  label,  make 
any  choice. 


Here  is  how  the  algorithm  works  for  the  graph  of  Fig  10.18.  We  will  describe 
the  first  few  iterations  in  words,  draw  a  figure  summarizing  the  state  we  have 
reached,  and  then  describe  the  remaining  iterations.  Readers  might  well  wish  to 
follow  our  description  with  their  own  diagrams. 

At  the  start,  A  —  v\  is  given  the  permanent  label  0;  all  others  are  given 
temporary  labels  oo.  Next,  H,  B,  and  G  have  their  temporary  labels  decreased  to 
5,  7,  and  8,  respectively,  while  all  other  temporary  labels  stay  at  oo.  Since  5  is 
the  minimum  of  5,  7,  8,  we  set  v2  =  H  and  make  5  its  permanent  label. 

Now  examine  those  vertices  with  temporary  labels  adjacent  to  u2.  For  B, 
5  +  w(e)  =  5  +  4  =  9,  but  this  is  larger  than  its  temporary  label  of  7,  so  no  change 
occurs.  For  G,  5  +  w(e)  =  5  +  5  =  10,  again  larger  than  the  older  temporary 
label  of  8.  Hence,  no  temporary  labels  are  changed.  The  smallest  temporary  label 
in  the  graph  at  this  point  is  7,  so  this  becomes  the  next  permanent  label  and 
i>3  =  B. 

Next,  we  examine  temporary  vertices  adjacent  to  vt,.  This  will  assign  to  C  a 
new  temporary  label  of  7  +  8  =  15  and,  to  /,  a  new  temporary  label  of  7  +  3  =  10. 
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Figure  10.20 


The  smallest  temporary  label  is  8,  on  G;  so  1)4  =  G  and  this  vertex  acquires  the 
permanent  label  8.  Figure  10.20  shows  the  present  state  of  affairs.  Permanent 
labels  are  circled;  temporary  labels  are  in  parentheses. 


Next,  we  focus  on  V4.  For  /,  we  obtain  8  +  2  =  10,  the  current  label,  so  there 
is  no  change.  For  F,  we  get  8  +  7  =  15,  so  F  gets  a  temporary  label  of  15  and 
115  =  /  is  assigned  the  permanent  label  of  10.  At  the  next  iteration,  we  have  a 
choice  of  C  or  F  for  i<(,  and  J  gets  a  temporary  label  of  17.  Assuming  V(,  =  C, 
then  F  will  be  vj  and  D  will  obtain  a  temporary  label  of  23. 

At  the  next  step,  E  gets  a  temporary  label  of  21  and  v%  =  J .  Then  D  gets  its 
temporary  label  lowered  to  20  and  becomes  1)9.  Finally,  tqo  =  E  with  a  permanent 
label  of  21.  Note  that  the  permanent  labels  are  exactly  the  second  coordinates  of 
the  labels  assigned  to  the  vertices  in  Fig  10.19. 

As  stated,  this  algorithm  gives  the  length  of  the  shortest  path  but  not  the  path 
itself.  If  at  each  choice  of  a  permanently  labeled  vertex  Vj,  however,  we  also  make 
note  of  the  vertex  v?  from  which  its  permanent  label  arose,  using 

(permanent  label  of  V()  +  (weight  of  edge  V(V,-)  =  permanent  label  of  Vj, 

then  we  can  easily  trace  back  the  shortest  path.  Also — and  the  original  algo¬ 
rithm  has  the  same  property — if  allowed  to  continue  until  all  vertices  are  labeled, 
the  improved  version  will  give  the  shortest  distances  from  A  to  all  other  ver¬ 
tices. 

Before  introducing  another  shortest  path  algorithm,  we  justify  the  adjective 
improved,  which  we  applied  to  the  second  version  of  Dijkstra's  algorithm  by 
showing  that  the  complexity  function  for  the  second  version  indeed  has  smaller 
order  than  that  for  the  original. 

In  the  second  version,  exactly  one  vertex  is  given  a  permanent  label  at  each 
iteration  of  Step  2.  Assume  u*  has  just  received  a  permanent  label.  There  are,  at 
worst,  n  —  k  vertices  adjacent  to  v *.  For  each  such  vertex,  one  addition  is  required 
and  then  one  comparison  to  determine  whether  or  not  to  change  a  temporary  label. 
This  process  requires  2 (n  —  k)  operations.  Finally,  to  choose  the  smallest  among 
n—k  temporary  labels  requires  an  additional  n  —  k  —  1  comparisons.  In  all,  at 
most 

n-  1 

f(n)  -  2 (n  -  k)  +  (n  -  k  -  1)  ] 

A  =  1 
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10.4.4  THE 
FLOYD-WARSHALL 
ALGORITHM 


n  —  1 

=  £[3(n  -k)  -  1] 

1 

3  5 

=  - n 2 - n  +  1  =  0(n2) 

1  2 

operations  are  required.  Recalling  that  the  original  version  of  Dijsktra's  algorithm 
was  0(/;3),  we  see  that  the  second  is  indeed  more  efficient  than  the  original. 
Again,  we  also  observe  that  the  complexity  function  is  still  0(n 2)  even  if  it  runs 
until  all  vertices  get  labeled. 

If  our  goal  is  to  find  the  shortest  path  between  every  pair  of  vertices  in 
a  weighted  graph  with  n  vertices,  we  can  employ  either  version  of  Dijkstra’s 
algorithm,  letting  it  run  until  all  vertices  have  acquired  their  final  labels  and 
repeating  this  procedure  for  each  of  the  n  possible  starting  points.  The  complexity 
functions  increase  by  a  factor  of  n  to  0(n4)  in  the  original  version  and  to  O(rr’) 
in  the  improved  case.  There  is  also  an  algorithm,  due  to  R.  W.  Floyd8  and  S.  War- 
shall,9  which  determines  the  shortest  distances  between  all  pairs  of  vertices  in  a 
graph.  This  algorithm  is  popular  because  it  is  so  easy  to  describe. 

To  find  the  shortest  distances  between  all  pairs  of  vertices  in  a  weighted  graph 
where  the  vertices  are  tq  ,  V2,  . . .  ,v„,  carry  out  the  following  procedure. 

Step  1.  For  /  =  1  to  n,  set  d(i,  i )  =  0.  For  i  ^  j,  if  V{Vj  is  an  edge,  let  d(i,  j ) 
be  the  weight  of  this  edge;  otherwise,  set  d(i,  j)  =  oo. 

Step  2.  For  k  =  1  to  n, 

for  i,  j  =  1  to  n,  let  d(i ,  j)  =  min{d(i,  j),  d(i,  k )  +  d(k ,  y)} 

The  final  value  of  d(i,  j )  is  the  shortest  distance  from  v,-  to  Vj. 

Initially,  the  algorithm  sets  the  shortest  distance  from  u,  to  vj  to  be  the  length 
of  edge  VjVj,  if  this  is  an  edge.  After  the  first  iteration  of  Step  2  (k  =  1),  this 
shortest  distance  has  been  replaced  by  the  length  of  the  path  VjViVj,  if  this  is  a 
path.  In  general,  after  stage  k,  the  algorithm  has  determined  the  shortest  distance 
from  Vi  to  Vj  via  the  vertices  v\,  V2, .  ■ .  ,  v^.  This  distance  is  the  true  shortest 
distance  after  k  —  n.  In  Fig  10.21,  we  show  a  graph,  the  initial  values  of  d(i ,  j), 
and  the  values  of  d(i,  j)  after  each  change  in  k. 

The  Floyd-Warshall  algorithm  is  very  efficient  from  the  point  of  view  of 
storage  since  it  can  be  implemented  by  just  updating  the  matrix  of  distances  with 
each  change  in  k\  there  is  no  need  to  store  different  matrices.  In  many  specific 
applications,  it  is  faster  than  either  version  of  Dijkstra's  algorithm  although,  like 
the  improved  version,  it  too  is  0(n2). 


**R.  W.  Floyd,  “Algorithm  97:  Shortest  Path,"  Communications  of  the  Association  for  Computing  Machin- 
ery  5  (1962),  345. 

9S.  Warshall,  “A  Theorem  on  Boolean  Matrices,”  Journal  of  the  Association  for  Computing  Machinery 
9  (1962).  11-12. 


V2  | 


v6  i  v5 


10.4  Shortest  Path  Algorithms 

Initial  values  of  d(i,  j ) 


V] 

v2 

V3 

v4 

v5 

f6 

n 

0 

1 

3 

00 

1 

4 

V2 

1 

0 

1 

oo 

2 

oo 

v3 

3 

1 

0 

3 

oo 

00 

v4 

00 

oo 

3 

0 

1 

2 

v5 

1 

2 

00 

1 

0 

2 

v6 

4 

oo 

00 

2 

2 

0 

331 


After  k  =  1 
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Figure  10.21  An  application  of  the  Floyd-Warshall  algorithm. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  The  Traveling  Salesman's  Problem  is  that  of  finding 
the  Hamiltonian  cycle  of  least  weight  in  a  Hamilto¬ 
nian  graph.  Is  this  what  a  traveling  salesman  necessarily 
wants  to  do?  Discuss. 

2.  (a)  [BB]  Apply  the  original  version  of  Dijkstra's  algo¬ 

rithm  to  find  the  length  of  the  shortest  path  from  A 
to  every  other  vertex  in  the  figure.  Show  the  final 
labels  on  all  vertices.  Also,  find  the  shortest  path 
from  A  to  E. 

(b)  Apply  the  original  version  of  Dijkstra’s  algorithm 
to  find  the  length  of  the  shortest  path  from  H  to 
every  other  vertex  in  the  graph.  Show  the  final 
labels  on  all  vertices.  Find  the  shortest  path  from 
H  to  D. 


3.  [BB;  A  only]  Apply  the  improved  version  of  Dijkstra’s 
algorithm  to  find  the  length  of  a  shortest  path  from  A  and 
from  H  to  every  other  vertex  in  the  graph  of  Exercise  2. 
In  each  case,  exhibit  an  order  in  which  permanent  labels 
might  be  assigned. 

4.  Use  the  first  form  of  Dijkstra’s  algorithm  to  find  the 
shortest  path  from  A  to  R  (and  its  length)  in  the  graph 
shown.  Show  the  final  labels  on  all  vertices. 
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5.  [BB]  Use  the  improved  version  of  Dijkstra’s  algorithm 
to  find  the  length  of  a  shortest  path  from  A  to  R  in 
the  graph  of  Exercise  4.  Also,  exhibit  an  order  in  which 
permanent  labels  might  be  assigned. 

6.  Use  the  original  form  of  Dijkstra’s  algorithm  to  find  the 
shortest  path  from  A  to  T  in  the  graph  shown.  Label  all 
vertices. 


7.  [BB]  Use  the  improved  version  of  Dijkstra’s  algorithm 
to  find  the  shortest  path  from  A  to  E  (and  its  length). 
Label  all  vertices. 


9.  [BB]  Could  Dijkstra's  algorithm  (original  version), 
employed  to  determine  a  shortest  path  from  A  to  £  in 
a  weighted  graph,  terminate  before  E  is  labeled?  Could 
the  improved  algorithm  terminate  before  E  acquires  a 
permanent  label?  Explain. 

10.  (a)  If  weights  were  assigned  to  the  edges  of  the  graph 
shown  in  Exercise  I  of  Section  10.2,  the  Travel¬ 
ing  Salesman’s  Problem  would  not  have  a  solution. 
Why  not? 

(b)  Despite  this  observation,  our  salesman  still  has  to 
complete  the  trip.  Assigning  weights  as  shown  in 
the  figure,  and  assuming  the  salesman  starts  at  A 
and  does  not  wish  to  travel  along  the  same  edge 
more  than  once,  find  a  most  efficient  route  for  the 
trip. 


E  G 


8.  Use  the  improved  version  of  Dijkstra’s  algorithm  to  find 
the  shortest  path  from  A  to  £  in  the  graph  shown.  Show 
the  final  labels  on  all  vertices. 


(c)  Suppose  the  salesman  is  willing  to  cover  the  same 
edge  more  than  once.  Is  the  route  found  in  (b)  still 
the  most  efficient? 

(d)  Use  the  original  form  of  Dijkstra’s  algorithm  to  find 
the  shortest  paths  from  E  to  each  of  the  other  ver¬ 
tices  in  the  above  graph.  Label  all  vertices. 

11.  (a)  [BB]  How  could  any  of  the  algorithms  presented 
in  this  section  be  used  to  find  the  path  requiring 
the  fewest  number  of  edges  between  two  specified 
vertices  in  an  unweighted  graph? 

(b)  Use  one  of  Dijkstra’s  algorithms  to  find  the  shortest 
path  from  A  to  B  in  the  graph. 
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12.  [BB]  Suppose  the  improved  version  of  Dijskstra’s  algo¬ 
rithm  is  used  to  find  the  shortest  distances  between  all 
pairs  of  vertices  in  a  graph  on  n  vertices  by  being  per¬ 
mitted  to  run  until  all  vertices  acquire  their  final  labels. 
Why  is  this  procedure  O(n^)  (in  terms  of  comparisons)? 

13.  Suppose  the  Floyd-Warshall  algorithm  is  applied  to  the 

graph  in  Fig  10.18  (with  vertices  A,...  ,J  relabeled 
v\ . UK),  respectively). 

(a)  What  are  the  final  values  of  d(  1,  1),  </(I,  2), . . .  , 
d(  1.  10)? 

(b)  [BB]  Find  the  values  of  d(l ,  5),  d{  1 . 6),  d( 3, 4),  and 
d( 8,  5)  after  k  =  4. 

(c)  What  is  the  initial  value  of  d( 2.  5)?  How  does  this 
value  change  as  k  increases  from  1  to  10? 

14.  The  Floyd-Warshall  algorithm  is  applied  to  the  graph 
shown. 


Vi  2  »3 


(a)  Find  the  final  values  of  d( 7,1),  d( 7,2),  .... 
d(7,  8). 

(b)  Find  the  values  of  d(  1,2),  d( 3,4),  d( 2,5),  and 
d( 8.  6)  after  k  =  4. 

(c)  Find  the  values  of  d(6,  8)  at  the  start  and  as  k  varies 
from  1  to  8. 

15.  Inadvertently,  David  reverses  the  order  of  the  k  and  the 
i,  j  loops  in  Step  2  of  the  Floyd-Warshall  algorithm, 
implementing  Step  2  in  the  form 

For  i,  /  =  1  to  «, 

for  k  =  1  to  n,  let  <?(;',  /)  =  min[r/(;,  j),  d(i ,  k)+ 
d(kj) } 

Does  this  affect  the  algorithm?  Explain. 

16.  [BB]  Suppose  the  values  of  the  d(i,  j)  for  two  consec¬ 
utive  values  of  k  are  the  same  in  an  implementation  of 
the  Floyd-Warshall  algorithm.  Is  it  necessary  to  con¬ 
tinue,  that  is,  can  it  be  assumed  that  the  values  will  now 
remain  constant? 

17.  Prove  that  the  Floyd-Warshall  algorithm  works.  Specif¬ 
ically,  assume  that  Q  is  a  weighted  graph  with  vertices 

v  i ,  V2 . v„  and  assume  that  there  is  a  path  from  v/ 

to  Vj.  Prove  that  when  the  algorithm  terminates,  d(i,  j) 
is  the  length  of  a  shortest  path  from  u;  to  vj. 

18.  Show  that  the  Floyd-Warshall  algorithm  requires  0(n}) 
additions  and  comparisons. 

19.  Discover  what  you  can  about  Edsger  Dijkstra  and  write 
a  short  note  about  him. 

20.  Discover  what  you  can  about  R.  W.  Floyd  and  write  a 
short  note  about  him. 


REVIEW  EXERCISES  FOR  CHAPTER  10 


1.  In  the  Konigsberg  Bridge  Problem  (see  Fig  9.1),  two 
new  bridges  are  constructed,  one  joining  A  to  C  and 
the  other  B  to  D.  Use  Theorem  10.1.4  to  show  that  the 
answer  to  the  question  is  now  yes. 

2.  One  of  the  mayoralty  candidates  in  Konigsberg  says  that 
he  can  obtain  a  positive  solution  to  the  bridge  prob¬ 
lem  by  building  only  one  new  bridge.  Is  he  telling  the 
truth? 

3.  Suppose  Q\  and  Qi  are  graphs  with  no  vertices  in  com¬ 
mon  and  assume  that  each  graph  possesses  an  Eulerian 


trail.  Show  that  it  is  possible  to  select  vertices  v  and 
w  of  Q i  and  t?2,  respectively,  such  that  if  v  and  w  are 
joined  by  a  new  edge,  the  resulting  graph  will  possess 
an  Eulerian  trail. 

4.  True  or  false?  Explain  your  answers  in  each  case. 

(a)  Every  trail  is  a  path. 

(b)  Every  open  trail  is  a  path. 

(c)  If  there  is  an  open  trail  from  vertex  v  to  vertex  w, 
then  there  is  a  path  from  v  to  w. 

(d)  Every  path  is  an  open  trail. 
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(e)  If  there  is  a  path  from  vertex  u  to  vertex  w,  then 
there  is  an  open  trail  from  v  to  w. 

5.  Let  Q  be  a  connected  graph  with  at  least  two  vertices. 
Assume  that  every  edge  of  Q  belongs  to  a  unique  cycle. 
Prove  that  Q  is  Eulerian. 

6.  Is  the  graph  Hamiltonian?  Is  it  Eulerian?  Explain  your 
answers. 


C 


1.  (a)  How  many  edges  must  a  Hamiltonian  cycle  in  )C„_„ 
contain? 

(b)  How  many  Hamiltonian  cycles  does  „  have? 
Assume  n  >  2. 

(c)  What  is  the  maximum  number  of  edge  disjoint 
Hamiltonian  cycles  in  £„,„? 

(d)  When  n  =  4,  show  that  the  maximum  number  of 
edge  disjoint  Hamiltonian  cycles  predicted  in  7(c) 
is  realized. 

8.  True  or  false?  Explain  your  answers  in  each  case. 

(a)  In  a  Hamiltonian  graph,  every  edge  belongs  to  some 
Hamiltonian  cycle. 

(b)  In  a  Hamiltonian  graph,  every  edge  belongs  to  a 
cycle. 

(c)  Every  Eulerian  graph  contains  a  subgraph  which  is 
Hamiltonian. 

(d)  Every  Hamiltonian  graph  contains  a  subgraph  which 
is  Eulerian. 

9.  A  connected  graph  Q  has  14  vertices  and  88  edges.  Show 

that  Q  is  Hamiltonian  but  not  Eulerian. 

10.  The  following  questions  refer  to  the  graph  Q  drawn  on 

the  left. 


tq  V2  f3  U’\  W  2  U'3 


(a)  Find  the  adjacency  matrix  A  of  Q. 


(b)  Without  any  calculation,  determine  the  (1,5)  entries 
of  A2,  A3,  A4,  and  A5. 

(c)  Find  an  isomorphism  (p:  Q  H,  where  H  is  the 
graph  on  the  right. 

(d)  If  B  is  the  adjacency  matrix  of  H,  find  a  permuta¬ 
tion  matrix  P  such  that  PAP T  =  B.  Check  your 
answer  by  writing  down  B  and  computing  the  prod¬ 
uct  PAPT . 

11.  Let  tq ,  i>2 . ur  and  uq .  uq. ....  u’12  be  the  biparti¬ 

tion  sets  of  the  complete  bipartite  graph  JCg.  12 •  Let  A  be 
the  adjacency  matrix  of  this  graph,  where  the  vertices 
are  listed  in  the  order  tq . vg,  uq . «>I2- 

(a)  What  is  the  (1, 5)  entry  of  A? 

(b)  What  is  the  (8,  9)  entry  of  A? 

(c)  What  is  the  (10,  12)  entry  of  A2? 

(d)  What  is  the  (5,  5)  entry  of  A2? 

(e)  What  is  the  (20,  6)  entry  of  A2? 

(fj  What  is  the  (5,  7)  entry  of  A4? 

(g)  What  is  the  (5,  7)  entry  of  A15? 

12.  Suppose  Q  is  a  connected  graph.  Is  it  possible  to  deter¬ 
mine  from  the  adjacency  matrix  of  Q  whether  or  not  Q 
is  Eulerian?  Explain. 

13.  Martha  claims  that  a  graph  with  adjacency  matrix 

'  0  1  1  10  1  1" 

10  10  10  1 

1  10  0  0  1  1 

A  =  10  0  0  111 

0  10  10  11 

10  1  1  10  0 

1  1  1  1  10  0. 

must  be  Hamiltonian.  How  can  she  be  so  sure? 

14.  If  A  is  the  adjacency  matrix  of  a  graph  Q  and  A2  =  [bq  ], 
find  5  JT  bu. 

15.  Apply  the  first  form  of  Dijkstra's  algorithm  to  the  fol¬ 
lowing  graph,  showing  the  shortest  distances  from  A  to 
every  other  vertex.  Exhibit  an  order  in  which  a  shortest 
path  from  A  to  £  might  be  realized. 


B  2  C 


Review  Exercises  335 


16.  Apply  the  improved  version  of  Dijkstra's  algorithm  to 
the  graph  of  Exercise  15  to  find  the  length  of  a  shortest 
path  from  A  to  every  other  vertex. 

17.  Apply  the  original  form  of  Dijkstra’s  algorithm  to  find 
the  length  of  the  shortest  path  from  A  to  every  other 
vertex.  Show  the  final  labels  on  all  vertices.  Also  find 
the  shortest  path  from  A  to  H . 


B  8  D  2  G 


C  5  F  4  / 


18.  Apply  the  improved  version  of  Dijkstra’s  algorithm  to 
answer  Exercise  17. 

19.  Apply  the  Floyd-Warshall  algorithm,  showing  the  initial 
values  of  d(i ,  j)  and  the  values  of  d(i,  j)  at  the  end  of 
every  step. 


20.  Apply  the  Floyd-Warshall  algorithm  to  the  graph  in 
Exercise  17.  Show  the  initial  values  of  th ed(i,  j),  i,  j  = 

1,2 . 9;  the  values  after  k  =  1,  k  =  3,  and  k  =  5; 

and  the  final  values.  Explain  how  the  algorithm  can  be 
used  to  find  the  shortest  path  from  A  to  H . 

21.  An  encyclopedia  salesman,  traveling  by  car,  wishes  to 
visit  ten  towns  and  return  home  without  passing  through 
the  same  town  twice.  He  knows  that  one,  but  only  one, 
of  the  towns  has  a  direct  connection  to  each  of  the 
other  towns  (that  is,  connections  passing  through  no 
other  towns).  He  also  knows  that  there  are  a  total  of 
39  such  direct  road  connections  between  pairs  of  towns. 
Why  is  he  confident  that  he  will  be  able  to  find  such  a 
route? 
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11.1  The  Chinese  Postman  Problem 

In  this  chapter,  we  present  various  applications  of  the  ideas  developed  in 
Chapter  10.  We  use  what  we  know  about  Eulerian  graphs  to  solve  the  so-called 
Chinese  Postman  Problem  and  to  reconstruct  RNA  chains.  The  latter  subject 
requires  the  concept  of  a  digraph ,  a  graph  in  which  the  edges  have  directions. 
We  show  how  Hamiltonian  paths  can  be  used  in  the  study  of  tournaments  and, 
finally,  we  will  see  how  shortest  path  algorithms  can  be  applied  to  scheduling 
problems. 

A  mail  carrier  who  begins  his  route  at  the  post  office  must  deliver  letters  to 
each  block  in  a  certain  part  of  town  and  then  return  to  the  post  office.  What  is 
the  least  amount  of  walking  the  mail  carrier  must  do?  This  problem,  a  version  of 
which  was  first  solved  by  H.  E.  Dudeney  in  1 9 1 7, 1  is  today  known  as  the  Chinese 
Postman  Problem  because  it  was  studied  in  most  general  form  by  the  Chinese 
mathematician,  Mei-ko  Kwan  (also  known  as  Meigo  Guan).  It  is  clear  that  the 
same  problem  is  faced  by  delivery  people  of  various  kinds,  by  paper  boys,  by 
street  repair  crews,  by  snow  plow  operators,  and  so  on. 

We  represent  the  mail  carrier’s  problem  by  a  weighted  graph  Q  with  each 
vertex  denoting  a  street  corner  and  each  edge  a  street  of  length  the  weight  of  that 
edge.  If  we  assume  that  the  post  office  is  on  a  corner,  then  any  route  the  postman 
follows  corresponds  to  a  closed  walk  in  the  graph  which  uses  each  edge  at  least 
once.  If  the  graph  Q  has  an  Eulerian  circuit,  then  we  have  an  optimal  solution 
since  every  block  will  be  walked  exactly  once.  If  not.  then  the  postman  will  have 
to  walk  certain  blocks  more  than  once,  but  his  aim  is  to  plan  these  extra  trips  so 
that  the  total  distance  walked  is  as  small  as  possible. 


’See  Angela  Newing,  "The  Life  and  Work  of  H.  E.  Dudeney,"  Mathematical  Spectrum  21  (1988/89), 
37-44. 

~M.  K.  Kwan,  "Graphic  Programming  Using  Odd  or  Even  Points,"  Chinese  Math  1  (1962),  273-277. 
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11.1.1  THE  CHINESE 
POSTMAN  PROBLEM 


Figure  11.1 


EXAMPLE  1 


Given  a  connected  possibly  weighted  graph,  find  the  shortest  closed  walk  that 
covers  every  edge  at  least  once. 


The  first  step  in  the  solution  to  this  problem  is  the  realization  that  the  post¬ 
man’s  walk  will  follow  an  Eulerian  circuit  in  a  pseudograph  obtained  from  the 
given  graph  by  the  duplication  of  certain  edges. 

Suppose  that  the  graph  Q  shown  in  Fig  11.1  represents  the  streets  which  a 
postman  must  cover  and  suppose  that  all  streets  have  the  same  length. 


C 


D 


Q' 


Since  the  postman’s  route  covers  all  streets,  it  doesn’t  matter  where  the  post 
office  is,  so  assume  it  is  at  vertex  P.  Since  Q  has  vertices  of  odd  degree  (namely, 
A  and  B),  Q  is  not  Eulerian,  so  the  postman  cannot  service  this  area  by  walking 
each  block  exactly  once.  Our  problem  is  to  determine  the  least  number  of  edges 
which  should  be  duplicated  in  order  to  obtain  an  Eulerian  pseudograph.  Since  all 
vertices  of  an  Eulerian  pseudograph  must  have  even  degree,  we  will  have  to  add 
an  extra  copy  (or  extra  copies)  of  certain  of  the  edges  incident  with  A  and  with  B. 
If  A  and  B  were  joined  by  an  edge,  then  a  single  extra  copy  of  that  edge  would 
certainly  suffice.  Since,  however,  A  and  B  are  not  joined  by  an  edge,  we  need 
at  least  two  additional  edges.  The  pseudograph  Q'  shown  on  the  right  in  Fig  11.1 
(extra  edges  shown  as  dashed  lines)  is  Eulerian  and  a  solution  to  the  Chinese 
Postman  Problem.  One  Eulerian  circuit  is  PACDEAEBGFEBP. 

Unlike  many  graph  theoretical  problems,  the  Chinese  Postman  Problem  has 
been  solved:  Procedures  have  been  found  for  determining  which  edges  of  a  graph 
must  be  duplicated  in  order  to  obtain  the  Eulerian  pseudograph  of  least  weight.' 

We  illustrate  with  reference  to  the  graph  Q  in  Fig  1 1 .2,  where  we  again  assume 
that  all  streets  have  the  same  length.  There  are  four  odd  vertices  in  Q,  labeled 
A,  6,  C.  D  in  the  figure.  Consider  all  the  partitions  of  these  vertices  into  pairs  and, 
for  each  partition,  calculate  the  sum  of  the  lengths  of  shortest  paths  between  pairs. 
As  the  calculations  in  Table  11.1  show,  the  minimum  sum  of  lengths  is  5,  and  this 
is  achieved  with  two  partitions  either  of  which  yields  a  solution  to  the  problem.  If 
we  choose,  for  instance,  {A,  S),  {C,  D},  then  a  pseudograph  of  minimum  weight 
is  obtained  by  duplicating  the  edges  along  the  shortest  path  from  A  to  B  and 


'^See  J.  Edmonds,  “The  Chinese  Postman  Problem.”  Operations  Research  13  Suppl.  1  (1965),  353; 
J.  Edmonds  and  E.  L.  Johnson.  Matching.  Euler  Tours  and  the  Chinese  Postman.  Mathematical  Program¬ 
ming  5  (1973).  88-124;  E.  L.  Lawler,  Combinatorial  Optimization  Networks  and  Matroids ,  Holt,  Rinehart 
and  Winston,  New  York.  1975;  and  E.  Minieka,  Optimization  Algorithms  for  Networks  and  Graphs,  Marcel 
Dekker.  New  York.  1978. 
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Table  11.1 

Partition  into  pairs 

Sum  of  lengths  of  shortest  paths 

{A,  B},  {C,  D) 

2  +  3  =  5 

{A,  C},  {£,  D] 

4  +  2  =  6 

[A,D},{B,C] 

2  +  3  =  5 

along  the  shortest  path  from  C  to  D,  as  shown  on  the  right  of  Fig  11.2.  As 
before,  many  people  could  have  obtained  our  solution  by  inspection.  Computers 
cannot  see,  however.  Our  intent  has  been  to  illustrate  a  concrete  procedure  which 
works  in  all  cases  and  does  not  depend  on  a  picture.  ▲ 

To  find  an  Eulerian  pseudograph  of  minimum  weight  by  duplicating  edges  of  a 
weighted  connected  graph  Q, 


Step  I.  Find  all  the  odd  vertices  in  Q. 


Step  2.  For  each  partition  of  the  odd  vertices  into  pairs  of  vertices, 


it’l,  W’l),  {+><  W2) . {Vm,  Wm), 

find  the  length  of  a  shortest  path  between  each  vt  and  w,  and  add  these 
lengths. 

Step  3.  Take  the  partition  for  which  the  sum  of  lengths  in  Step  2  is  minimum 
and,  for  each  pair  {u(  ,  in,  }  of  vertices  in  this  partition,  duplicate  the  edges  along 
a  shortest  path  from  Vj  to  w, . 

The  second  step  of  this  algorithm  assumes  that  the  odd  vertices  of  a  graph  can 
always  be  partitioned  into  pairs.  Why  should  this  be  the  case? 

In  the  implementation  of  the  algorithm,  will  it  ever  be  necessary  to  duplicate 
an  edge  more  than  once?  In  other  words,  if  an  edge  e  has  been  duplicated  for 
inclusion  in  the  shortest  path  from  u,  to  wi,  will  it  perhaps  be  necessary  later  to 
duplicate  it  for  inclusion  in  the  shortest  path  from  vj  to  w/2 

In  the  Exercises,  we  ask  you  to  show  that  the  pseudograph  produced  by 
this  algorithm  is  Eulerian  and,  furthermore,  that  given  any  Eulerian  pseudograph 
obtained  from  Q  by  duplicating  certain  edges,  the  odd  vertices  can  be  paired  such 
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that  there  are  edge  disjoint  paths  of  new  edges  between  them.  From  this,  it  follows 
that  our  algorithm  indeed  produces  an  Eulerian  pseudograph  of  minimum  weight. 

Suppose  we  attach  some  weights  to  the  edges  of  the  graph  we  considered 
before,  as  shown  in  Fig  1 1 .3.  There  are  still  four  odd  vertices  and  three  partitions 
of  these  into  pairs,  but  the  lengths  of  shortest  paths  between  pairs  of  odd  vertices 
have  changed. 


Figure  11.3 


A 


A 


As  the  calculations  in  Table  1 1 .2  show,  the  minimum  sum  of  lengths  is  now  9, 
corresponding  to  the  partition  {A,D},{#,C}.  Duplicating  the  edges  on  a  shortest 
path  from  A  to  D  and  from  B  to  C,  as  shown  on  the  right  of  Fig  11.3,  we 
obtain  an  Eulerian  pseudograph  of  minimum  weight  and  a  solution  to  the  Chinese 
Postman  Problem  for  the  graph  Q. 


Table  11.2 

Partition  into  pairs 

Sum  of  lengths  of  shortest  paths 

{A,fi},(C,D} 

5  +  5  =  10 

{A,C},{S,D} 

6  +  4  =  10 

{A,  D],  {B,  C} 

5  +  4  =  9 

Answers  to  Pauses 


1.  The  number  of  odd  vertices  in  a  graph  is  even  (Corollary  9.2.7). 

2.  No,  it  won’t.  To  see  why  not,  think  about  how  the  partition 

V  ={{vi,  w i},  { V2 ,  u>2 } . {vm,  wm}} 

was  determined.  Denoting  by  €,•  the  length  of  the  shortest  path  between  vt 
and  Wj,  the  sum,  L  =  J2i  fo  is  least  among  all  similar  sums  arising  from  all 
partitions  of  the  odd  vertices  into  pairs. 

If  an  edge  e  is  duplicated  because  it  is  part  of  the 
shortest  path  between  v,  and  w,,  and  it  is  later 
required  again  because  it  is  part  of  the  shortest 
path  between  vj  and  Wj,  then  the  partition 
obtained  from  P  by  replacing  the  pairs 
{Vj,  Wj),  { Vj ,  Wj}  by  the  pairs  {u,-,  vj},  {wj,  wj } 
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has  sum  of  shortest  path  lengths  less  than  or  equal  to  L  —  2w(e),  contradicting 
the  minimality  of  L. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Solve  the  Chinese  Postman  Problem  for  each  of  the 
graphs  shown. 


2.  [BB;  /C5]  Solve  the  Chinese  Postman  Problem  for  /Cs, 
1C 6,  and  £3, 5. 

3.  [BB]  Solve  the  Chinese  Postman  Problem. 


4.  [BB]  Solve  the  Chinese  Postman  Problem  for  each 
graph. 


Q  C 

1 

0  7  t 

>2  0 

0  1  c 

1 

> 

2 

) 

l6 

3 

0  c 

6  .  i 

© - 6 

3 - © 

w  13 

/  \ 

) 

> 

) 

t )  c 

rs  r 

l 

0 - 6 

5.  Solve  the  Chinese  Postman  Problem  for  each  graph. 


6.  [BB]  Suppose  X  and  Y  are  any  two  vertices  of  a 
weighted  connected  graph  Q.  Explain  why  the  shortest 
mail  carrier's  route  from  X  to  X  has  the  same  length  as 
the  shortest  route  from  Y  to  Y . 


7.  Solve  the  Chinese  Postman  Problem  for  the  unweighted 
graph  shown. 


graph. 


The  remaining  exercises  concern  the  algorithm  presented 
in  this  section  for  obtaining  an  Eulerian  pseudograph  of 
minimum  weight  from  a  given  non-Eulerian  connected 
graph  Q  by  duplicating  certain  edges. 
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9.  [BBJ  Prove  that  the  pseudograph  Q'  which  the  algorithm 
finds  is  Eulerian. 

10.  Suppose  Q  has  n  odd  vertices.  In  how  many  ways  can 
these  be  partitioned  into  |  pairs? 

11.  Suppose  that  Q"  is  any  Eulerian  pseudograph  obtained 
from  Q  by  duplicating  certain  edges. 

(a)  Prove  that  each  odd  vertex  of  Q  is  the  initial  vertex 
of  a  path  consisting  entirely  of  new  edges  (in  Q") 
and  leading  to  another  odd  vertex. 


(b)  Improve  the  result  of  (a)  by  showing  that 

the  odd  vertices  of  Q  can,  in  fact,  be  parti¬ 
tioned  into  two  equal  sets — {i>i,i>2 . v,,,}  and 

(wi ,  u>2,  ■  ■  ■  — with  edge  disjoint  paths  con¬ 

sisting  entirely  of  new  edges  between  each  v, 
and  uij . 

(c)  Conclude  that  of  all  the  pseudographs  obtain¬ 
able  from  Q  by  duplicating  edges,  the  algorithm 
yields  an  Eulerian  pseudograph  of  minimum 
weight. 


11.2  Digraphs 

Often,  when  graphs  are  used  to  model  real-life  situations,  the  edges  represent  lines 
of  communication  such  as  roads  or  pipes.  In  such  situations,  it  is  not  unusual  for 
the  flow  through  an  edge  to  have  an  associated  direction. 


11.2.1  DEFINITION 


A  digraph  is  a  pair  (V,  8)  of  sets,  V  nonempty  and  each  element  of  8  an  ordered 
pair  of  distinct  elements  of  V.  The  elements  of  V  are  called  vertices  and  the 
elements  of  8  are  called  arcs. 


Loosely  speaking,  a  digraph  is  just  a  graph  in  which  each  edge  has  an  orien¬ 
tation  or  direction  assigned  to  it.  (We  use  the  words  “orientation”  and  “direction” 
synonymously.)  It  can  be  pictured  like  a  graph,  with  the  orientation  of  an  arc 
indicated  by  an  arrow:  We  draw  an  arrow  u  — >  v  if  in,  v)  e  8.  Some  digraphs 
are  pictured  in  Fig  1 1.4. 


Figure  11.4 


As  with  most  aspects  of  graph  theory,  there  is  a  rather  extensive  terminology 
used  for  digraphs  and,  as  we  have  noted  before,  different  textbooks  employ  the 
same  words  with  slight  variations  in  meaning.  The  policy  in  this  text  is  to  use 
the  same  terms  for  graphs  and  digraphs  whenever  possible,  with  one  exception: 
In  a  digraph,  we  use  the  term  “arc”  instead  of  “edge.”  An  edge  is  just  a  set,  an 
unordered  pair  of  vertices  {n,  v};  an  arc  is  an  ordered  pair  ( u ,  v)  or  (i>,  u)  (as 
ordered  pairs,  these  are  different).  Just  as  it  has  been  our  custom  when  naming 
edges  of  a  graph  not  to  use  set  notation,  we  omit  parentheses  when  naming  arcs 
of  a  digraph,  thus  referring  to  “the  arc  uv”  rather  than  “the  arc  (w,  v).” 

As  with  the  term  “graph”,  our  definition  of  “digraph”  precludes  the  existence 
of  loops  and  multiple  arcs.  Note  the  subtle  distinction  between  multiple  edges  and 
multiple  arcs,  however.  Since  the  arcs  uv  and  vu  are  different,  they  may  both 
appear  in  a  digraph.  All  the  pictures  in  Fig  1 1 .4  represent  digraphs,  in  particular 
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04,  which,  without  the  arrows,  would  not  be  a  graph.  There  are  multiple  edges 
but  not  multiple  arcs  between  u  and  v. 

In  the  same  way  that  the  vertices  of  a  graph  have  degrees,  a  vertex  of  a 
digraph  has  an  indegree  and  an  outdegree ,  these  being,  respectively,  the  number 
of  arcs  directed  into  and  away  from  that  vertex.  Considering  the  graph  Q-$  in 
Fig  1 1 .4,  vertices  u  and  x  have  indegree  2  and  outdegree  1 ,  vertex  v  has  indegree 
0  and  outdegree  2,  and  vertex  w  has  indegree  1  and  outdegree  1.  The  indegree 
sequence  of  is  2,  2,  1,0  and  the  outdegree  sequence  of  Q$  is  2,  1,  1,  1.  The 
following  result  can  be  proved  with  a  routine  modification  of  the  argument  used 
to  prove  Proposition  9.2.5. 


11.2.2  PROPOSITION 


(cf.  Proposition  9.2.5)  The  sum  of  the  indegrees  of  the  vertices  of  a  digraph  equals 
the  sum  of  the  outdegrees  of  the  vertices,  this  common  number  being  the  number 
of  arcs. 


As  might  be  expected,  when  considering  the  possible  isomorphism  of 
digraphs,  the  direction  of  arcs  must  be  taken  into  account.  In  Fig  11,4,  Q\  and  Qs 
are  not  isomorphic  because  every  vertex  in  Q\  has  outdegree  1  whereas  in  Qi, 
outdegw  =  2.  The  indegree  sequences  of  isomorphic  digraphs  must  be  equal  as 
must  be  their  outdegree  sequences.  With  this  small  change  in  property  2,  Propo¬ 
sition  9.3.2  is  equally  valid  for  digraphs. 

By  convention,  any  walk  in  a  digraph  respects  orientation  of  arcs;  that  is,  each 
arc  is  followed  in  the  direction  of  its  arrow.  For  example,  digraph  Q\  in  Fig  11.4 
is  Eulerian  because  uvwu  is  an  Eulerian  circuit.  This  is  also  a  Hamiltonian  cycle. 
Digraph  Q2  has  neither  an  Eulerian  circuit  nor  a  Hamiltonian  cycle  (the  arc  is  uw, 
not  wu),  although  it  does  have  a  Hamiltonian  path  uvw. 

Explain  why  neither  digraph  Q%  nor  Q4,  of  Fig  1 1 .4,  is  Hamiltonian. 

In  Theorem  10.1.4,  we  established  necessary  and  sufficient  conditions  for  a 
graph  to  be  Eulerian.  This  theorem  can  be  adapted  to  the  situation  of  digraphs 
with  a  suitable  strengthening  of  the  concept  of  connectedness. 


11.2.3  DEFINITION 


A  digraph  is  called  strongly  connected  if  and  only  if  there  is  a  walk  from  any 
vertex  to  any  other  vertex  which  respects  the  orientation  of  each  arc. 


Imagine  a  city  where  every  street  is  one-way.  The  digraph  of  streets  and 
intersections  of  this  city  is  connected  if  it  is  possible  to  drive  from  any  intersection 
to  any  other  intersection,  perhaps  by  moving  the  wrong  way  down  a  one-way 
street,  and  strongly  connected  if  such  travel  is  always  possible  without  breaking 
the  law! 

The  proof  of  the  following  theorem  is  left  to  the  Exercises. 


11.2.4  THEOREM 


(cf.  Theorem  10.1.4)  A  digraph  is  Eulerian  if  and  only  if  it  is  strongly  connected 
and,  for  every  vertex,  the  indegree  equals  the  outdegree. 


Is  digraph  G3  in  Fig  11.4  Eulerian?  What  about  the  digraph  in  Fig  11.5? 


The  adjacency  matrix  A  of  a  digraph  Q  with  vertices  iq ,  vj,  ■  ■  ■  ,  vn  is  defined 
by  setting  a,j  =  1  if  there  is  an  arc  from  vertex  u,-  to  vertex  vj  and  0  otherwise. 
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Figure  11.5 


Figure  11.6 


Figure  11.7 

EXAMPLE  2 


A  digraph  associated  with  K 3. 


Unlike  the  adjacency  matrix  of  a  graph  (see  Section  10.3),  the  adjacency  matrix  of  a 
digraph  is  generally  not  symmetric,  since  an  arc  from  v,  to  Vj  does  not  imply  an  arc 
from  Vj  to  Vj.  Most  assertions  made  in  Section  10.3  apply,  with  appropriate  changes. 
For  example,  the  (/,  j)  entry  of  A2  is  the  number  of  different  walks  of  length  2  from 
Vj  to  Vj  (respecting  the  orientation  of  arcs).  This  number  need  not  be  the  same  as  the 
number  of  walks  of  length  2  from  Vj  to  Vj,  obviously.  The  ;th  entry  of  the  diagonal 
of  A2,  being  the  number  of  walks  of  length  2  from  v,  to  Vj ,  is  the  number  of  vertices 
Vj  such  that  there  exists  a  circuit  of  the  form  VjVjVj.  (See  Fig  1 1.6.) 


Theorems  10.3.3  and  10.3.4  hold  for  digraphs.  Digraphs  are  isomorphic  if  and 
only  if  their  vertices  can  be  labeled  in  such  a  way  that  their  adjacency  matrices  are 
equal.  Labeled  digraphs  Q\  and  Q2  with  adjacency  matrices  A  \  and  A 2,  respectively, 
are  isomorphic  if  and  only  if  A 2  =  PA  \  PT  for  some  permutation  matrix  P. 


Two  isomorphic  digraphs. 


Consider  the  digraphs  of  Fig  1 1.7.  The  adjacency  matrices  of  Q\  and  Q2  are 


-  0 

1 

0 

1  - 

-  0 

0 

1 

0  ~ 

0 

0 

0 

I 

and  A2  = 

1 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

.  0 

0 

1 

0 

_  1 

0 

1 

0  _ 

The  map  <p :  Q\  -*■  Qi  defined  by 

(p{v\)  =  U4,  cp(V2)  =  U\,  <p(VT,)  =  U2,  (p{vA)=U  3 

is  an  isomorphism,  and  the  permutation  matrix 

-  0  1  0  0  - 

0  0  10 

F  -  0  0  0  1 

.1  0  0  0  _ 

determined  by  tp  (as  in  Section  10.3)  has  the  property  that  P A  \  P 1  =  At.  ▲ 
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11.2.5  THE 
BELLMAN-FORD 
ALGORITHM 


The  shortest  path  algorithms  described  in  Section  10.4  can  be  applied  to 
digraphs,  with  obvious  modifications  which  take  into  account  the  orientation  of 
arcs.  Examples  appear  in  our  discussion  of  scheduling  problems  in  Section  11.5. 

Here  is  another  algorithm,  due  to  Richard  Bellman  (1920-1984)  and  Lester  R. 
Ford  Jr.  (1927-)  which,  for  a  directed  graph  with  vertices  v\ ,  ui,  . . .  ,v„,  computes 
successively  the  length  di(j)  of  a  shortest  path  from  m  to  vj  using  at  most  i  arcs. 
Since  no  path  in  such  a  digraph  can  have  more  than  n  —  1  arcs,  the  final  values  of 
dn~\(j),  j  =  2, . . .  ,  n  are  the  lengths  of  shortest  paths  from  v\  to  all  other  vertices. 

Given  a  weighted  digraph  with  vertices  labeled  v\ ,  V2, . . .  ,  vn,  to  find  the  shortest 
distances  from  U|  to  all  other  vertices,  proceed  as  follows. 


Step  1.  for  i,  j  =  1,2, ,  n , 

•  if  i  =  j,  set  w(i,  i)  —  0;  if  i  ^  j  and  VjVj  is  an  arc,  set  w(i,  j)  to 
the  weight  of  v,  vj ;  otherwise,  set  w(i,  j)  =  oo. 

•  set  <5?o(  1 )  =  0  and,  for  j  =  2,  3, . . .  ,  n,  set  do(j)  —  oo. 

•  for  i  =  2  to  n,  set  p(i)  =  1. 

Step  2.  for  i  =  1  to  n 

for  /  =  I  to  n 

•  find  the  k  for  which  min  =  di-\(k)  +  w(vk,  vj )  is  least; 

•  if  min  <  di-\(j),  set  dj(j)  =  min  and  p(j)  =  Vk 
else  set  di(j)  =  dj-i(j). 

Step  3.  if  d„(j)  =  dn-i(j)  for  /  =  1,2 . n, 

for  7  =  1  to  n  output  d„(j)\ 

else  output  “No  shortest  paths.  There  is  a  negative  weight  cycle.” 

If  there  are  no  negative  weight  cycles,  the  value  of  dn(j)  =  dn-i(j)  which  is 
output  in  Step  3  is  the  length  of  a  shortest  path  from  vi  to  vj,  this  path  consisting 
of  vertices  p( 2),  p( 3),  . . .  ,  p(j). 


Figure  11.8 


We  apply  this  algorithm  to  the  graph  shown  in  Fig  11.8.  After  initialization 
in  Step  1 ,  the  process  begins  with  Step  2.  Setting  i  =  1 ,  the  algorithm  computes 
the  weight  of  a  shortest  one  arc  path  to  vertices  V2,  U3,  V5.  The  shortest  path  to 

i>2  has  weight  1,  and  the  last  vertex  on  this  path  is  uj.  The  shortest  path  to  1)5  has 
weight  4,  and  the  last  vertex  on  this  path  is  v\.  Shortest  paths  to  V3  and  V4 ,  using 
at  most  one  arc,  have  infinite  length.  The  first  column  of  the  table  in  Table  11.3 
records  this  data. 

Now  i  =  2  and  the  algorithm  computes  the  length  of  a  shortest  path  to 
t>2.  t!3 ,  V4,  V5  using  at  most  two  arcs.  For  each  j  =  2,  3,  4,  5,  the  algorithm  first 
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Table  11.3  Results  of  applying  the  Bellman-Ford 
algorithm  to  the  digraph  shown  in  Fig  11.8.  The  (/,  j) 
entry  is  oo  or,  when  dj(j)  is  finite,  the  pair  d, (j ) ,  p(j). 

Max.  no.  of  arcs 


1 

2 

3 

4 

5 

V2 

1,  V\ 

1,  V\ 

1,  tq 

1,  Vl 

1,  Vl 

Vertices 

OO 

2,  V2 

2,  v2 

2,  V2 

2,  V2 

V4 

00 

5,  v5 

5,  D5 

4,  P5 

4,  V5 

V5 

4,  vi 

4,  iq 

3,  U3 

3,  V3 

3,  V3 

computes  the  minimum  d\(k)  +  tofu*,  Vj );  that  is,  the  length  of  a  shortest  path 
to  Vk  using  one  arc  plus  the  weight  of  arc  Vk  Vj .  This  number  (min)  is  compared 
with  d\(k),  the  length  of  the  shortest  one  arc  path  to  i >*-.  The  smaller  of  these  two 
numbers  becomes  dzij).  If  the  shortest  path  had  length  min,  the  algorithm  also 
sets  the  predecessor  vertex  p( 2)  =  Vk,  thus  remembering  that  Vk  is  the  vertex 
preceding  on  a  shortest  at  most  two  arc  path  to  vj.  The  shortest  at  most  two 
arc  path  to  i>2  is  still  a  one  arc  path,  and  the  same  is  true  for  i>5-  The  shortest  at 
most  two  arc  path  to  has  length  2,  and  the  last  vertex  on  such  a  path  is  t>2- 
The  shortest  at  most  two  arc  path  to  14  has  length  5,  and  the  last  vertex  on  such 
a  path  is  V5.  We  leave  it  to  you  to  check  the  remaining  columns  of  the  table. 

In  this  example,  all  vertices  eventually  have  finite  labels.  Will  this  always  be  the 
case? 


Figure  11.9  An  application  of  Dijkstra’s  algorithm  (first  form)  to  a  weighted  digraph  with  some  negative 
weights.  The  final  labeling  at  vy  is  wrong. 

In  contrast  to  the  algorithms  of  Dijkstra,  the  Bellman-Ford  algorithm  works 
perfectly  well  on  weighted  digraphs  where  some  arcs  have  negative  weight.  In 
Fig  1 1.9,  we  show  the  result  of  applying  the  original  form  of  Dijkstra’s  algorithm 
to  the  digraph  of  Fig  1 1.8,  with  different  (and  some  negative)  arc  weights.  Since 
neither  version  of  Dijkstra’s  algorithm  examines  a  vertex  again  after  it  has  acquired 
its  permanent  label,  vertex  V3  is  incorrectly  determined  to  be  at  distance  3  from  V] . 
The  correct  length  of  a  shortest  path  from  v\  to  1)3  is,  of  course,  2,  along  v\  U5U4U3, 
and  this  is  found  by  the  Bellman-Ford  algorithm,  as  shown  in  Table  1 1.4. 

Any  path  in  a  digraph  with  n  vertices  cannot  contain  more  than  n  —  1  arcs. 
So  in  Step  2  of  the  Bellman-Ford  algorithm,  why  does  the  outer  for  loop  run 
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Table  11.4  Results  of  applying  the  Bellman-Ford 
algorithm  to  the  digraph  shown  in  Fig  1 1 .9. 

Max.  no.  of  arcs 


1 

2 

3 

4 

5 

l>2 

1, 

1,  V\ 

1,  V) 

1,  V\ 

1,  ih 

Vertices  113 

00 

3,  V2 

2,  v4 

2,  V4 

2,  u4 

v4 

00 

3,  v5 

3,  v5 

3,  v5 

3,  v5 

V5 

4,  V\ 

4,  vi 

4,  v 1 

4,  v\ 

4,  v\ 

from  /  =  1  to  n  and  not  to  /  =  n  —  1?  (Remember  that  i  is  the  number 
of  arcs  in  a  path.)  The  answer  is  in  order  to  detect  negative  weight  cycles 
(which  can  be  reached  from  iq).  This  is  also  the  reason  for  the  inclusion  of 
Step  3.  If  a  weighted  digraph  has  a  cycle  of  negative  total  weight,  there  is 
no  shortest  path  to  any  vertex  on  this  cycle  which  is  reachable  from  v\,  since 
by  going  around  the  cycle,  lengths  can  be  decreased  as  much  as  desired.  Sup¬ 
pose  the  value  of  dn(j)  is  different  (and  hence  less)  than  the  value  of  dn-\(j) 
for  some  vertex  Vj.  Thus  there  is  a  walk  of  n  edges  from  v\  to  vn  of  total 
weight  less  than  any  walk  which  uses  fewer  than  n  edges.  Such  a  walk  must 
contain  a  repeated  vertex,  and  hence  a  cycle  C :  u\U2---u\,  so  the  walk  is 
W:  v\  ■  ■  -  C  ■  ■  -  v„.  Notice  that  the  cycle  must  have  negative  total  weight  since 
otherwise  iq  •••«[•••  vn  (omitting  C)  would  have  weight  less  than  or  equal  to 
W.  It  can  be  shown  that  any  negative  weight  cycle  will  be  detected  by  Bellman- 
Ford  since  in  this  case,  for  some  vertex  Vj,  the  value  of  dn  ( j )  must  be  less  than 
dn-i(j).  [See  Exercise  23(b).] 

FiS  11.9,  change  the  weight  of  W3U5  from  4  to  1.  Make  a  table  like  that  in 
Table  1 1.3.  Examine  the  values  of  e/4 (7 )  and  d$(j)  and  comment. 


Weighted  digraphs  are  helpful  in  settings  where  graphs  are  not  appropriate; 
specifically,  in  any  situation  where  travel  between  two  vertices  can  proceed  in 
only  one  direction.  We  provide  examples  in  the  next  three  sections. 


Answers  to  Pauses 


3.  In  t/3,  vertex  v  has  indegree  0.  Since  there  is  no  way  of  reaching  v  on  a  walk 
respecting  orientation  of  edges,  no  Hamiltonian  cycle  can  exist.  In  Q4,  vertex 
x  has  outdegree  0  so  no  walk  respecting  orientations  can  leave  x. 

4.  The  digraph  @3  is  not  Eulerian.  It  is  not  strongly  connected  (there  is  no  way 
to  reach  v).  Also,  the  indegrees  and  outdegrees  of  three  vertices  (u,  v,  and  x) 
are  not  the  same.  The  digraph  in  Fig  11.5  is  Eulerian,  however.  It  is  strongly 
connected  (there  is  a  circuit  uvwu  which  permits  travel  in  the  direction  of 
arrows  between  any  two  vertices),  and  the  indegree  and  outdegree  of  every 
vertex  is  2.  (An  Eulerian  circuit  is  uwvuvwu.) 

5.  If  there  is  no  directed  path  from  v\  to  some  vertex  17 ,  the  label  for  v,  will 
always  be  00. 
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6.  Max.  no.  of  arcs 


1 

2 

3 

4 

5 

Since  cU(v4)  =  3  and 

V2 

1,  U] 

1,  V\ 

1,  Ul 

1,  V] 

1,  ni 

dsfa)  =  2,  the 

Vertices  123 

OO 

3,  v2 

2,  V4 

2, 1)4 

2,  V4 

algorithm  has  detected 

l>4 

OO 

3,  V5 

3,  u5 

3,  V5 

2,  v5 

the  negative  cycle 

V5 

4,  v\ 

4,  v\ 

4.  U] 

3,  U3 

3,  V3 

U3U5U4U3. 

EXERCISES 


The  symbol  [ BB ]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Which  of  the  following  pairs  of  digraphs  are  isomor¬ 
phic?  Explain  your  answers. 


2.  Must  the  indegree  and  the  outdegree  sequences  of  a 
digraph  be  the  same?  Explain. 

3.  [BB]  Prove  Proposition  1 1.2.2. 

4.  Exhibit  a  digraph  which  is  strongly  connected  but  not 
Eulerian. 

5.  [BB]  Let  Q  be  a  connected  graph  with  all  vertices  of 
even  degree.  Can  the  edges  of  Q  be  oriented  so  that  the 
resulting  digraph  is  Eulerian?  Explain. 

6.  Let  A  be  the  adjacency  matrix  of  a  digraph  Q.  What 
does  the  sum  of  the  entries  in  row  i  of  A  represent? 
What  about  the  sum  of  the  entries  in  column  i? 


7.  [BB]  Let  A  be  the  adjacency  matrix  of  the  digraph  pic¬ 
tured  at  the  left. 

(a)  Find  A. 

(b)  Determine  the  (3,  3)  and  (1.4)  entries  of  A2  with¬ 
out  calculating  the  matrix  A2  and  explain  your  rea¬ 
soning. 

(c)  Determine  the  (4,  2)  entry  of  A3  and  the  (1, 3)  entry 
of  A4  without  calculating  A3  or  A4  and  explain  your 
reasoning. 

(d)  Is  the  digraph  strongly  connected?  Explain. 

(e)  Is  the  digraph  Eulerian?  Explain. 


1 


8.  Repeat  Exercise  7  for  the  digraph  at  the  right  in  that 
Exercise. 

9.  Let  A  be  the  adjacency  matrix  of  a  digraph  Q.  Explain 
why  Ar  is  also  the  adjacency  matrix  of  a  digraph  QT . 
Is  it  possible  for  Q  and  Q1  to  be  isomorphic?  Explain. 

10.  [BB;  (a)]  Label  the  vertices  of  each  pair  of  digraphs  in 
Exercise  1  and  then  give  the  adjacency  matrices  corre¬ 
sponding  to  your  labeling.  (For  any  pair  of  graphs  which 
are  isomorphic,  label  in  such  a  way  that  the  two  graphs 
have  the  same  adjacency  matrix.) 

11.  [BB]  Consider  the  digraphs  Q i,  Gi  shown. 

(a)  Find  the  adjacency  matrix  A\  of  Q\  and  the  adja¬ 
cency  matrix  Ai  of  Gl¬ 
ib)  Explain  why  the  map  <p:  G\  ->  Gi  defined  by 

<p(vi)  —  U 2,  <p(U2)  =  «1,  <p(v 3)  =  «4,  <P( V4)  =  «3 
is  an  isomorphism. 

(c)  Find  the  permutation  matrix  P  which  corresponds 
to  (p  and  satisfies  PA\PT  =  A2. 

(d)  Are  these  digraphs  strongly  connected? 
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(e)  Are  these  digraphs  Eulerian? 


12.  Repeat  Exercise  1 1  for  the  digraphs  shown  and  the  map 
<p  \  Q\  — >  Q2  defined  by 

<p(Vl)  =  M3,  <p(v 2)  =  i/4,  tp( Vi)  =  M2,  <p(V4)  =  «5, 
»5(l>5)  =  Wi¬ 
th 


13.  In  each  of  the  following  cases,  find  a  permutation  matrix 
P  such  that  A2  =  PA  \  P1 . 


-  0 

1 

0 

1  - 

-  0 

0 

1 

1  - 

0 

0 

1 

0 

,  A2  = 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

_  0 

0 

0 

0  . 

_  0 

1 

0 

0  . 

-  0 

0 

1 

0  - 

-  0 

0 

0 

I  - 

0 

0 

0 

1 

,  A2  = 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

_  0 

1 

0 

0  _ 

.  1 

0 

0 

0  _ 

-  0 

1 

0 

1 

1  - 

■  0 

0 

0 

0 

1  - 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

1 

>  ^2  = 

1 

1 

0 

1 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

.  0 

1 

0 

0 

0  . 

.  0 

1 

0 

1 

0  _ 

14.  [BB;  (a)]  Which  of  the  digraphs  in  Exercise  1  are 
strongly  connected?  Explain  your  answers. 


16.  [BB]  If  a  graph  Q  is  connected  and  some  orientation 
is  put  on  the  edges  of  Q,  must  the  resulting  digraph  be 
strongly  connected?  Explain. 

17.  Prove  Theorem  1 1.2.4. 


18.  [BB]  Answer  true  or  false  and  explain:  If  a  digraph  Q 
with  at  least  two  vertices  is  strongly  connected,  then 
every  vertex  of  Q  is  part  of  a  circuit  (consisting  of  more 
than  a  single  vertex). 

19.  Apply  the  Bellman-Ford  algorithm  to  the  digraphs 
shown,  answering  by  means  of  a  table  like  Table  1 1 .3. 


(a)  [BB] 


v2  1  v? 


V5 


20.  [BB]  Would  it  be  sensible  to  use  the  Bellman-Ford 
algorithm  on  undirected  graphs?  Comment. 


21.  (a)  [BB]  Apply  the  original  form  of  Dijkstra’s  algo¬ 
rithm.  Does  it  give  correct  lengths  of  shortest  paths? 
Explain. 

(b)  [BB]  Would  Bellman-Ford  work?  Explain. 


15.  An  orientation  of  a  graph  can  be  achieved  by  replacing 
each  edge  (x,  y],  an  unordered  pair  of  vertices,  by  one 
of  the  ordered  pairs  (x,  y)  or  (y,  x);  that  is,  you  put  an 
arrow  on  each  edge  of  the  graph. 

(a)  [BB]  Find  all  nonisomorphic  orientations  of  K. 3. 

(b)  Find  all  nonisomorphic  orientations  of  K. 4. 

(c)  Find  all  nonisomorphic  orientations  of  K.2,3. 


22.  When  the  Bellman-Ford  algorithm  detects  the  presence 
of  a  negative  weight  cycle,  can  it  be  adapted  to  exhibit 
such  a  cycle?  Explain. 

23.  (a)  [BB]  The  Bellman-Ford  algorithm  can  be  terminated 

as  soon  as  two  successive  columns  are  the  same;  that 
is,  if  for  two  successive  values  of  i,  the  numbers 
di(2), ...  , d,(n)  are  identical.  Explain. 
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(b)  Show  that  the  Bellman-Ford  algorithm  will  detect 
the  presence  of  any  negative  weight  cycle  in  a 
weighted  digraph  (assuming  there  is  at  least  one 
directed  path  from  v\  to  every  other  vertex). 

24.  Find  a  reasonable  estimate  of  the  complexity  of  the 
Bellman-Ford  algorithm,  in  terms  of  comparisons. 

25.  In  an  attempt  to  get  the  algorithms  of  Dijkstra  to  work  A 
correctly  on  digraphs  with  negative  weight  edges,  the 


following  idea  is  proposed.  Determine  the  highest  neg¬ 
ative  weight  k  of  an  arc,  add  |&|  to  the  weight  of  every 
arc  (now  all  arcs  have  nonnegative  weights),  use  Dijk¬ 
stra  to  find  shortest  paths,  and  finally  subtract  from  each 
final  weight,  |£j  times  the  number  of  arcs  required  on  a 
shortest  path.  Comment. 

26.  Discover  what  you  can  about  Bellman  and  Ford  and 
write  a  short  note  about  these  men. 


11.3  RNA  Chains 

In  October  1989,  it  was  announced  that  Sidney  Altman,  a  Canadian-born  pro¬ 
fessor  at  Yale  University,  and  Thomas  Cech  of  the  University  of  Colorado  had 
won  jointly  the  Nobel  Prize  in  chemistry  for  research  on  ribonucleic  acid  that 
“could  help  explain  the  origins  of  life”  and  “probably  provide  a  new  tool  for  gene 
technology.” 

Ribonucleic  (pronounced  “rl-bo-noo-cla-ic”)  acid  (RNA  for  short)  and  de¬ 
oxyribonucleic  acid  (DNA)  are  genetic  materials  carried  in  the  cells  of  the  human 
body.  In  fact,  RNA  is  a  chain  each  link  of  which  is  one  of  four  chemicals:  uracil, 
cytosine  (“sT-to-zen”),  adenine,  and  guanine.  Throughout  this  section,  we  will  use 
the  letters  U,  C,  A,  and  G,  respectively,  to  represent  these  chemicals.  Here,  for 
example,  is  one  RNA  chain: 

(1)  UCGAGCUAGCGAAG. 

There  are  two  kinds  of  enzymes  which  break  the  links  in  an  RNA  chain. 
The  first,  called  a  G-enzyme,  breaks  a  chain  after  each  G-link.  For  example,  this 
enzyme  would  break  the  preceding  chain  into  the  following  fragments : 

G-fragments:  UCG,  AG,  CUAG,  CG,  AAG. 

The  second  enzyme,  called  a  U,C-enzyme,  breaks  the  chain  after  each  U-  and  after 
each  C-link.  Thus,  it  would  break  the  chain  in  (1)  into  the  fragments 

U,C-fragments:  U,  C,  GAGC,  U,  AGC,  GAAG. 

It  is  apparently  not  possible  to  discover  directly  the  sequence  of  U’s,  C’s,  A's, 
and  G’s  which  comprise  a  particular  chain,  but  using  the  enzymes  just  described, 
the  collection  of  fragments  of  the  chain,  the  so-called  complete  enzyme  digest , 
can  be  determined.  The  problem  is  then  how  to  recover  an  RNA  chain  from  its 
complete  enzyme  digest.  In  this  section,  we  show  how  to  solve  this  problem  with 
a  method  which  makes  use  of  graph  theory. 

There  are  5!  possible  RNA  chains  with  the  G-fragments  just  listed.  (In  real 
situations,  the  number  is  much  larger.4)  Our  aim  is  to  find  which  of  these  120 


4The  first  chain  successfully  recovered  contained  77  chemicals.  See  Robert  W.  Holley.  Jean  Apgar, 
George  A.  Everett,  James  T.  Madison,  Mark  Marquisse,  Susan  H.  Merrill,  John  Robert  Penswick,  and  Ada 
Zamir,  “Structure  of  a  Ribonuclei  Acid,”  Science  147  (1965),  1462-1465. 
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chains  also  gives  rise  to  the  given  set  of  U,C-fragments.  Our  method  involves 
associating  with  any  complete  listing  of  G-  and  U,C-fragments  a  directed  pseu¬ 
dograph  (it  is  seldom  a  graph)  and  hunting  for  Eulerian  circuits,  each  of  which 
corresponds  to  an  RNA  chain  with  the  specified  fragments.  If  the  pseudograph  is 
not  Eulerian,  then  no  such  chain  exists.  (Perhaps  an  error  was  made  in  listing  the 
fragments.) 

In  any  recovery  problem,  we  may  assume  that  there  are  at  least  two  G- 
fragments  and  two  U,C-fragments;  otherwise  there  is  no  difficulty  in  obtaining 
a  solution.  Also,  it  is  straightforward  to  determine  the  fragment  with  which 
the  chain  ends.  In  the  preceding  listing  of  fragments,  for  example,  the  U,C- 
fragment  GAAG  is  abnormal:  U,C-fragments  should  end  in  U  or  C  but  GAAG 
does  not.  The  only  way  this  could  happen  is  for  GAAG  to  be  at  the  end  of  the 
chain. 

Using  the  fact  that  GAAG  must  come  last  in  the  sought  after  RNA  chain,  lower 
the  previous  estimate  of  120  possible  chains  with  the  given  fragment  lists  by 
determining  how  many  RNA  chains  have  the  set  of  U,C-fragments  listed  pre¬ 
viously. 

The  initial  lists  of  fragments  will  always  contain  at  least  one  abnormal  frag¬ 
ment  (at  the  end).  If  this  is  a  G-fragment  ending  in  U  or  C,  or  a  U, C-fragment 
ending  in  G,  then  there  will  be  only  one  abnormal  fragment  and,  in  that  case,  the 
final  fragment  is  uniquely  determined.  The  only  way  in  which  there  can  be  two 
abnormal  fragments  is  if  they  both  end  in  A.  In  this  case,  however,  one  will  be 
identifiable  as  contained  within  the  other;  it  is  the  longer  of  the  two  with  which 
the  chain  ends.  Again,  the  final  fragment  is  uniquely  determined. 

There  can  never  be  more  than  two  abnormal  fragments.  Why? 

To  illustrate  the  general  procedure,  we  consider  the  example  described  previ¬ 
ously.  We  assume  that  we  know  only  the  fragments  and,  to  make  things  amusing, 
change  the  order  of  the  fragments  and  start  with  these  lists: 

G-fragments:  CUAG,  AAG,  AG,  UCG,  CG 

U,C-fragments:  U,  U,  AGC,  C,  GAAG,  GAGC. 

Imagine  splitting  the  G-fragments  further  using  the  U,C-enzyme  and  splitting 
the  U,C-fragments  further  with  the  G-enzyme.  For  example,  UCG  in  the  first  list 
would  become  U,  C,  G,  while  AGC  in  the  second  list  would  become  AG,  C.  A 
fragment  like  AAG  wouldn’t  split  further.  The  resulting  bits  are  called  extended 
bases',  those  extended  bases  which  are  neither  first  nor  last  in  a  fragment  are 
called  interior.  Thus,  a  fragment  which  splits  into  n  >  3  extended  bases  will 
yield  n  —  2  interior  extended  bases.  In  the  splitting  of  the  fragment  UCG  with  the 
U,C-enzyme,  U,  C  and  G  are  extended  bases;  C  is  interior.  Similarly,  the  extended 
base  AG  from  the  U, C-fragment  GAGC  is  interior.  A  fragment  must  split  into  at 
least  three  extended  bases  before  it  gives  rise  to  an  interior  extended  base;  there 
are  no  interior  extended  bases  from  the  fragment  AGC,  for  instance. 

We  now  list  all  the  interior  extended  bases  obtained  by  the  above  splitting. 
In  our  example,  we  obtain 


interior  extended  bases:  U,  C,  AG. 
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which  arise  from  the  G-fragments  CUAG,  UCG  and  the  U,C-fragment  GAGC,  respec¬ 
tively.  Next,  we  list  all  fragments  (G-  or  U,C-)  which  are  unsplittable,  that  is, 
fragments  which  are  already  extended  bases.  In  our  example,  we  obtain 

unsplittable  fragments:  AAG,  AG,  U,  U,  C, 

these  being  the  G-fragments  which  the  U,C-enzyme  does  not  split  and  the  U,C- 
fragments  which  the  G-enzyme  does  not  split. 

There  are  exactly  two  unsplittable  fragments  which  are  not  interior  extended 
bases,  namely,  the  fragments  AAG  and  U.  These  are  the  fragments  with  which  the 
original  chain  begins  and  ends.  Since  we  have  already  determined  that  the  chain 
ends  with  GAAG,  we  know  that  U  must  be  the  first  fragment  in  the  chain. 

Finally,  we  will  see  how  graph  theory  comes  into  play.  Consider  again  the 
complete  enzyme  digest: 

G-fragments:  CUAG,  AAG,  AG,  UCG,  CG 

U,C-fragments:  U,  U,  AGC,  C,  GAAG,  GAGC. 

Having  determined  the  fragments  with  which  the  chain  begins  and  ends,  we  pro¬ 
ceed  to  construct  a  directed  pseudograph.  Consider  any  fragment  in  one  of  the 
preceding  lists  which  is  not  an  extended  base  (in  other  words,  which  splits  under 
the  action  of  the  other  enzyme),  UCG  for  example.  Draw  two  vertices  labeled  U 
and  G  for  the  first  and  last  extended  bases  in  this  G-fragment  and  join  them  by 
an  arc  labeled  UCG  going  from  the  first  extended  base  to  the  last,  as  shown  in 
Fig  11.10. 

UCG 

Figure  11.10  Uo - *■ - QG 

Similarly,  the  G-fragment  CG  determines  an  arc  labeled  CG  from  a  vertex 
labeled  C  to  one  labeled  G.  Repeating  for  all  normal  fragments  which  split  (not 
just  those  giving  rise  to  an  interior  extended  base),  we  obtain  all  parts  of  the 
directed  pseudograph  shown  in  Fig  11.11  except  for  the  arc  labeled  GAAG*U.  By 
what  reasoning  have  we  included  this  additional  arc? 

AGC 


GAAG*U 


Figure  11.11  A  directed  pseudograph  determining  an  RNA  chain. 

The  longest  abnormal  fragment  is  GAAG  and,  as  we  have  seen,  this  is  the 
last  fragment  in  the  chain.  Draw  an  arc  from  G,  its  first  extended  base,  to  U,  the 
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11.3.1  RECOVERY  OF 
AN  RNA  CHAIN  FROM 
ITS  COMPLETE  ENZYME 
DIGEST 


first  extended  base  in  the  chain  (determined  earlier)  and  label  this  arc  GAAG*U. 
To  find  the  RNA  chain,  we  look  for  Eulerian  circuits  in  the  pseudograph  which 
begin  with  U  and  end  with  the  arc  GAAG*U.  We  then  obtain  the  original  chain 
by  writing  down  the  labels  of  the  arcs  on  the  Eulerian  circuit  in  order,  noting 
that  each  vertex  encountered  is  listed  twice  on  the  arc  labels,  so  one  of  these 
occurrences  should  be  discarded.  If  there  is  more  than  one  Eulerian  circuit,  then 
there  is  more  than  one  chain  with  the  given  fragments.  If,  by  some  chance,  no 
Eulerian  circuit  exists,  then  the  original  fragments  contradicted  each  other;  there 
is  no  RNA  chain  with  these  fragments. 

In  the  directed  pseudograph  of  Fig  11.11,  there  is  just  one  Eulerian  circuit: 
UCG,  GAGC,  CUAG,  AGC,  CG,  GAAG*U.  Eliminating  overlap  at  each  vertex,  we 
obtain  the  RNA  chain  UCGAGCUAGCGAAG. 

Here  is  a  summary  of  the  method  we  have  outlined. 


1.  Determine  the  fragment  with  which  the  chain  ends.  It’s  the  abnormal  frag¬ 
ment  if  there  is  only  one  and  otherwise,  the  longer  of  the  two  abnormal 
fragments. 

2.  List  the  interior  extended  bases  and  the  unsplittable  fragments.  The  chain 
begins  and  ends  with  the  two  unsplittable  fragments  which  are  not  interior 
extended  bases.  Thus,  the  first  and  last  fragments  of  the  chain  are  known. 

3.  Form  a  directed  pseudograph  as  follows: 


(a)  For  each  normal  fragment  which  splits  under  the  action  of  the  other 
enzyme,  draw  an  arc  from  a  vertex  labeled  with  its  first  extended  base  to 
a  vertex  labeled  with  its  last  extended  base.  Label  the  arc  with  the  name 
of  the  fragment. 

(b)  Draw  a  final  arc  from  the  first  extended  base  of  the  largest  abnormal 
fragment  X  to  the  first  extended  base  Y  of  the  chain.  Label  this  arc 
X*Y. 

4.  Determine  all  Eulerian  circuits  in  the  pseudograph  which  end  with  X*Y  and 
write  down  in  order  the  labels  of  the  arcs  without  repeating  the  label  of  each 
vertex  on  the  circuit. 


PROBLEM  3.  Determine  the  RNA  chain  or  chains  (if  any)  which  give  rise  to 
the  following  complete  enzyme  digest. 

G-fragments:  AACUG,  UAG,  A,  AG,  AG,  AG,  G 

U,C-fragments:  U,  AGAAC,  AGAGA,  GGAGU. 

Solution.  There  are  two  abnormal  fragments,  A  and  AGAGA.  The  longer  is 
AGAGA,  so  this  is  the  fragment  with  which  the  chain  ends. 

The  interior  extended  bases  and  unsplittable  fragments  are  these: 

interior  extended  bases:  U,  AG,  G,  AG 
unsplittable  fragments:  A,  AG,  AG,  AG,  G,  U. 
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(Notice  that  the  U,C-fragment  GGAGU  yields  two  interior  extended  bases.)  The 
two  unsplittable  fragments  which  are  not  extended  bases  are  AG  and  A.  Since  the 
chain  ends  AG  AG  A,  it  begins  AG. 


In  Fig  11.12,  we  show  the  directed  pseudograph  which  depicts  this 
situation.  The  extended  base  with  which  the  longer  of  the  abnormal  frag¬ 
ments  begins  is  AG,  which  is  also  the  initial  fragment.  Thus,  the  final  arc,  labeled 
AGAGA*AG,  is  a  loop  from  AG  to  itself.  The  unique  answer  is 
AGAACUGGAGUAGAGA.  I 


The  connection  between  graph  theory  and  the  problem  of  recovering  RNA 
chains  from  their  enzyme  digests  does  not  seem  to  be  well  known.  It  came  first  to 
our  attention  in  a  book  on  combinatorics  by  Fred  Roberts,5  although  the  ideas  seem 
to  have  originated  in  work  of  James  E.  Mosimann.6  We  are  indebted  to  George 
Hutchinson7  for  making  available  some  additional  material  to  complement  his  and 
Mosimann’ s  journal  articles. 

We  would  also  like  to  thank  Daniel  Kobler  for  making  us  aware  of  the  fact 
that  graph  theory  techniques  similar  to  those  described  in  this  section  have  been 
used  recently  to  study  problems  of  recovering  DNA  chains  from  other  types  of 
information  about  fragments.8 


Answers  to  Pauses 


7.  Since  GAAG  comes  last,  we  seek  the  number  of  arrangements  of  the  frag¬ 
ments  U,C,GAGC,U,AGC.  Since  U  appears  twice,  the  answer  is  ^5!  =  60.  (See 
Section  7.3.) 

8.  There  can  be  at  most  one  abnormal  G-fragment  and  at  most  one  abnormal 
U,C-fragment. 


5Fred  S.  Roberts,  Applied  Combinatorics,  Prentice  Hall,  New  Jersey,  1984. 

6  James  E.  Mosimann,  Elementary  Probability  for  the  Biological  Sciences ,  Appleton-Century-Crofts,  New 
York,  1968. 

7George  Hutchinson,  "Evaluation  of  Polymer  Sequence  Fragment  Data  Using  Graph  Theory,"  Bulletin 
of  Mathematical  Biophysics  31  (1969),  541-562. 

_ ,  Evaluation  of  Polymer  Sequence  Data  from  Two  Complete  Digests,  Internal  Report,  Lab¬ 
oratory  of  Applied  Studies,  Division  of  Computer  Research  and  Technology,  National  Institute  of  Health, 
Maryland  (1968). 

^For  example,  see  J.  Blazewicz,  A.  Hertz,  D.  Kobler,  D.  de  Werra,  “On  Some  Properties  of  DNA 
Graphs,”  Discrete  Applied  Mathematics  98  (1999),  1-19. 


11.4  Tournaments  355 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Answer  the  following  questions  for  each  pair  of  frag¬ 
ment  lists  given. 

i.  From  how  many  RNA  chains  could  the  given  list  of 
G-fragments  arise?  From  how  many  chains  could  the 
given  list  of  U,C-fragments  arise?  Which  of  these 
numbers  provides  a  “better”  estimate  of  the  num¬ 
ber  of  RNA  chains  whose  G-fragments  and  U,C- 
fragments  are  as  described? 

ii.  Find  all  RNA  chains  with  the  given  complete  en¬ 
zyme  digests. 

(a)  [BB]  G-fragments:  CUG,  CAAG,  G,  UC 

U,C-fragments:  C,  C,  U,  AAGC,  GGU 

(b)  G-fragments:  G,  UCG,  G,  G,  UU 

U,C-fragments:  GGGU,  U,  GU,  C 

(c)  G-fragments:  UCACG,  AA,  CCG,  AAAG,  G 

U,C-fragments:  C,  GGU,  GAAAGAA,  C,  AC,  C 

(d)  G-fragments:  UUCG,  G,  ACG,  CUAG 

U,C-fragments:  G,  C,  GGAC,  U,  AGU,  C,  U 

(e)  [BB]  G-fragments:  UG,  CC,  ACCAG,  G,  AUG 

U,C-fragments:  C,  GAU,  C,  AGC,  GAC, 

GU 

(f)  G-fragments:  CACG,  AUG,  UCAG,  AG,  UG 

U,C-fragments:  AC,  C,  GU,  G,  AGAU,  GAGC,  U 

(g)  G-fragments:  UCG,  AUAG,  AG,  UCAAUAG,  G 

U,C-fragments:  C,  AGU,  AG,  GGAU,  C,  AGU, 

AAU 


(h)  G-fragments:  UAG,  CUAAG,  UA,  G,  UCAG, 

UAG 

U,C-fragments:  C,  U,  U,  AGGC,  AAGU,  AGU, 

AGU,  A 

2.  (a)  [BB]  Why  must  there  always  exist  an  abnormal  frag¬ 

ment? 

(b)  [BB]  Give  a  necessary  and  sufficient  condition  for 
there  to  exist  two  abnormal  fragments. 

3.  [BB]  Find  the  length  of  the  smallest  RNA  chain  with  the 
property  that  its  G-  and  U,C-fragments  are  the  same  as 
those  of  another  RNA  chain.  (Assume  that  these  chains 
contain  no  A's  and  are  of  length  greater  than  1.) 

4.  Consider  the  two  lists: 

•  the  interior  extended  bases, 

•  the  unsplittable  fragments. 

Show  that 

(a)  [BB]  every  entry  on  the  first  list  is  on  the  second 
list: 

(b)  [BB]  the  first  and  last  extended  bases  in  the  entire 
RNA  chain  are  on  the  second  list  but  not  on  the  first; 

(c)  any  item  on  the  second  list  which  is  not  on  the  first 
list  must  be  a  fragment  with  which  the  chain  begins 
or  ends. 

5.  Explain  why  every  RNA  chain  determines  an  Eulerian 
circuit  in  the  corresponding  directed  pseudograph. 


11.4  Tournaments 


11.4.1  DEFINITION 


Pause  9 


In  this  section,  we  consider  another  kind  of  digraph  called  a  tournament,  which 
is  just  a  complete  graph  with  an  orientation. 

A  tournament  is  a  digraph  with  the  property  that  for  every  two  distinct  vertices 
u  and  v,  exactly  one  of  uv,  vu  is  an  arc.  The  score  of  a  vertex  in  a  tournament, 
denoted  s(v),  is  the  outdegree  of  that  vertex.  The  score  sequence  of  a  tournament 
is  the  list  of  outdegrees,  in  nonincreasing  order. 

Suppose  that  a  tournament  T  has  n  vertices,  each  of  which  has  a  different  score. 
What  is  the  score  sequence  of  T? 


Tournaments  arise  in  many  different  situations,  the  most  obvious  giving  rise  to 
their  name.  A  round-robin  competition  in  sport  is  one  in  which  every  competitor 
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Figure  11.13 


11.4.2  THEOREM 


Proof 


plays  every  other  competitor  once.  Such  a  competition  can  be  represented  by  a 
tournament  in  which  the  players  are  vertices  and  an  arc  directed  from  vertex  v  to 
vertex  u  indicates  that  player  v  defeats  player  u.  In  this  context,  the  score  of  a 
vertex  is  the  number  of  wins  achieved  by  that  player  in  the  tournament  and  the 
score  sequence  is  the  sequence  of  wins  by  all  the  players,  arranged  in  decreasing 
order.  An  example  of  a  tournament  involving  five  players  and  with  score  sequence 
3,  3,  2,  2,  0  appears  in  Fig  11.13. 


A 


A  tournament  of  size  five. 

There  are  other  less  obvious  applications  of  tournaments.  In  psychology,  sub¬ 
jects  are  sometimes  presented  with  alternatives  two  at  a  time  and  asked  to  express 
a  preference  for  one  of  them.  Representing  the  various  alternatives  as  the  vertices 
of  a  digraph  and  assuming  all  possible  pairs  of  alternatives  are  presented  to  the 
subject,  then  that  person’s  preferences  can  be  represented  as  a  tournament  in  a 
natural  way. 

Tournaments  arise  in  biology  as  well.  It  is  sometimes  the  case  that  for  every 
pair  of  animals  of  a  certain  species,  one  is  dominant  over  the  other.  This  relation 
defines  a  pecking  order  in  the  species,  and  this  can  be  analyzed  by  means  of 
tournaments. 

It  is  usually  desired  to  rank  the  members  of  a  tournament.  In  the  round-robin 
competition  pictured  in  Fig  11.13,  the  numbers  of  wins  recorded  by  A,  B,  C,  D, 
and  E  are  3,  2,  2,  0,  and  3  respectively.  Since  A  and  E  have  the  best  records,  these 
players  should  finish  ahead  of  the  rest.  Since  E  defeated  A,  perhaps  E  should 
be  ranked  first,  although  A  might  resent  this  since  A  defeated  B,  who  in  turn 
defeated  E.  A  Hamiltonian  path  provides  one  way  of  settling  rankings,  namely, 
rank  v  ahead  of  u  if  vertex  v  occurs  before  vertex  u  in  the  Hamiltonian  path.  Such 
a  ranking  may  seem  reasonable  since  a  player  will  always  have  beaten  the  person 
ranked  immediately  below  him  or  her.  In  Fig  11.13,  for  example,  EACBD  is  a 
Hamiltonian  path  and  E,  A,  C,  B,  and  D  is  the  corresponding  ranking  of  players. 
The  following  theorem,  due  to  the  Hungarian  mathematician  L.  Redei,  tells  us 
that  this  procedure  is  available  in  any  tournament. 

(Redei9)  Every  tournament  has  a  Hamiltonian  path. 

Suppose  T  is  a  tournament  with  n  vertices.  We  proceed  by  induction  on  n.  The 
result  is  trivial  for  n  =  1  and  for  n  —  2  the  only  tournament  is  o — ► — © ,  which 
clearly  has  a  Hamiltonian  path.  So  assume  that  k  >  2  and  that  the  result  is  true 


®L.  Redei,  “Ein  kombinatorischer  Satz.”  Acta  Litterarum  ac  Scientiarum  Szeged  7  (1934),  39-43. 
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Figure  11.14 


11.4.3  DEFINITION 


Figure  11.15 


for  n  =  k.  Let  T  be  a  tournament  with  k  +  1  vertices,  let  u  be  any  vertex  of  T, 
and  let  T\{u)  be  that  subgraph  of  T  obtained  by  deleting  u  (and  all  arcs  incident 
with  u).  This  subgraph  has  k  vertices  and  it  is  a  tournament  since  each  pair  of 
vertices  is  still  joined  by  an  arc.  By  the  induction  hypothesis,  we  conclude  that 
T  \  {/<}  has  a  Hamiltonian  path  v\  V2  ■  ■  ■  v/c .  Now  there  is  an  arc  between  u  and  tq 
(see  Fig  11.14).  If  this  arc  is  uv\  (that  is,  directed  from  u  to  iq),  then  uv\v  2  •  •  • 
is  a  Hamiltonian  path  in  T . 


Suppose,  on  the  other  hand,  that  the  arc  is  v\u  and  consider  the  arc  between 
u  and  ib.  If  this  arc  is  uv 2,  then  v \  u iq  13  •  •  •  is  a  Hamiltonian  path  in  T  and 
again  we  have  the  desired  result.  In  general,  if  there  is  a  minimal  integer  i  >  1 
such  that  the  arc  between  u  and  v,  is  u Vj ,  then  is  an  arc  (by  minimality  of 

;),  and  v\ V2  ■  ■  ■  Vi-\uv /  •  •  •  is  a  Hamiltonian  path  (see  Fig  1 1.14(b)).  If  there  is 
no  such  minimal  /,  all  arcs  between  u  and  Vj  are  of  the  type  VjU  and  vjib  •  •  •  v^u 
is  a  Hamiltonian  path.  In  any  case,  there  is  a  Hamiltonian  path.  I 

Unfortunately,  there  is  no  reason  why  a  Hamiltonian  path  in  a  tournament 
should  be  unique,  so  disputes  over  ranking  can  still  exist.  In  our  earlier  example 
concerning  the  tournament  in  Fig  11.13,  for  instance,  ABECD  is  another  Hamil¬ 
tonian  path. 

Find  a  third  Hamiltonian  path. 

The  point  is  that  with  more  than  one  Hamiltonian  path,  there  is  no  unique 
ranking  of  the  players.  Certain  tournaments,  however,  do  have  unique  Hamiltonian 
paths. 

A  tournament  is  transitive  if  and  only  if  whenever  uv  and  vw  are  arcs,  then  uw 
is  also  an  arc. 


No  3-cycle  can  appear  in  a  transitive  tournament. 

Transitive  tournaments  are  precisely  those  which  have  no  subtoumaments  of 
the  form  illustrated  in  Fig  11.15.  For  instance,  the  tournament  in  Fig  1 1.13  is  not 
transitive  because  ABEA  is  a  3-cycle  like  that  in  Fig  11.15. 

It  turns  out  that  we  can  characterize  those  tournaments  with  unique  Hamilto¬ 
nian  paths  in  terms  of  the  concepts  of  transitivity  and  scores. 
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11.4.4  THEOREM 


The  following  properties  of  a  tournament  T  are  equivalent. 


(1)  T  has  a  unique  Hamiltonian  path. 

(2)  T  is  transitive. 

(3)  Every  player  in  T  has  a  different  score. 


Proof  We  show  (1)  -*  (2),  that  a  tournament  with  a  unique  Hamiltonian  path  must  be 
transitive,  and  leave  the  rest  of  the  proof  to  the  Exercises  (Exercise  9).  So  assume 
that  T  has  a  unique  Hamiltonian  path  and  order  the  vertices  so  that  this  path 
is  vn-\ v„-2  ■  ■  ■  vo.  It  is  sufficient  to  show  that  vj Vj  is  an  arc  whenever  j  >  i. 
Suppose,  by  way  of  contradiction,  that  this  is  not  the  case.  Then  there  must  be 
some  arc  vivj  with  j  >  i.  (See  Fig  11.16.) 


Figure  11.16 


Choose  /  as  small  as  possible  such  that  an  arc  of  this  type  exists  (for  some  j) 
and,  with  that  i  fixed,  choose  j  as  large  as  possible  such  that  the  arc  vtVj  exists. 
If  j  n  —  1  and  i  ^  0  (as  depicted  in  the  figure),  then 

vn-\  ■  ■  ■  vj+\VjVjVj-\  ■  •  •  Vi+iVi-i  ■■■vo 

is  a  second  Hamiltonian  path.  If  /  =  0  and  j  ^  n  —  1,  then 


Vn-\  ■  ■  ■  Vj  +  \V0VjVj-\  ■■  ■  V\ 

is  a  second  Hamiltonian  path.  If  j  —  n  —  1  and  i  ^  0, 

ViVn-\v„-2  ■  ■  ■  n,+iu,_i  •  •  •  vo 

is  a  second  Hamiltonian  path  and.  if  i  =  0  and  j  =  n  —  1,  then 


V0Vn-]V„-2  ■‘■Vl 


is  a  second  Hamiltonian  path.  In  all  cases,  we  contradict  the  uniqueness  of  the 
Hamiltonian  path.  I 

Since  we  don’t  need  graph  theory  to  rank  n  players  all  of  whose  scores  are 
different,  the  import  of  Theorem  1 1 .4.4,  sadly,  is  that  Hamiltonian  paths  are  not 
very  helpful  in  ranking  tournaments. 


Answers  to  Pauses 


In  a  tournament  of  n  players  vo,  r>i ,  . . .  ,  v„_i  suppose  the  score  of  player  vt  is  i. 

Find  a  Hamiltonian  path. 

9.  Since  each  player  in  a  tournament  of  n  players  can  achieve  at  most  n  —  1 
victories,  the  score  of  each  vertex  is  at  most  n  —  1.  The  score  of  each  vertex 
is  nonnegative,  so  any  score  s  satisfies  0  <  5  <  n  —  1.  The  only  set  of  n 
distinct  scores  in  this  range  is  {0,  1, 2, . . .  ,  n  —  1},  so  the  score  sequence  is 
n  —  1,  n  —  2, . . .  ,  2,  1,  0. 

10.  AC  BED  is  a  third  Hamiltonian  path  in  Fig  11.13. 
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11.  Player  v„ _  i  wins  all  n  —  1  games;  so  there  is  an  arc  from  u„_i  to  each  other 
Vi,  in  particular  to  u„_2.  Player  u„_2  wins  n  —  2  games  (losing  only  to  vn-\) 
so  there  is  an  arc  from  i>„_2  to  each  of  vo,  v\, .. .  ,  vn -3.  Continuing  in  this 
way,  we  see  that  there  is  an  arc  from  ry  to  w,  1  for  each  1  <  i  <  n  —  1. 
Thus,  v„_i  u„__2  •  •  •  vo  is  a  path  and,  since  it  contains  all  vertices,  Hamiltonian. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  Draw  all  nonisomorphic  tournaments  with 

three  vertices  and  give  the  score  sequence  of  each. 
Which  of  these  are  transitive? 

(b)  Repeat  part  (a)  for  tournaments  with  four  vertices. 

2.  [BB]  Let  A  be  the  adjacency  matrix  of  a  tournament. 
Describe  A  +  AT  and  explain. 

3.  (For  students  of  linear  algebra) 

(a)  Prove  that  A  is  the  adjacency  matrix  of  a  tournament 
with  n  players  if  and  only  A  +  AT  =  J  —  /,  where 
I  is  the  n  x  n  identity  matrix  and  J  is  the  n  x  n 
matrix  consisting  entirely  of  1  ’s. 

(b)  If  A  is  the  adjacency  matrix  of  a  tournament,  so  is 
its  transpose.  Why? 

(c)  Let  P  be  an  n  x  n  permutation  matrix,  that  is,  a 
matrix  obtained  from  the  n  x  n  identity  matrix  by 
rearranging  its  rows.  If  A  is  the  adjacency  matrix  of 
a  tournament,  so  is  PAP1.  Why? 

4.  (a)  Find  all  Hamiltonian  paths  in  the  tournament  of 

Fig  11.13. 

(b)  Suppose  the  arc  in  Fig  11.13  is  DB,  not  BD.  Find 
all  Hamiltonian  paths  in  this  tournament. 

5.  Suppose  .si ,  52, . . .  ,  5„  are  the  scores  in  a  tournament. 

(a)  [BB]  Prove  that  J21=\  si  =  Xa=i(w  ~  1  —  si)  and 
interpret  this  result  in  the  context  of  a  round-robin 
tournament. 

(b)  Prove  that  J21=  1  sf  —  \(n  ~  1  —  5,)2  and  inter¬ 

pret  this  result  in  the  context  of  a  round-robin  tour¬ 
nament. 

6.  [BB]  Let  T  be  a  tournament  and  let  v  be  any  vertex  of 
T  having  maximum  score.  Show  that  the  length  of  the 
shortest  path  from  v  to  any  other  vertex  of  T  is  1  or  2. 


7.  (a)  [BB]  Could  3,  2,  1.  1  be  the  score  sequence  of  a 

tournament? 

(b)  Could  6,  6,  6,  6,  1,  L  1,  1  be  the  score  sequence  of 
a  tournament? 

8.  Let  S  be  the  set  of  scores  in  a  tournament  T  with  n 
vertices. 

(a)  Prove  that  =  ^n(n  —  1). 

seS 

(b)  Let  t  be  a  natural  number  less  than  n  and  let 
5] ,  52,  •  •  •  -5/  E  S. 

t 

Prove  that  ^Ps,-  >  \t(t  —  1). 

;=i 

[Remark:  H.  G.  Landau  has  proved  that  the  two  condi¬ 
tions  in  (a)  and  (b)  are  sufficient  as  well  as  necessary 
for  the  existence  of  a  tournament  with  a  prescribed  set 
of  scores.10  ] 

9.  Complete  the  proof  of  Theorem  1 1 .4.4  as  follows. 

(a)  Prove  that  the  scores  in  a  transitive  tournament  of 
n  vertices  are  all  different.  Thus,  (2)  ->  (3). 

(b)  Suppose  every  player  in  a  tournament  T  has  a  dif¬ 
ferent  score.  Show  that  T  has  a  unique  Hamiltonian 
path.  Thus,  (3)  —*■  (1).  [Hint:  Pauses  9  and  11] 

10.  Find  all  Hamiltonian  paths. 


11.  [BB;  — ►]  Prove  that  a  tournament  is  transitive  if  and 
only  if  it  contains  no  cycles. 


1,1  H.  G.  Landau,  "On  Dominance  Relations  and  the  Structure  of  Animal  Societies,  III;  The  Condition  for  a  Score  Structure,"  Bulletin  of  Mathematical 
Biophysics  15  (1955),  143-148. 
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11.5  Scheduling  Problems 

The  shortest  path  algorithms  for  weighted  graphs  discussed  in  Section  10.4  have 
many  obvious  applications.  Often,  the  weight  on  an  edge  represents  distance  (or 
cost)  and  the  shortest  path  is  just  the  shortest  (or  cheapest)  route  between  the 
distinguished  points.  There  is  also  a  large  area  of  application  to  weighted  digraphs, 
where  the  arcs  represent  tasks  which  must  be  performed  in  a  certain  order  and  the 
weight  of  an  arc  represents  the  time  required  to  carry  out  the  task.  The  problem 
in  which  we  are  interested  is  to  determine  the  order  in  which  the  tasks  should  be 
undertaken  in  order  that  they  all  be  completed  in  the  shortest  possible  time.  We 
present  two  types  of  such  "scheduling  problems"  in  this  section. 

We  assume  throughout  that  the  weights  of  arcs  in  digraphs  are  integers.  Such 
digraphs  are  also  called  directed  networks. 


11.5.1  DEFINITION 

11.5.2  A  TYPE  I 
SCHEDULING 
PROBLEM 


A  directed  network  is  a  digraph  with  an  integer  weight  attached  to  each  arc. 


The  construction  of  a  fence  involves  four  tasks:  setting  posts  ( Po ),  cutting  wood 
(C),  painting  (Pa),  and  nailing  (TV).  Setting  posts  must  precede  painting  and 
nailing,  and  cutting  must  precede  nailing.  Suppose  that  setting  posts  takes  three 
units  of  time,  cutting  wood  takes  two  units  of  time,  painting  takes  five  units  of  time 
for  uncut  wood  and  four  units  of  time  otherwise,  and  nailing  takes  two  units  of  time 
for  unpainted  wood  and  three  units  of  time  otherwise.  In  what  order  should  these 
tasks  be  carried  out  in  order  to  complete  the  project  in  the  shortest  possible  time? 


This  is  a  problem  which  can  be  easily  analyzed.  There  are  five  ways  in  which 
the  fence  can  be  completed: 

Po—C—N—Pa 

Po—Pa—C—N 

Po—C—Pa—N 

C—Po—N—Pa 

C—Po—Pa—N 

and  it  is  a  simple  matter  to  compute  the  time  for  each.  We  can  also  illustrate  the 
situation  with  the  directed  network  shown  in  Fig  11.17.  The  arcs  in  the  digraph 
represent  tasks  and  the  vertices  represent  stages  in  the  process.  The  direction  of  an 
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arc  indicates  passage  from  one  stage  to  another.  For  instance,  the  arc  with  weight 
2  from  D  to  F  represents  nailing,  while  the  two  arcs  leaving  vertex  A  represent 
cutting  and  painting.  Vertex  E  represents  the  stage  where  all  tasks  except  nailing 
have  been  completed. 

A  path  corresponds  to  a  certain  order  of  tasks;  thus,  the  three  different  paths 
from  5  to  E  represent  three  different  ways  to  cut,  paint,  and  set  posts.  Note  that 
the  weight  given  to  an  arc  may  depend  on  which  tasks  precede  it;  for  example, 
arcs  DF  and  ET  each  represent  nailing,  but  they  have  different  weights. 

The  shortest  path  from  S  to  T  shows  us  how  to  complete  the  project  in  the 
least  amount  of  time.  Applying  Dijkstra's  algorithm  (first  version)  to  the  digraph, 
we  obtain  the  labels  shown  in  Fig  11.18. 


Figure  11.18 


C(A,  8) 


One  shortest  path  is  SADFT ,  completing  the  job  in  1 1  units  of  time.  This  path 
corresponds  to  first  setting  posts,  then  cutting,  then  nailing,  and  finally  painting. 
Notice  that  there  was  a  choice  in  the  labeling  of  vertex  D:  it  could  equally 
well  have  been  labeled  (B,  5).  There  is,  therefore,  a  second  shortest  path  which 
completes  the  job  in  11  units  of  time,  namely,  SBDFT.  So  we  could  equally 
well  cut,  then  set  posts,  then  nail,  and  finally  paint. 

The  procedure  just  described  is  important  in  the  management  sciences  and  has 
been  thoroughly  studied.  Methods  for  solving  such  networks  are  often  called  crit¬ 
ical  path  methods  (CPMs)  or  project  evaluation  and  review  techniques  (PERTs), 
and  a  great  deal  of  specialized  terminology  has  been  developed.  For  example,  a 
shortest  path  such  as  SADFT  (or  SBDFT)  is  called  a  critical  path. 

In  the  scheduling  problem  discussed  previously,  one  task  had  to  be  completed 
before  another  could  start.  Such  would  be  the  case  in  a  one-person  project,  for 
example.  Also,  the  time  required  for  a  task  depended  upon  what  tasks  had  already 
been  completed.  There  are  other  situations,  however,  where  several  different  tasks 
can  be  pursued  simultaneously  and  where  the  time  of  a  given  task  is  independent 
of  the  other  tasks.  Think  of  the  construction  of  a  house  or  office  building.  Such  a 
project  often  requires  the  services  of  a  large  number  of  people.  While  the  order 
of  tasks  is  sometimes  important  (plumbing  before  carpets!),  sometimes  it  isn't 
(plumbing  and  electrical  work).  Also,  and  unlike  building  a  fence,  in  a  large 
construction  project  the  times  of  the  tasks  are  often  determined  only  by  the  nature 
of  the  task. 

As  before,  our  aim  is  to  plan  the  project  so  as  to  minimize  the  time  required. 
While  strictly  speaking,  the  procedure  we  outline  is  not  a  shortest  path  method, 
it  does  fit  naturally  into  this  section. 
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11.5.3  A  TYPE  II 
SCHEDULING 
PROBLEM 


In  order  to  finish  a  basement,  a  contractor  must  arrange  for  certain  tasks  to  be 
completed.  These  are  shown  in  Table  11.5  together  with  brief  codes  for  referring 
to  them  and  the  times  required  for  each. 


Table  11.5 

Task 

code 

Time 

Floor  installation 

F 

4 

Plumbing 

PI 

3 

Electrical  work 

E 

5 

Wallboard 

IT 

3 

Varnish  doors  and  moldings 

V 

1 

Paint 

Pa 

2 

Install  doors  and  moldings 

D 

1 

Lay  carpet 

C 

2 

Figure  11.19 


T 


We  represent  this  problem  by  the  digraph  in  Fig  11.19.  This  time  the  vertices 
represent  tasks  and  an  arc  from  u  to  v  means  that  task  u  precedes  task  v.  It  is 
necessary  to  install  the  floor  (F)  before  any  other  tasks  can  commence.  The  elec¬ 
trical  work  ( E )  and  plumbing  (PI)  must  both  be  completed  before  the  wallboard 
can  go  in,  but  neither  the  plumbing  nor  the  electrical  work  is  a  prerequisite  for  the 
other;  these  two  tasks  can  occur  simultaneously.  The  order  in  which  other  tasks 
are  carried  out  is  shown  in  the  digraph.  Note  that  while  the  electrical  work  must 
precede  the  painting,  we  do  not  show  an  arc  from  E  to  Pa  since  this  precedence 
can  be  inferred  from  the  arcs  EW  and  IT  Pa. 

The  weight  assigned  to  an  arc  is  the  time  it  takes  to  complete  the  task  repre¬ 
sented  by  the  terminal  vertex  of  the  arc.  Both  EW  and  PIW  are  assigned  a  weight 
of  3  since  both  E  and  PI  must  be  completed  before  W  can  commence  and  IT 
requires  three  units  of  time.  The  last  two  arcs,  leading  to  the  finish  of  the  project, 
are  assigned  a  weight  of  0  because  the  job  is  completed  once  D  and  C  are  finished. 

When  the  job  is  completed,  all  paths  from  start  to  finish  will  have  been  fol¬ 
lowed.  Thus,  in  this  situation,  it  is  the  time  required  for  the  longest  path  which  is 
the  shortest  time  for  the  entire  project.  To  find  this  path,  we  modify  in  two  ways 
the  first  version  of  Dijkstra’s  algorithm.  No  vertex  is  labeled  until  all  prerequisite 
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vertices  have  been  labeled  and  each  vertex  is  always  labeled  with  the  largest 
value  of  d  +  w(e).  The  final  labels  in  our  example  are  shown  in  Fig  11.20. 

(S.  4)  (F,  7)  (£,  12)  (W.  14)  (Pa,  15) 

4  F  3  PI  3  W  2  Pa  l  D  o 
"  - ► - —p  T(C,  16) 

2  X 

Figure  11.20  E(F,  9)  1  V(E,  10)  C(Pa,  16) 

The  job  can  be  completed  in  16  units  of  time,  and  FEW  PaC  is  the  critical 
path.  Note  that  the  tasks  not  on  the  critical  path  (PI,  V,  and  D)  can  be  delayed 
somewhat  without  delaying  the  project.  For  example,  plumbing  could  take  five 
time  units  instead  of  three  and  the  label  (E,  12)  for  W  would  still  be  correct:  The 
critical  path  would  not  be  affected.  Varnishing  could  take  as  much  as  six  units  of 
time  without  delaying  the  project  because,  if  this  were  to  occur,  only  the  label  for 
D  would  change,  to  (V ,  16):  The  project  could  still  be  completed  in  16  units  of 
time.  On  the  other  hand,  if  more  than  six  units  of  time  were  spent  on  varnishing, 
the  project  would  be  delayed.  This  extra  6—1=5  units  of  time  which  could  be 
spent  on  V  is  called  the  slack  of  the  task  V.  As  we  have  seen,  the  slack  of  PI 
is  2  and  the  slack  of  D  is  1.  Knowledge  of  these  values  allows  the  contractor  to 
complete  the  project  more  quickly  by  transferring  workers  to  other  tasks  on  the 
critical  path. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Solve  the  fence  problem  described  in  paragraph 
11.5.2,  assuming  that  the  setting  of  posts  takes  three 
units  of  time  if  the  wood  has  not  yet  been  cut  and  four 
units  of  time  otherwise.  (All  other  aspects  of  the  project 
are  unchanged.) 

2.  The  construction  of  a  certain  part  in  an  automobile 
engine  involves  four  activities:  pouring  the  mold,  cal¬ 
ibration,  polishing,  and  inspection.  The  mold  is  poured 
first;  calibration  must  occur  before  inspection.  Pouring 
the  mold  takes  eight  units  of  time,  calibration  takes  three 
units,  polishing  takes  six  units  of  time  for  an  uncali¬ 
brated  product  and  eight  units  of  time  for  a  calibrated 
one,  and  inspection  takes  two  units  of  time  for  a  pol¬ 
ished  product  and  three  units  of  time  for  an  unpolished 
one. 

(a)  What  type  of  scheduling  problem  is  this  (type  I  or 
II)?  Represent  the  paths  to  completion  of  this  job 
by  drawing  the  appropriate  directed  network. 


(b)  What  is  the  shortest  time  required  for  this  job? 
Describe  the  critical  path. 

3.  In  Exercise  2,  suppose  polishing  takes  nine  units  of  time 
for  an  uncalibrated  product.  (All  other  conditions  remain 
the  same.)  Find  the  new  critical  path  and  the  length  of 
the  revised  project. 

4.  Repeat  Exercise  2,  assuming  that,  in  addition  to  the  tasks 
stated,  the  workers  will  take  a  coffee  break.  The  cof¬ 
fee  break  must  take  place  after  calibration  and  before 
inspection.  If  it  occurs  before  polishing,  it  requires  two 
time  units;  after  polishing,  it  requires  six  time  units. 

5.  [BB]  Change  the  basement  project  described  in  para¬ 
graph  1 1 .5.3  by  supposing  that 

•  plumbing  requires  four  units  of  time; 

•  the  doors  and  moldings  will  be  installed  before  they 
are  varnished; 

•  the  carpets  cannot  be  installed  until  the  doors  and 
moldings  have  been  varnished;  and 
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•  there  are  kitchen  cabinets  to  be  installed  in  one  unit 
of  time  after  painting. 

Draw  the  appropriate  directed  network  which  depicts  the 
paths  to  completion  of  this  revised  project.  Describe  all 
critical  paths  and  the  slack  in  each  task. 

6.  [BB]  Suppose  we  want  to  build  a  Klein  bottle.  The  tasks 
involved  in  this  project  and  the  time  required  for  each 
are  recorded  in  the  following  table. 


Code 

Tasks 

Time  (in  months) 

G 

blow  glass 

7 

Po 

polish 

5 

Sh 

shape 

9 

A 

add  hole 

4 

H 

affix  handles 

5 

Tiv 

twist 

2 

R 

remove  hole 

4 

Pa 

paint 

6 

The  glass  must  be  blown  before  anything  else.  Polishing 
precedes  all  but  G  and  Sir,  shaping  precedes  all  but  G 
and  Po.  The  hole  must  be  added  before  it  is  removed 
(!)  and  before  the  bottle  is  twisted.  The  handles  go  on 
before  T w  and  Pa.  Once  T w,  R,  and  Pa  are  completed, 
the  project  is  finished. 

(a)  What  type  of  scheduling  problem  is  this  (type  I  or 
II)?  Draw  the  appropriate  directed  network. 

(b)  Find  all  critical  paths  for  this  project. 

(c)  Find  the  slack  of  Po,  Sh,  A,  T w,  and  R  for  each 
of  the  critical  paths  found  in  (b). 

(d)  Suppose  that  tasks  Po  and  A  are  each  delayed 
three  months.  Will  the  completion  of  the  project  be 
delayed?  Explain. 

7.  Repeat  Exercise  6  if,  in  addition  to  all  the  conditions 
stated  there,  A  must  also  precede  H . 

8.  Repeat  Exercise  6  if  A  takes  six  months  to  complete 
instead  of  four  and  R  takes  seven  months  to  complete 
instead  of  four. 

9.  What's  a  Klein  bottle?  Find  out  what  you  can  about  this 
thing  and  write  a  short  note. 

10.  [BB]  To  complete  her  Master's  thesis,  a  geology  stu¬ 
dent  must  perform  field  work  (F)  and  laboratory  anal¬ 
ysis  (L),  conduct  a  library  search  ( S )  of  the  literature, 
create  a  database  of  relevant  articles  (£>),  and  write  the 
thesis  (W).  The  write-up  cannot  begin  until  all  tasks 
except  for  the  laboratory  analysis  are  complete.  It  takes 
three  units  of  time  if  it  is  the  last  task  and  otherwise  five 
units  of  time.  The  field  work,  which  takes  four  units  of 


time,  must  precede  both  the  laboratory  analysis  and  the 
database  creation.  The  library  search  takes  two  units  of 
time.  The  laboratory  analysis  takes  two  units  of  time  if 
it  is  undertaken  after  the  library  search  and  otherwise 
three  units  of  time.  Creation  of  the  database  takes  one 
unit  of  time  if  it  is  delayed  until  the  library  search  is 
complete  and  otherwise  four  units  of  time. 

(a)  What  type  of  scheduling  problem  is  this  (I  or  II)? 
Draw  the  appropriate  directed  network. 

(b)  What  is  the  shortest  possible  time  in  which  this  stu¬ 
dent  can  complete  her  thesis? 

(c)  In  what  order  should  she  perform  her  thesis-related 
tasks  in  order  to  achieve  this  minimum  time? 

11.  Fred  and  Wilma  Noseworthy  are  planning  a  dinner  party. 
The  things  they  must  do  before  their  guests  arrive  and 
the  time  required  for  each  are  shown  in  the  following 
table.  The  fish  must  be  caught  and  the  wine  purchased 
before  the  table  is  set.  The  Noseworthys  are  not  fond 
of  raw  fish  or  raw  vegetables  and  do  not  use  wine  in 
their  cooking.  The  table  must  be  dusted  before  it  can  be 
set.  Vacuuming  is  never  done  until  the  table  is  set.  For¬ 
tunately,  Fred  and  Wilma  have  the  full  support  of  their 
student  daughter  and  son,  who  are  always  very  willing 
to  help  their  parents  with  whatever  jobs  have  to  be  done 
(so  up  to  four  jobs  can  be  done  simultaneously).  The 
entire  family  greets  their  guests  together  after  all  tasks 
have  been  completed. 


Dust  house 

D 

3 

Vacuum  house 

V 

2 

Set  table 

T 

1 

Buy  wine 

W 

4 

Catch  fish 

F 

6 

Pick  vegetables 

P 

2 

Cook  food 

C 

4 

Greet  guests 

G 

2 

(a)  What  type  of  scheduling  problem  is  this  (I  or  II)? 
Draw  the  appropriate  directed  network. 

(b)  What  is  the  shortest  possible  time  in  which  din¬ 
ner  preparations  can  be  accomplished?  Describe  the 
critical  path  and  illustrate  with  a  directed  network, 
showing  all  labels. 

(c)  Find  the  slack  in  IV,  C,  and  D. 

12.  Repeat  the  previous  problem  under  the  additional  assum¬ 
ption  that  wine  is  used  in  the  cooking. 

13.  Before  going  to  school,  John  must  take  a  shower,  get 
dressed,  eat  breakfast,  and  finish  his  math  assignment. 
John  will  take  his  shower  before  getting  dressed  or  eating 


Review  Exercises  365 


breakfast  and  he’ll  get  dressed  before  finishing  his  assi¬ 
gnment,  but  there  are  no  other  restrictions.  Taking  a 
shower  requires  9  minutes;  getting  dressed  takes  9  min¬ 
utes  if  John  has  first  eaten  breakfast  but  otherwise  12 
minutes.  Finishing  his  assignment  takes  12  minutes  if 
John  has  eaten  breakfast,  but  18  minutes  otherwise.  Eat¬ 
ing  breakfast  takes  9  minutes  if  John  has  finished  his 
assignment  and  18  minutes  otherwise. 


(a)  What  type  of  scheduling  problem  is  this  (I  or  II)? 
Draw  the  appropriate  directed  network. 

(b)  Find  all  possible  ways  in  which  John  can  carry  out 
his  tasks  before  school  so  that  he  can  leave  in  the 
shortest  possible  time. 

14.  Repeat  Exercise  13,  removing  the  restriction  that  John 
must  dress  before  finishing  his  assignment. 


REVIEW  EXERCISES  FOR  CHAPTER  11 


1.  Solve  the  Chinese  Postman  Problem  for  the  weighted 
graph  given  in  Exercise  17  of  the  Chapter  10  Review 
Problems. 

2.  Solve  the  Chinese  Postman  Problem  for  the  unweighted 
graph  £3,6. 

3.  Explain  how  to  solve  the  Chinese  Postman  Problem  for 
a  graph  Q  which  has  an  Eulerian  trail. 

4.  Let  Q  be  a  strongly  connected  digraph  with  the  property 
that  no  vertex  has  indegree  equal  to  its  outdegree.  Prove 
that  at  least  two  vertices  have  indegrees  greater  than 
their  outdegrees. 

5.  (a)  Find  two  different  orientations  on  the  edges  of  K4 

which  lead  to  nonisomorphic  digraphs. 

(b)  Find  two  different  orientations  on  the  edges  of  AC4 
which  lead  to  isomorphic  digraphs. 

6.  If  A  is  the  adjacency  matrix  of  a  digraph  and  A 2  =  [bjj], 
find  5  bu  and  explain. 

7.  (a)  Show  that  Dijkstra’s  algorithm  fails  and  explain 

why.  Start  at  vertex  A. 

B  2  C 


(b)  Apply  the  Bellman-Ford  algorithm  to  the  digraph. 
Your  answer  should  make  clear  that  you  understand 
this  algorithm  by  showing  how  the  labels  on  vertices 
change  as  the  algorithm  proceeds. 

8.  Find  all  RNA  chains  with  the  given  complete  enzyme 
digests. 


(a) 

G-fragments: 

U,C-fragments: 

(b) 

G-fragments: 

U,C-fragments: 

(c) 

G-fragments: 

U,C-fragments: 

G,  AUG.  UCAG,  CCUG 
G,  GGAU,  C,  C,  U,  AGC,  U 
G,  G,  AUU,  UCG,  G,  UG,  G 
GGGGU,  C,  U,  GAU,  GU 
UG,  CUAAG,  AG,  CG,  UCG 
GU,  AAG,  GAGC,  GC,  U,  U,  C 


9.  Let  n  >  3  and  assume  that  the  complete  graph  has 
vertices  labeled  wi,  u>2, ...  ,  w„  and  arc  WjWj  directed 
Wj  to  Wj  if  and  only  if  i  <  j.  Show  that  the  tournament 
on  this  digraph  is  transitive. 


10.  (a)  Could  5,  5,  4,  3.  2,  2,  2  be  the  score  sequence  of  a 
tournament? 

(b)  Could  8,  8,  8,  6,  6,  3,  3,  1,  1,  1  be  the  score  sequence 
of  a  tournament? 


11.  Let  T  be  a  tournament  and  let  v  be  any  vertex  of  T  hav¬ 
ing  minimum  score.  Show  that  the  length  of  the  shortest 
path  from  any  other  vertex  to  v  is  1  or  2. 

12.  In  a  round-robin  tennis  tournament,  we  know  that  Alice 
is  the  clear  winner.  George  is  the  clear  loser,  and  all 
other  players  are  tied  for  second.  Show  that  the  number 
of  players  competing  must  have  been  odd. 

13.  George  has  to  do  four  things  before  his  mathematics 
exam  tomorrow  morning:  study,  sleep,  eat  pizza,  and 
check  his  e-mail.  He  intends  to  spend  two  different 
periods  of  time  studying,  three  hours  on  each  occa¬ 
sion.  While  the  sessions  could  be  back  to  back,  at  least 
one  session  must  occur  before  he  sleeps.  Eating  pizza 
requires  one  hour,  but  it  also  takes  an  hour  for  the  order 
to  arrive  (and  something  else  could  be  done  during  this 
interval).  Eating  pizza  must  precede  sleeping  and  also 
at  least  one  of  the  study  sessions.  George  will  get  six 
hours  of  sleep  if  he  sleeps  after  all  study  has  been  com¬ 
pleted,  but  he’s  willing  to  settle  for  five  hours  otherwise. 
If  George  checks  his  e-mail  between  study  sessions,  he 
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will  do  so  quickly  and  take  only  one  hour;  otherwise, 
he  will  spend  two  hours  at  this  activity. 

(a)  Find  the  shortest  possible  time  in  which  George  can 
complete  his  activities,  and  a  way  in  which  he  can 
order  them  and  complete  them  in  this  time. 

(b)  Assume  that  George  decides  to  read  his  e-mail 
before  eating  his  pizza.  Does  this  change  the  answer 
to  (a)?  If  so,  how? 

(c)  Assume  that  George  decides  to  read  his  e-mail 
before  ordering  his  pizza.  Does  this  change  the 
answer  to  (a)?  If  so,  how? 

14.  The  following  chart  lists  a  number  of  tasks  that  must  be 
completed  in  order  for  a  crew  of  workers  to  construct  a 
glynskz. 


Task 

ABCDEFGHIJ 

Time 
(in  days) 

2231  123433 

Task  A  must  be  carried  out  before  any  other  tasks  can 
commence.  Task  B  must  precede  tasks  E  and  F,  and 
both  E  and  F  must  be  completed  before  H  can  begin. 
Tasks  C  and  D  must  precede  task  G,  which  in  turn  must 
precede  /.  Task  J  must  be  carried  out  last.  It  is  assumed 
that  there  are  enough  workers  to  carry  out  any  number 
of  tasks  simultaneously. 

(a)  What  is  the  fewest  number  of  days  needed  to  con¬ 
struct  this  glynskz?  Find  all  critical  paths. 

(b)  Find  the  slack  in  C,  D,  and  E. 
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12.1  What  Is  a  Tree? 

One  of  the  most  commonly  occurring  kinds  of  graph  is  called  a  tree,  perhaps 
because  it  can  be  drawn  so  that  it  looks  a  bit  like  an  ordinary  tree.  In  this  section, 
we  introduce  this  concept  with  a  variety  of  examples. 


System 


Root 


Cadd 


-o  Butch 

-O  Utilities 

_0  Figures 
-O  Exercises 
-©  Solutions 


Discrete 


Book 


Text 

Processing 


Graph 


— O 
— G 


-o  Courses 
-©  Letters 


Chapters 
1 

to 
12 


Figure  12.1  A  partial  computer  directory  structure. 


The  operating  system  on  a  computer  organizes  tiles  into  directories  and  subdi¬ 
rectories  in  the  same  way  that  people  gather  together  pieces  of  paper  into  folders 
and  put  them  into  the  drawers  of  a  filing  cabinet.  The  computer  on  which  this 
text  was  first  prepared  had  a  directory  called  “Book”  partitioned  into  two  sub¬ 
directories  called  “Discrete”  and  “Graph,”  each  in  turn  containing  a  number  of 
subdirectories,  one  for  each  chapter.  Each  chapter  subdirectory  had  one  file  for 
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each  section  in  that  chapter.  Part  of  the  organization  of  these  computer  files  is 
depicted  in  Fig  12.1 . 

Suppose  you  wanted  to  write  down  all  the  increasing  subsequences  which  can 
be  formed  from  the  sequence  4,  8.  5,  0,  6,  2:  4,  5,  6  is  one  such  subsequence,  as 

is  0,  2,  but  not  2,  4  because  it  is  not  a  subsequence  of  the  given  sequence.  The 
order  of  the  elements  in  a  subsequence  must  be  the  same  as  in  the  sequence 
itself.  To  ensure  that  no  subsequences  are  inadvertently  omitted,  some  systematic 
method  of  enumeration  is  required.  The  graph  drawn  in  Fig  12.2  indicates  such  a 
method.  In  the  figure,  the  desired  subsequences  correspond  to  paths  from  “start” 
to  each  vertex. 


List  all  the  monotonically  increasing  subsequences  of  the  sequence  4,  8,  5,  0,  6,  2. 


A 


Figure  12.3 


Suppose  the  vertices  of  the  graph  shown  in  Fig  12.3  represent  towns,  the 
edges  roads,  and  the  label  on  a  road  gives  the  time  it  takes  to  travel  the  road.  Is  it 
possible  for  a  salesperson  to  drive  to  all  the  towns  and  return  to  his  or  her  starting 
point  having  visited  each  town  exactly  once?  If  it  is,  what  is  the  shortest  distance 
required  for  such  a  trip?  In  Section  10.4,  we  introduced  this  “Traveling  Salesman's 
Problem”  and  mentioned  that  there  is  no  efficient  algorithm  known  for  solving 
it.  In  reasonably  small  graphs,  of  course,  we  can  exhaustively  enumerate  all  the 
possibilities.  Figure  12.4  indicates  how  we  can  use  a  graph  to  search  systematically 
for  Hamiltonian  cycles,  if  they  exist.  We  assume  that  the  salesperson  starts  at  A. 
The  graph  makes  it  apparent  that  there  are  in  fact  12  Hamiltonian  cycles  in  the 
graph  of  Fig  12.3. 

Answer  the  Traveling  Salesman's  Problem  for  the  graph  of  Fig  12.3.  Find  a  short¬ 
est  Hamiltonian  cycle  and  its  length. 

John  and  David  are  going  to  play  a  few  games  of  chess.  They  agree  that  the 
first  person  to  win  two  games  in  a  row  or  to  win  a  total  of  three  games  will  be 
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A 


Figure  12.4  A  hunt  for  Hamiltonian  cycles  in  the  graph  of  Fig  12.3. 

declared  the  winner.  The  possible  outcomes  can  be  conveniently  pictured  with  the 
graph  shown  in  Fig  12.5. 


Pause  3 


How  many  outcomes  are  possible?  What  is  the  maximum  number  of  games  John 
and  David  will  play? 


With  the  exception  of  that  in  Fig  12.3,  all  the  graphs  which  have  appeared  in 
this  section  are  trees. 


12.1.1  DEFINITION 


A  tree  is  a  connected  graph  which  contains  no  circuits. 


There  are  several  alternative  definitions. 


12.1.2  PROPOSITION 


Let  Q  be  a  graph.  Then  the  following  statements  are  equivalent. 

(1)  Q  is  a  tree. 

(2)  Q  is  connected  and  acyclic,  that  is,  without  cycles. 

(3)  Between  any  two  vertices  of  Q  there  is  precisely  one  path. 


Proof  (i)  — >  (2):  Since  a  cycle  is  a  circuit,  statement  (1)  immediately  implies  state¬ 
ment  (2). 
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(2)  -»•  (3):  If  there  are  two  different  paths  V\.  V2  from  some  vertex  v  to 
another,  w,  then  the  closed  walk  from  v  to  v  obtained  by  following  the  vertices  of 
V]  and  then  those  of  V2  in  reverse  order  would  contain  a  cycle,  contradicting  (2). 
(See  Exercise  12.) 

(3)  — ^  (1):  Since  there  is  a  path  between  any  two  vertices  of  Q,  certainly  Q 
is  connected.  Moreover,  it  can  contain  no  circuits.  Otherwise,  it  would  contain  a 
cycle  (Exercise  14,  Section  10.1)  and  a  cycle  determines  two  paths  between  any 
two  vertices  of  it. 

The  sequence  of  implications  (1)  —*■  (2)  —*■  (3)  — ►  (1)  just  established  proves 
the  equivalence  of  (1),  (2),  and  (3). 

I 


12.1.3  DEFINITION 


A  tree  is  rooted  if  it  comes  with  a  specified  vertex,  called  the  root. 


This  may  not  seem  like  much  of  a  definition,  but  the  concept  is  useful.  Also, 
there  is  a  convention  for  picturing  rooted  trees.  Suppose  v  is  a  root  in  a  tree  T. 
By  Proposition  12.1.2,  there  is  a  unique  path  from  v  to  any  other  vertex.  Thus, 
the  concept  of  level ,  meaning  distance  (number  of  edges  on  a  path)  from  v,  is 
well  defined.  When  drawing  a  rooted  tree  then,  we  agree  to  put  the  root  v  at  the 
top  (in  contrast  to  the  way  we  draw  trees  that  grow  out  of  doors),  we  put  all 
level  one  vertices  (those  of  distance  1  from  v)  on  a  horizontal  line  just  below  v, 
those  of  level  two  (distance  2  from  v)  on  a  horizontal  line  below  the  previous, 
and  so  on.  The  tree  in  Fig  12.2  is  drawn  so  as  to  show  the  vertex  “start”  as 
root.  Vertex  A  in  Fig  12.4  is  a  root.  Vertices  B.  D,  and  F  have  level  one;  the 
vertices  labeled  C,  E,  F,  C,  E,  B.  C,  E  in  the  next  row  have  level  two;  and 
so  on. 

The  way  in  which  a  rooted  tree  is  drawn  clearly  depends  upon  which  vertex 
is  the  root.  In  Fig  12.6,  we  show  two  pictures  of  the  same  tree,  rooted  at  A  on 
the  left  and  at  B  on  the  right. 


Figure  12.6 


The  same  tree  rooted  at  A  on  the  left  and  B  on  the  right. 


Answers  to  Pauses 


1.  There  are  13  in  all. 


4;  4,8;  4,5;  4.5,6;  4,6;  8;  5;  5,6;  0;  0,6;  0,2;  6;  2 


2.  A  shortest  Hamiltonian  cycle  is  ABCDEF A,  of  length  155. 
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3.  There  are  ten  possible  outcomes,  corresponding  to  the  ten  vertices  of  degree  1 
in  Fig  12.5.  As  many  as  five  games  could  be  required  to  decide  a  winner. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Suppose  eight  teams  qualify  for  the  playoffs  in 
a  local  softball  league.  In  the  first  round  of  playoffs. 
Series  A  pits  the  best  team  against  the  team  that  fin¬ 
ished  eighth,  Series  B  has  the  second  best  team  playing 
the  team  that  finished  seventh,  Series  C  has  the  third 
placed  team  playing  the  sixth.  In  Series  D,  the  teams 
that  finished  fourth  and  fifth  play.  In  the  second  round, 
the  winners  of  Series  A  and  C  compete,  as  do  the  win¬ 
ners  of  Series  B  and  D,  in  Series  E  and  F.  Finally, 
the  winners  of  these  series  play  to  determine  the  league 
champion.  Draw  a  tree  which  summarizes  the  playoff 
structure  in  this  league. 

2.  Edward  VII,  eldest  son  among  Queen  Victoria’s  nine 
children,  finally  ascended  to  the  throne  of  England  in 
1901  (after  60  years  as  Prince  of  Wales).  Upon  his 
death  in  1910,  he  was  succeeded  by  King  George  V. 
George  V  had  six  children,  perhaps  the  most  infa¬ 
mous  of  whom  was  Edward  VIII,  who  abdicated  to 
marry  a  twice  divorced  American.  Upon  this  abdi¬ 
cation,  George  VI,  father  of  the  present  Queen  of 
England,  became  king.  Queen  Elizabeth  has  four  chil¬ 
dren,  Charles,  Anne,  Andrew,  and  Edward.  Her  sister. 
Princess  Margaret,  has  two,  Sarah  and  David.  Prince 
Charles  has  two  boys,  William  and  Harry,  marking  the 
first  time  since  the  death  of  Edward  VII  that  there  have 
been  two  males  in  direct  succession  to  the  English 
throne.  Draw  that  portion  of  Prince  William's  family 
tree  which  shows  the  people  discussed  in  this  short 
narrative. 

3.  Make  a  tree  (like  that  in  Fig  12.2)  which  displays  all 
monotonically  increasing  subsequences  of  3,  2,  8,  0,  9, 
1 , 5.  How  many  are  there  altogether?  Taking  "start”  to  be 
the  root,  how  many  vertices  are  at  each  possible  level? 

4.  John  does  not  play  chess  very  well,  so  the  next  time  he 
plays  David  he  says  he  will  quit  after  winning  a  game,  or 
after  five  games  in  all  have  been  played.  Make  a  graph 
showing  all  possible  outcomes.  How  many  possible  out¬ 
comes  are  there?  In  how  many  of  these  does  John  win 
a  game? 

5.  [BB]  It's  the  weekend  and  Harry  has  several  ways  to 
spend  his  Friday  and  Saturday  evenings.  He  can  study, 
read,  watch  television,  or  go  out  with  the  boys.  Draw  a 


tree  which  shows  all  possible  ways  Harry  can  spend  his 
Friday  and  Saturday  evenings. 

6.  Mary  can  travel  from  St.  John’s  to  Comer  Brook  by  car, 
bus,  or  plane  and  from  Corner  Brook  to  Goose  Bay  by 
plane  or  boat.  Draw  a  tree  showing  all  the  possible  ways 
Mary  can  go  from  St.  John’s  to  Goose  Bay  via  Corner 
Brook.  In  how  many  of  these  ways  does  she  avoid  the 
bus? 

7.  [BB]  In  how  many  ways  can  a  committee  of  three  people 
be  chosen  from  the  following  group  of  people:  Bruce, 
Philomena,  Irene,  Tom,  and  Dave?  Draw  a  tree  which 
shows  all  possible  committees  and  the  way  each  was 
chosen. 

8.  Draw  the  tree  with  root  A  and  with  root  B. 


9.  The  vertices  in  the  graph  represent  towns,  the  edges 
roads,  and  the  labels  on  the  roads  costs  of  paving  the 
roads. 


A  5  B  2  C 


(a)  Make  a  tree  which  shows  all  paths  beginning  at 
vertex  A.  List  those  paths  which  terminate  at  C. 
Indicate  which  ones,  if  any,  are  Hamiltonian. 

(b)  Is  the  graph  Hamiltonian?  Explain. 

(c)  Which  roads  should  be  paved  in  order  that  one  may 
drive  from  A  along  paved  roads  to  as  many  towns 
as  possible  at  minimal  cost?  Justify  your  answer. 
What  is  this  minimal  cost? 

10.  Solve  the  Traveling  Salesman's  Problem  for  each  of 
the  following  graphs  by  making  trees  which  display  all 
Hamiltonian  cycles.  (Start  at  A  in  each  case.) 
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A 


11.  A  forest  is  a  graph  with  no  cycles.  Explain  why  a  forest 
is  the  disjoint  union  of  trees. 

12.  [BB]  Suppose  V\  and  Vj  are  two  paths  from  a  vertex 
v  to  another  vertex  in  in  a  graph.  Prove  that  the  closed 
walk  obtained  by  following  V\  from  v  to  w  and  then 
V2  in  reverse  from  w  to  t>  contains  a  cycle. 


12.2  Properties  of  Trees 

In  this  section,  we  derive  a  number  of  properties  of  trees  and  discuss  a  connection 
with  the  isomers  of  a  certain  kind  of  hydrocarbon. 


12.2.1  PROPOSITION 


Suppose  Q  is  a  graph  each  of  whose  vertices  has  degree  at  least  2.  Then  Q  contains 
a  circuit. 


Proof  Let  v\  be  any  vertex  of  Q.  Since  deg  v  1  7^  0,  we  may  proceed  to  an  adjacent 
vertex  i>2-  Since  deg  112  >  2,  we  may  proceed  to  an  adjacent  vertex  1)3  along  a 
new  edge.  Note  that  v\V2V?,  is  a  path.  In  general,  suppose  k  >  3  and  that  we 
have  found  a  path  through  distinct  vertices  v\,V2,...  ,  Vk .  Since  deg  v k  >  2,  Vk  is 

adjacent  to  a  vertex  Vk+\  ^  v*- 1-  If  Lt+i  e  {vi,  V2 . Vk- 2}.  we  have  a  circuit; 

otherwise,  v\V2  ■  ■  ■  vu+\  is  a  path.  Since  there  are  only  finitely  many  vertices  in 
Q,  we  cannot  continue  to  find  paths  of  increasing  length,  so  eventually  we  find  a 
circuit.  I 


In  a  trivial  way,  a  graph  with  a  single  vertex  (of  degree  0)  is  a  tree.  A  tree  with 
more  than  one  vertex  can  have  no  vertices  of  degree  0  (because  it  is  connected) 
so  all  its  vertices  have  degree  at  least  1 .  If  they  all  had  degree  at  least  2,  there 
would  be  a  circuit.  Thus,  we  obtain  a  basic  property  of  trees. 


12.2.2  COROLLARY 


A  tree  with  more  than  one  vertex  must  contain  a  vertex  of  degree  1.  (Such  a 
vertex  is  called  a  leaf ) 

What  is  a  leaf? 

Modify  the  argument  used  to  prove  Proposition  12.2.1  in  order  to  prove  that  a 
tree  with  more  than  one  vertex  must  contain  at  least  two  leaves. 


In  Exercise  21(b)  of  Section  10.1,  you  were  asked  to  prove  that  a  connected 
graph  with  n  vertices  has  at  least  n  —  1  edges.  It  turns  out  that  a  connected  graph 
with  n  vertices  and  exactly  n  —  1  edges  is  a  tree;  moreover,  this  property  charac¬ 
terizes  trees. 


12.2.3  THEOREM 


A  connected  graph  with  n  vertices  is  a  tree  if  and  only  if  it  has  n  —  1  edges. 


Proof  (— »•)  First  we  prove,  by  mathematical  induction,  that  a  tree  with  n  vertices  has 

n  —  1  edges.  This  is  clear  for  n  =  1,  so  assume  that  k  >  1  and  that  any  tree  with 
k  vertices  has  k  —  1  edges.  Let  T  be  a  tree  with  k  +  1  vertices.  We  must  prove 
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that  T  has  k  edges.  For  this,  let  w  be  a  vertex  in  T  of  degree  1.  Remove  v  and 
the  single  edge  e  with  which  it  is  incident.  The  point  to  appreciate  is  that  the 
pruned  tree  Tq  =  T  \  ju)  which  remains  is  itself  a  tree.  It  has  no  circuits  since 
the  original  tree  had  no  circuits.  Why  is  it  connected?  If  w  and  u  are  vertices  in 
Tq,  they  are  joined  by  a  path  V  in  T .  Since  you  enter  and  leave  each  vertex  of  V 
along  different  edges,  each  vertex  of  V  between  w  and  u  has  degree  at  least  2. 
Thus,  v  and  the  lone  edge  e  with  which  it  is  adjacent  are  not  part  of  the  path  V. 
So  V  must  lie  entirely  in  Tq.  Therefore,  Tq  is  connected  and  hence  a  tree.  Since 
7o  has  k  vertices,  it  has  k  —  1  edges,  by  the  induction  hypothesis.  Since  T  has 
one  more  edge  than  Tq,  T  has  k  edges,  which  is  what  we  wanted  to  show. 


v3 

"o 
v 

b 

Figure  12.7  u 

(<— )  For  the  converse,  we  suppose  that  Q  is  a  connected  graph  with  n  vertices 
and  n  -  1  edges  and  prove  that  Q  is  a  tree.  W'e  have  only  to  establish  the  absence 
of  circuits.  Suppose  that  Q  contains  a  circuit  tqt>2  ■  ■  ■  tqtq.  (Since  graphs  have  no 
loops  and  no  multiple  edges,  k  >  3.)  Remove  the  edge  e  =  V\V2-  The  remaining 
graph  Q  \  {e}  has  all  n  vertices  of  Q  but  n  —  2  edges;  moreover,  Q  \  {e}  is 
connected  because  any  two  vertices  u,  v  of  Q  which  were  connected  via  a  path 
which  included  e  are  still  connected  via  a  walk  in  which  v\VkVk-\  •• -1131)2  is 
substituted  for  e.  (See  Fig  12.7.) 

If  Q\  {e}  contains  a  circuit  we  delete  an  edge  from  this  and  obtain  a  connected 
graph  which  still  contains  n  vertices  but  has  just  n  —  3  edges.  Since  Q  is  finite, 
eventually  all  circuits  will  be  deleted  and  we  shall  be  left  with  a  connected  graph 
with  n  vertices,  fewer  than  n  —  1  edges,  and  no  circuits.  Such  a  graph  is  a  tree. 
We  showed  earlier  that  a  tree  with  n  vertices  must  have  n  —  1  edges.  Thus,  there 
can  be  no  circuit  in  Q  is  indeed  a  tree.  I 

The  tree  in  Fig  12.1  has  26  vertices  and  25  =  26  —  1  edges,  as  predicted  by 
Theorem  12.2.3.  Verify  this  theorem  for  the  trees  in  Figs  12.2  and  12.5. 

There  are  two  other  properties  of  trees  which  follow  quickly  from  Theo¬ 
rem  12.2.3.  The  first  has  already  been  noted. 


12.2.4  COROLLARY 


A  tree  with  more  than  one  vertex  has  at  least  two  leaves,  that  is,  at  least  two 
vertices  of  degree  1 . 


Proof  Suppose  T  is  a  tree  with  n  vertices  iq,  i>2, . . .  ,  vn  and  n  —  1  edges.  By  Proposi¬ 
tion  9.2.5,  deg  v,  =  2{n  —  \)  —  2n  —2.  We  already  know  that  some  vertex, 
say  iq,  has  degree  1.  If  the  remaining  n  —  1  vertices  each  have  degree  2  or  more, 
then  the  sum  of  all  the  degrees  would  be  at  least  1  +  2(n  —  1)  =  2n  —  1,  a  con¬ 
tradiction.  So  there  has  to  be  another  vertex,  in  addition  to  tq,  which  has  degree 
1  too.  I 


374  Chapter  12  Trees 


In  fact,  there  is  a  precise  formula  for  the  number  of  leaves,  a  formula  which 
shows  that  this  number  is  at  least  2.  See  Exercise  9. 


12.2.5  COROLLARY 


Any  edge  added  to  a  tree  must  produce  a  circuit. 


Proof  The  new  graph  remains  connected  and  it  is  not  a  tree  because  it  has  an  equal 
number  of  edges  and  vertices.  So  it  must  contain  a  circuit.  I 

HHm  Is  it  possible  for  two  circuits  to  be  formed  when  a  single  edge  is  added  to  a  tree? 

The  identification  of  a  tree  as  an  important  type  of  graph  is  credited  to  the 
English  mathematician  Arthur  Cayley  (1821-1895),  a  lawyer,  a  contemporary  of 
Sir  William  Rowan  Hamilton,  and  arguably  the  leading  English  mathematician  of 
the  nineteenth  century.  Cayley  did  fundamental  work  in  the  theory  of  matrices  and 
is  considered  by  many  to  be  the  founder  of  abstract  group  theory.  In  chemistry, 
chemical  compounds  with  the  formula  are  known  as  paraffins.  The 

molecule  Q//2&+2  contains  k  carbon  atoms  and  2k  +  2  hydrogen  atoms.  Each 
hydrogen  atom  is  bonded  to  exactly  one  atom,  a  carbon  atom;  each  carbon  atom  is 
bonded  to  four  atoms.  It  is  not  hard  to  see  that  there  is  just  one  possible  structure 
for  each  of  the  first  three  paraffins;  methane,  ethane,  and  propane.  (See  Fig  12.8.) 


CHa 

Figure  12.8  Methane 


c2h6 

Ethane 


C3//8 

Propane 


On  the  other  hand,  for  k  >  4,  various  arrangements  of  Q-  H2k+2  are  possible. 
Each  arrangement  is  called  an  isomer.  Arthur  Cayley  wanted  to  know  the  number 
of  isomers  of  each  paraffin.  Each  isomer  is  clearly  associated  with  a  connected 
graph,  a  graph  which  is,  in  fact,  a  tree.  Here  is  why. 

Each  of  the  k  carbon  atoms  in  C\  H2k+2  corresponds  to  a  vertex  of  degree  4  in 
the  associated  graph  and  each  of  the  2k  +  2  hydrogen  atoms  to  a  vertex  of  degree 
1.  The  total  number  of  vertices  in  the  graph  is  k  +  (2k  +  2)  =  3k  +  2  and  the  total 
number  of  edges  is  one  half  the  sum  of  the  degrees;  that  is,  j(4 k  +  (2k  +  2))  = 
3 k  4-  1.  Since  the  graph  is  connected  and  the  number  of  edges  is  one  less  than  the 
number  of  vertices,  it  is  a  tree  (by  Theorem  12.2.3). 

Cayley  also  noted  that  an  isomer  of  CkHik+2  is  completely  determined  by 
the  arrangement  of  the  carbon  atoms  and  that  these  form  a  tree  with  each  vertex 
of  degree  at  most  4.  (See  Exercise  10.)  Note  the  tree  of  carbons  in  each  of  the 
molecules  in  Fig  12.8.  To  draw  an  isomer  of  C^Hik+i,  we  draw  a  tree  with  k 
vertices  labeled  C,  each  of  degree  at  most  4,  and  then  add  new  edges  with  end 
vertices  labeled  H,  of  degree  1,  at  each  C  vertex  where  necessary  so  that  each  C 
vertex  acquires  degree  4.  (It  can  be  shown  that  2k  +  2  edges  must  be  added.  See 
Exercise  1 1 .) 
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Figure  12.9 


Figure  12.10 


12.2.6  DEFINITION 


Answers  to  Pauses 


To  enumerate  all  trees  with  n  vertices,  as  Cayley  was  attempting  to  do,  is 
very  difficult  unless  n  is  small.  The  situation  for  n  <  4  is  shown  in  Fig  12.9. 


o 


G - O  G - 0 - 0 


n  =  1  n  =  2  n  =  3 

one  tree  one  tree  one  tree 


The  unlabeled  trees  with  n  <4  vertices. 


Draw  graphs  of  all  possible  isomers  of  butane  (whose  chemical  formula  is  C\H m). 
How  many  are  there? 

There  is  no  closed  formula  for  the  number  of  trees  with  n  vertices,  although 
these  numbers  (for  various  n)  appear  as  the  coefficients  of  a  certain  generating 
function  which  begins 

x  +  x2  +  a3  +  2.r4  +  3x5  +  6x6  +  1  Ijc7  +  23jc8, 

but  whose  derivation  would  take  us  too  far  afield.1  On  the  other  hand,  as  we 
shall  see  in  the  next  section,  there  are  precisely  nn~ 2  labeled  trees  on  n  vertices 
(Theorem  12.3.3).  As  the  name  implies,  a  labeled  tree  is  one  in  which  each  vertex 
has  a  distinct  label  attached  to  it.  Figure  12.10  shows  graphs  of  the  33-2  =  3 
different  labeled  trees  with  vertices  A,  B,  and  C. 


g - e - o  g - e - o  g - e - o 

B  A  C  ABC  A  C  B 

The  three  labeled  trees  on  three  vertices. 


Trees  Tj  and  7^  whose  vertices  are  labeled  with  the  same  set  of  labels  are  isomor¬ 
phic  if  and  only  if,  for  each  pair  of  labels  v  and  w,  vertices  v  and  w  are  adjacent 
in  T\  if  and  only  if  they  are  adjacent  in  7j>. 

In  Fig  12.10,  no  two  of  the  labeled  trees  are  isomorphic.  The  situation  is 
different,  however,  in  Fig  12.1 1.  Here  the  labeled  trees  Tj  and  T2  are  isomorphic, 
although  neither  is  isomorphic  to  Tj.  The  adjacencies  between  the  labeled  vertices 
of  Tj  and  7j>  are  identical.  On  the  other  hand,  whereas  vertices  B  and  C  are 
adjacent  in  T\  (and  T2),  such  is  not  the  case  in  Tj. 

4.  A  leaf  is  a  vertex  of  degree  1 . 

5.  Let  ui  be  a  leaf  in  a  tree  and  start  a  path,  as  described  in  Proposition  12.2.1, 
with  t>i.  Since  there  are  no  circuits  in  a  tree,  the  path  v\,  V2, 113,  . . .  obtained 
in  the  proposition  can  be  extended  to  a  longer  path  as  long  as  each  vertex  we 
meet  has  degree  at  least  2.  By  finiteness,  it  must  eventually  terminate  with  a 
second  vertex  of  degree  1 ,  that  is,  another  leaf. 


'The  interested  reader  might  consult  Frank  Harary’s  book,  Graph  Theory,  Addison-Wesley  (1969). 
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A  B  BA  A  E 


%  r2  r3 

Figure  12.11  Tj  and  T2  are  isomorphic  labeled  trees  which  are  not  isomorphic  to  T). 

6.  The  tree  in  Fig  12.2  has  14  vertices  and  13  =  14  —  1  edges;  the  tree  in 
Fig  12.5  has  19  vertices  and  18  =  19  —  1  edges. 

7.  No,  it  isn’t.  First  note  that  when  an  edge  is  added  to  a  tree,  any  circuit  that  is 
produced  must  be  a  cycle.  To  see  why,  suppose  the  addition  of  edge  v\  i>2  pro¬ 
duces  a  circuit  U|W2U3  •  •  •  vnv\ .  If  this  circuit  is  not  a  cycle,  there  is  a  repeated 
vertex.  This  vertex  must  be  ui  or  u2i  otherwise,  the  original  tree  contained  a 
circuit,  contrary  to  fact.  If  the  repeated  vertex  is  vj  (the  argument  in  the  case 
of  V2  is  similar),  then  our  circuit  has  the  form  v\  V2  ■  ■  ■  v\w  ■  ■  ■  vnv\.  Since  the 
edge  V|  V2  is  not  repeated  in  the  circuit,  v\w  ■  ■  ■  vnv\  is  a  circuit  in  the  original 
tree,  a  contradiction.  Thus,  if  an  edge  v\  u2  is  added  to  a  tree,  it  is  in  fact  only 
a  cycle  which  is  produced.  If  two  cycles  were  produced,  there  would  be  two 
paths  from  vi  to  u2  in  the  original  graph,  contradicting  Proposition  12.1.2. 

8.  Since  there  are  two  nonisomorphic  trees  with  four  vertices  and  in  each  of 
these,  all  vertices  have  degree  at  most  4,  there  are  two  isomers  of  butane. 
These  are  pictured  in  Fig  12.12.  (Note  the  two  trees  of  four  carbon  atoms  and 
compare  with  the  two  trees  on  the  right  of  Fig  12.9.) 


u  u  u 


Figure  12.12  H 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 

(b)  [BB]  How  many  isomers  does  pentane  (C5H 12) 
have?  Why? 


1.  (a)  [BB]  Draw  the  graphs  of  all  nonisomorphic  unla¬ 
beled  trees  with  five  vertices. 
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2.  (a)  Draw  the  graphs  of  all  nonisomorphic  unlabeled 

trees  with  six  vertices. 

(b)  How  many  isomers  does  hexane  ( C(,Hi4 )  have? 
Why? 

3.  [BB]  Suppose  Q  is  a  graph  whose  vertices  represent 
cities  in  a  country.  An  edge  in  Q  represents  a  direct  (non¬ 
stop)  air  flight  between  the  corresponding  cities.  What 
does  a  beta  index  less  than  1  say  about  air  travel  in  the 
country?  There  are  two  possibilities  to  consider.  (See 
Exercise  5  of  Section  9.1.) 

4.  Prove  that  a  connected  graph  with  n  vertices  is  a  tree 
if  and  only  if  the  sum  of  the  degrees  of  the  vertices  is 
2 (n  -  1). 

5.  [BB]  Recall  that  a  graph  is  acyclic  if  it  has  no  cycles. 
Prove  that  a  graph  with  n  vertices  is  a  tree  if  and  only 
if  it  is  acyclic  with  n  —  1  edges. 

6.  Suppose  Q  is  an  acyclic  graph  with  n  >  2  vertices  and 
we  remove  one  edge.  Explain  why  the  new  graph  Q' 
cannot  be  connected. 

7.  Suppose  a  graph  Q  has  two  connected  components,  7] , 
73,  each  of  which  is  a  tree.  Suppose  we  add  a  new  edge 
to  Q  by  joining  a  vertex  of  7]  to  a  vertex  in  73.  Prove 
that  the  new  graph  is  a  tree. 

8.  (a)  Let  e  be  an  edge  in  a  tree  T.  Prove  that  the  graph 

consisting  of  all  the  vertices  of  T  but  with  the  single 
edge  e  deleted  is  not  connected. 

(b)  Without  assuming  the  existence  of  vertices  of  degree 
1 ,  use  the  result  of  (a)  and  the  strong  form  of  mathe¬ 
matical  induction  to  prove  that  a  tree  with  n  vertices 
has  n  —  1  edges. 

9.  Let  T  be  a  tree  with  n  vertices  v\ ,  in, . . .  ,  v„.  Prove  that 
the  number  of  leaves  in  T  is  2  +  £7leg  t), >3  [deg  v,  —  2]. 

10.  [BB]  Prove  that  the  subgraph  of  a  C^Hik+l  tree  T  con¬ 
sisting  of  the  k  carbon  vertices  and  all  edges  from  T 
among  them  is  itself  a  tree. 


11.  (a)  [BB]  Suppose  T  is  a  tree  with  k  vertices  labeled  C, 

each  of  degree  at  most  4.  Enlarge  T  by  adjoining 
sufficient  vertices  labeled  H  so  that  each  vertex  C 
has  degree  4  and  each  vertex  H  has  degree  1 .  Prove 
that  the  number  of  H  vertices  adjoined  to  the  graph 
must  be  2k  +  2. 

(b)  Can  you  prove  (a)  without  assuming  that  T  is  a 
tree? 

12.  We  have  noted  in  this  section  that  there  are  n"~ 2  labeled 
trees  on  n  vertices. 

(a)  Verify  this  formula  for  n  =4  by  drawing  the 
nonisomorphic  unlabeled  trees  on  four  vertices  and 
discovering  how  many  different  labeled  trees  each 
determines.  (See  Fig  12.9.) 

(b)  Repeat  (a)  for  n  =  5. 

13.  Prove  that  a  tree  with  n  >  2  vertices  is  a  bipartite  graph. 

14.  [BB]  Find  necessary  and  sufficient  conditions  for  a  tree 
to  be  a  complete  bipartite  graph.  Prove  your  answer. 

15.  A  forest  is  a  graph  every  component  of  which  is  a  tree, 
equivalently,  a  graph  without  cycles.  (See  Exercise  1 1 
of  Section  12.1.) 

(a)  [BB]  Show  that  a  forest  with  c  components,  each 
containing  at  least  two  vertices,  has  at  least  2c  ver¬ 
tices  of  degree  1 . 

(b)  Is  the  result  of  (a)  true  without  the  stipulation 
that  each  component  contain  at  least  two  vertices? 
Explain. 

(c)  Find  a  formula  for  the  number  of  edges  in  a  forest 
with  n  vertices  and  c  components  and  prove  your 
answer. 

16.  (a)  [BB]  Show  that  a  tree  with  two  vertices  of  degree 

3  must  have  at  least  four  vertices  of  degree  1 . 

(b)  Show  that  the  result  of  (a)  is  best  possible:  A  tree 
with  two  vertices  of  degree  3  need  not  have  five 
vertices  of  degree  1 . 


12.3  Spanning  Trees 

Figure  12.13  represents  a  map:  The  vertices  correspond  to  towns  and  the  labels  on 
the  edges  represent  distances  along  existing  gravel  roads  between  adjacent  towns. 
The  province  plans  to  pave  certain  roads  in  such  a  way  that  one  can  get  between 
any  two  towns  on  pavement.  What  roads  should  be  paved  so  as  to  minimize 
the  total  length  of  pavement  required?  This  problem,  known  as  the  Minimum 
Connector  Problem ,  asks,  in  the  language  of  graph  theory,  for  a  minimum  spanning 
tree  for  the  graph. 
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Figure  12.13 


A 


12.3.1  DEFINITION 


A  spanning  tree  of  a  connected  graph  Q  is  a  subgraph  which  is  a  tree  and  which 
includes  every  vertex  of  Q.  A  minimum  spanning  tree  of  a  weighted  graph  is  a 
spanning  tree  of  least  weight,  that  is,  a  spanning  tree  for  which  the  sum  of  the 
weights  of  all  its  edges  is  least  among  all  spanning  trees. 


The  concept  of  spanning  tree  exists  only  for  a  connected  graph  Q  and,  if  Q 
has  n  vertices,  any  spanning  tree  must  necessarily  contain  n  —  1  edges. 

Finding  a  spanning  tree  in  a  connected  graph  Q  is  not  hard.  If  Q  has  no 
circuits,  then  it  is  already  a  tree,  so  Q  is  a  spanning  tree  for  Q.  If  Q  contains  a 
circuit,  then,  just  as  in  the  proof  of  the  second  half  of  Theorem  12.2.3,  we  can 
delete  an  edge  (without  deleting  any  vertex)  so  as  to  remove  the  circuit  but  leave 
the  graph  connected.  By  repeating  this  procedure,  we  eventually  find  a  connected 
subgraph  without  circuits  containing  all  the  vertices  of  Q,  that  is,  a  spanning  tree. 
Figure  12.14  shows  a  graph  and  three  of  its  spanning  trees.  Spanning  trees  are 
considered  to  be  “different”  if  they  make  use  of  different  edges  of  the  graph. 
Trees  T\  and  7)  in  Fig  12.14  are  isomorphic,  but  they  are  different  spanning 
trees  of  Q. 


Figure  12.14 


A  graph  Q  and  three  of  its  spanning  trees. 


There  is  a  procedure  for  obtaining  additional  spanning  trees  from  a  given 
spanning  tree.  In  Fig  12.14,  72  and  73  were  obtained  from  T\  as  follows:  First 
edge  d  was  added  to  7j,  giving  the  circuit  abcd\  removing  a  gives  T2  and 
removing  b  gives  7j.  Removing  c  would  give  a  fourth  spanning  tree,  and,  if 
we  wanted  more,  we  could  return  to  Tj  and  add  edge  e,  say,  thereby  complet¬ 
ing  another  circuit  abcef .  Successively  removing  each  of  the  edges  a,  b,  c,  and 
/  of  this  circuit  would  give  four  new  spanning  trees.  Remember  that  adding 
an  edge  to  a  tree  always  produces  a  circuit  (Corollary  12.2.5).  In  Exercise  14, 
we  ask  you  to  show  how  the  procedure  described  in  this  paragraph  can  be 
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adapted  to  show  that  all  the  spanning  trees  in  a  graph  can  be  obtained  from  a 
given  one. 

How  many  spanning  trees  does  a  connected  graph  possess?  For  this,  there 
is  a  pretty  result  established  in  1847  by  the  German  physicist  Gustav  Kirchhoff 
(1824-1887),  who  was  studying  electrical  networks. 

Readers  who  have  had  a  first  course  in  linear 
algebra  should  know  that  a  cofactor  of  a  square 
matrix  A  is  plus  or  minus  the  determinant  of  a 
submatrix  obtained  from  A  by  deleting  a  row  and 
a  column.  There  is  one  cofactor  for  each  position  :  : 

in  the  matrix.  The  choice  of  plus  or  minus  for 
the  (i,  j)  cofactor  is  determined,  respectively,  by 
whether  or  not  there  is  a  plus  in  position  (i,  j)  of 
the  configuration  shown  at  the  right. 

The  “+”  in  position  (3,  1),  for  example,  indicates  that  the  (3,  1)  cofactor  is 
the  determinant  of  the  matrix  obtained  by  deleting  row  3  and  column  1,  while  the 
”  in  position  (2,  3)  indicates  that  the  (2,  3)  cofactor  is  minus  the  determinant 
of  the  matrix  obtained  by  deleting  row  2  and  column  3. 

(Kirchhoff)  Let  M  be  the  matrix  obtained  from  the  adjacency  matrix  of  a  con¬ 
nected  graph  Q  by  changing  all  l’s  to  —  l’s  and  each  diagonal  0  to  the  degree  of 
the  corresponding  vertex.  Then  the  number  of  spanning  trees  of  Q  is  equal  to  the 
value  of  any  cofactor  of  M. 

It  is  surprising  that  there  is  such  a  simple  formula  because  it  is  often  a  difficult 
problem  to  count  graphs  of  a  specified  type.  In  particular,  counting  the  number 
of  nonisomorphic  spanning  trees  in  a  graph  is  a  complicated  affair. 

The  proof  of  Kirchhoff  s  Theorem  (which  is  also  known  as  the  Matrix- 
Tree  Theorem)  will  not  be  included  here.2  Instead,  we  content  ourselves  with 
an  illustration. 


Consider  the  graph  whose  adjacency  matrix  is 

-  0  1  1  1  ‘ 
1  0  1  1 
A  _  110  0 
110  0 


The  matrix  specified  in  Kirchhoff  s  Theorem  is 

~  3 

-1  -1 

-1 

M  = 

-1 

3  -1 

-1 

-1 

-1  2 

0 

-1 

-1  0 

2 

-Sec  Chapter  12  of  J.  A.  Bondy  and  U.  S.  R.  Murty,  Graph  Theory  with  Applications ,  North-Holland, 
New  York,  1981. 
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The  (1,1)  cofactor  of  M  is 


3 

-1 

-1 

+  det 

-1 

2 

0 

=  3 

-1 

0 

2 

(-D 


-1  0 

-1  2 


+  (-D 


-1 

-1 


2 

0 


=  3(4)+  1(— 2)  +  (—1)2  =  8 


(expanding  by  cofactors  of  the  first  row).  Kirchhoff’ s  Theorem  guarantees  that  all 
the  cofactors  of  M  equal  8.  For  example,  the  (2,  3)  cofactor  of  M  is 


—  det 


3  -1  -1 

-1  -1  0 

-1  -1  2 


— [  (— 1)(0)  +  2(— 3  —  1)  ]  =  —(—8)  =  8 


(expanding  by  cofactors  of  the  third  column).  Thus,  there  are  eight  spanning  trees 
in  the  graph.  A 

Draw  the  graph  discussed  in  the  previous  example  and  show  each  of  its  eight 
spanning  trees. 

How  many  spanning  trees  has  the  graph  in  Fig  12.14  and  why? 

Here  is  a  remarkable  application  of  Kirchhoff  s  Theorem. 


12.3.3  THEOREM 


The  number  of  labeled  trees  with  n  vertices  is  nn  2 


Proof  After  the  vertices  of  the  complete  graph  Kn  are  labeled,  any  spanning  tree  of  K',n 
is  a  labeled  tree  on  n  vertices.  Conversely,  any  tree  with  these  n  vertices  is  a 
spanning  tree  for  K.n  because  ICn  contains  all  possible  edges.2  Thus,  the  number 
of  labeled  trees  on  n  vertices  is  the  number  of  spanning  trees  for  /C„,  and  the 
latter  number  can  be  determined  by  Kirchhoff  s  Theorem. 

The  adjacency  matrix  of  K.„  contains  0’s  on  the  diagonal  and  l’s  everywhere 
off  the  diagonal.  So  the  matrix  whose  equal  cofactors  count  the  number  of  trees 
is  the  n  x  n  matrix 


n  —  1 

-1 

•  -1  " 

-1 

n  —  1 

•  -1 

-1 

-1  • 

-1 

-1 

-1 

-1 

-1 

•  -1 

•  •  n  —  1 

The  (1,1)  cofactor  of  M  is  the  determinant  of  an  (n  —  1)  x  (n  —  1)  matrix  which 
looks  exactly  like  M.  Recall  that  the  determinant  of  a  matrix  is  not  changed  if 
a  row  is  added  to  another  row.  Adding  to  the  first  row  in  turn  each  of  the  rows 


3  Here  Fig  12.15  may  help.  The  tree  on  the  left  is  a  spanning  tree  of  IC4  but  not  of  Q  since  RD  is  not 
an  edge  in  Q. 
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Figure  12.15 


Answers  to  Pauses 


below  it  gives  the  matrix 


1 

1 

1 

1 

—  1  n  —  1 

-1 

..  _i 

-1 

-1 

n  —  1 

-1 

-1  -1 

-1  • 

•  •  n  —  1 

and  then  adding  the  first  row  of  this  matrix  to  each  of  the  rows  below  it  gives  the 

matrix 

"  1  1  1  1  ” 

On  0  •  •  •  0 

0  0  n  0 

_  0  0  0  ■■■  n  _ 

which  is  an  upper  triangular  (n  —  1)  x  (n  —  1)  matrix  with  n  —  2  n  s  on  the 
diagonal.  Its  determinant  is  nn~ 1 .  I 


As  spanning  trees,  the  eight  are  all  different,  although  they  are  partitioned 
into  just  two  isomorphism  classes,  the  left  six  in  one  class,  the  rightmost  two 
in  the  other. 

10.  Labeling  the  graph  as  shown,  the  matrix  specified  in  Kirchoffs  Theorem 
is 


The  value  of  any  cofactor  is  24,  so  the  graph  in  Fig  12.14  has  24  spanning 
trees. 
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EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  This  exercise  concerns  the  graph  Q  shown  in 
Fig  12.14.  Delete  edges  a,  c,  and  e  in  order  to  obtain  a 
new  spanning  tree  T  for  Q.  Reinsert  edge  a  to  obtain 
the  circuit  afg.  Draw  two  other  spanning  trees  for  Q  by 
removing  first  /  and  then  g  from  this  circuit. 

2.  For  each  of  the  graphs  shown, 

•  find  three  spanning  trees  representing  two  isomor¬ 
phism  classes  of  graphs; 

»  find  the  total  number  of  spanning  trees. 

'  '  ^{> 


3.  Let  e  be  an  edge  of  the  complete  graph  K,„.  Prove  that 
the  number  of  spanning  trees  of  which  contain  e  is 
2  n"-3 4 5 6 7 8 9. 

4.  [BB]  How  many  labeled  trees  are  there  on  n  vertices, 
for  1  <  n  <  6? 

5.  [BB]  Draw  all  the  labeled  trees  on  four  vertices. 

6.  How  many  spanning  trees  does  ICy  have?  Why? 

7.  (a)  [BB]  Draw  all  the  spanning  trees  of  1C 2,2  and  indi¬ 

cate  the  isomorphism  classes  of  these.  How  many 
isomorphism  classes  are  there? 

(b)  Repeat  (a)  for  £2,3. 

8.  Determine  the  number  of  spanning  trees  of  the  complete 
bipartite  graph 

9.  [BB]  Suppose  some  edge  of  a  connected  graph  Q 
belongs  to  every  spanning  tree  of  Q.  What  can  you  con¬ 
clude  and  why? 


10.  Let  Q  be  a  tree.  Describe  all  the  spanning  trees  of  Q  and 
explain. 

11.  if  g  is  a  graph  and  e  is  an  edge  which  is  not  part  of  a 
circuit,  then  e  must  belong  to  every  spanning  tree  of  Q. 
Why? 

12.  (a)  [BB]  Prove  that  every  edge  in  a  connected  graph  is 

part  of  some  spanning  tree. 

(b)  Prove  that  any  two  edges  of  a  connected  graph  are 
part  of  some  spanning  tree. 

(c)  [BB]  Given  three  edges  in  a  connected  graph,  is 
there  always  a  spanning  tree  containing  these  edges? 
Explain  your  answer. 

13.  Let  Q  be  a  connected  graph  with  at  least  two  vertices. 
Show  that  Q  has  a  vertex  v  such  that  g  \  [u]  is  still 
connected.  Show  that  g  has  at  least  two  vertices  with 
this  property. 

14.  Suppose  that  %  and  T  are  two  different  spanning  trees 
for  a  graph  g. 

(a)  [BB]  Let  e  be  an  edge  in  T  which  is  not  in  %. 
Prove  that  there  is  an  edge  /  in  7o  which  is  not  in 
T  such  that  (7o  U  {e})  \  (/}  is  a  spanning  tree. 

(b)  Describe  a  procedure  for  obtaining  T  from  Tq. 

15.  Let  C„  be  the  cycle  with  n  vertices  labeled  1,2,..  ,  n 
in  the  order  encountered  on  the  cycle. 

(a)  Find  the  number  of  spanning  trees  for  C„  (in  two 
ways). 

(b)  Find  a  general  formula  for  the  number  of  spanning 
trees  for  C„  U  ]e),  where  e  joins  1  to  a  (3  <  a  < 
n  —  1). 

16.  [BB]  How  many  graphs  have  n  vertices  labeled 
«i,  t>2, . . .  ,  v„  and  n  —  1  edges?  Compare  this  number 
with  the  number  of  trees  with  vertices  tq, . . .  ,  v„,  for 
2  <  n  <  6. 

17.  Show  that  any  shortest  path  algorithm  can  be  used  to 
construct  a  spanning  tree  for  an  unweighted  connected 
graph.  (Caution!  A  shortest  path  algorithm  requires  a 
weighted  graph.  How  are  you  going  to  get  started?) 


12.4  Minimum  Spanning  Tree  Algorithms 

In  this  section,  we  answer  the  Minimum  Connector  Problem  suggested  by  the  town 
and  road  map  in  Fig  12.13.  The  problem  requires  us  to  find  a  minimum  spanning 
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12.4.1  KRUSKAL'S 
ALGORITHM 


EXAMPLE  2 


Figure  12.16 


tree,  and  we’d  like  to  find  one  efficiently.  In  theory,  we  could  enumerate  all  the 
spanning  trees  of  a  weighted  graph  and  simply  choose  the  tree  of  least  weight, 
but  if  the  graph  is  at  all  complicated,  this  is  not  an  easy  chore.  In  this  section, 
we  discuss  two  better  ways  discovered  in  the  1950s  by  J.  B.  Kruskal4  and  R.  C. 
Prim.5  Both  algorithms  are  in  some  sense  “obvious.”  They  produce  a  minimum 
weight  spanning  tree  an  edge  at  a  time,  at  each  stage  making  the  best  choice  of 
next  edge  without  thought  to  what  has  happened  already  or  what  will  ensue.  For 
this  reason,  they  are  called  greedy  algorithms. 

To  find  a  minimum  spanning  tree  in  a  connected  weighted  graph  with  n  >  1 
vertices,  carry  out  the  following  procedure. 

Step  1.  Find  an  edge  of  least  weight  and  call  this  e\ .  Set  k  =  1. 

Step  2.  While  k  <  n 

if  there  exists  an  edge  e  such  that  {e}  U  {e\,e2, ...  ,  C/t}  does  not  contain 
a  circuit,  let  e^+\  be  such  an  edge  of  least  weight  and  replace  k  by 
k  +  1; 

else  output  e\ ,  ei,  . . .  ,  eg  and  stop, 
end  while 

We  illustrate  with  reference  to  the  graph  of  Fig  12.16.  Kruskal’s  algorithm  ini¬ 
tially  picks  an  edge  of  lowest  weight.  There  are  several  edges  of  weight  2 
and  none  with  less  weight.  Any  of  these  may  be  selected.  Suppose  we  begin 
with  edge  e\  =  AJ.  There  still  remain  edges  of  weight  2,  so  we  choose 
any  other,  say  C2  =  GM.  Among  those  edges  which  do  not  form  a  circuit 
with  e\  and  ei  (at  this  point  there  are  no  edges  which  do),  we  choose  one  of 
lowest  weight,  say  DE ,  and  call  this  e-y.  Continuing,  we  might  select  edges 
AJ ,  GM,  DE,  MF,  FE,  LK,  H  J,  KM,  CK,  BC.  At  this  stage,  there  remains  an 
edge  ( GL )  of  weight  4,  but  this  edge  would  complete  a  circuit  with  LK,  KM, 


An  application  of  Kruskal’s  algorithm  might  select,  in  this  order,  edges  AJ,  GM,  DE,  MF, 
FE,  LK,  HJ,  KM,  CK,  BC,  J  L,  I  A. 


4J.  B.  Kruskal,  Jr„  "On  the  Shortest  Spanning  Subtree  of  a  Graph  and  the  Traveling  Salesman  Problem," 
Proceedings  of  American  Mathematical  Society  7  (1956),  48-50. 

'  R.  C.  Prim,  "Shortest  Connection  Networks  and  Some  Generalizations,"  Bell  System  Technical  Journal 
36  (1957),  1389-1401. 
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GM  already  selected,  and  so  we  choose  an  edge  of  weight  5,  say  J  L ,  and  another, 
I  A.  Now,  any  of  the  remaining  edges  will  complete  a  circuit  with  those  selected 
and  so  we  must  stop;  the  algorithm  assures  us  we  have  the  desired  minimum 
spanning  tree.  It  has  weight  39.  ▲ 

Why  does  the  algorithm  work?  Suppose  Q  is  the  given  connected  graph  and 
Tk  is  the  subgraph  produced  by  Kruskal’s  algorithm.  The  algorithm  terminates 
only  when  the  inclusion  of  any  edge  of  Q  not  in  Tk  produces  a  circuit  in  Tk  .  We 
must  prove  that  Tk  is  a  spanning  tree  and  that  it  is  minimum. 

First,  note  that  Tk  contains  every  vertex  of  Q .  Why?  Suppose  to  the  contrary 
that  some  vertex  v  is  not  in  Tk-  Since  Q  is  connected,  there  exists  some  edge 
incident  with  v.  Adding  this  edge  to  Tk  cannot  produce  a  circuit  in  Tk  since 
none  of  the  edges  incident  with  v  is  in  Tk-  Thus,  the  algorithm  could  continue, 
a  contradiction.  By  its  construction  Tk  contains  no  circuits.  To  see  that  it  is 
connected,  let  v  and  w  be  any  two  vertices  of  Tk  .  There  is  a  path  in  Q  from  v  to 
w  and  any  edge  of  this  path  not  in  Tk  can  be  replaced  by  a  path  in  Tk  since  the 
inclusion  of  any  edge  of  Q  not  in  Tk  produces  a  circuit.  Thus,  Tk  is  a  spanning 
tree.  It  is  more  difficult  to  establish  that  Tk  is  minimum. 

Let  Tm  be  a  minimum  spanning  tree  in  Q.  We  wish  to  show  that  the  weight 
of  Tk  equals  the  weight  of  Tm.  Suppose  that  Q  (hence,  also  Tm  and  Tk)  has  n 
vertices.  Then  Tk  contains  n  —  1  edges.  Suppose  these  were  determined  by  the 
algorithm  in  the  order  e\,ej, ...  .  en-\.  If  Tk  =  Tm,  there  is  nothing  to  prove; 
otherwise,  we  may  suppose  that  there  are  edges  of  Tk  not  in  Tm.  Let  ek  be  the 
first  such  edge.  Since  Tm  is  also  a  spanning  tree,  with  an  argument  similar  to  that 
used  to  solve  Exercise  14(a)  of  Section  12.3,  there  is  an  edge  e  in  Tm,  different 
from  e\  and  different  from  any  of  e\ ,  e2,  ■  ■  ■  ,  ek-  i  in  the  case  k  >  1,  such  that 
T  =  ( Tm  U  {ek})  \  { e )  is  a  spanning  tree.  The  weight  of  T  is 

(1)  w(T)  =  w(Tm )  +  w(ek)  -  w(e)  >  w(Tm) 

since  Tm  has  minimum  weight.  Therefore,  w(ek)  >  w(e). 

If  k  =  1,  w(ek)  <  w(e)  since  e\  was  the  first  edge  chosen  and  so  has 
minimum  weight.  So  w(e)  =  w(ek).  If  k  >  1,  the  edges  e,  e\,e2,  ...  ,  ek-\  do 
not  contain  a  circuit  because  they  are  all  part  of  the  tree  Tm.  Again,  w ( e )  —  w(ek)', 
otherwise,  w(e)  <  w(ek)  and  the  algorithm  would  have  chosen  e  instead  of  ek. 
In  all  cases,  this  makes  T  a  minimum  spanning  tree  since,  by  (1),  its  weight  is 
the  same  as  the  weight  of  the  minimum  spanning  tree  Tm.  Also,  T  is  “closer” 
to  Tk  than  was  Tm  in  the  sense  that  T  has  edges  e\, ...  ,ek  in  common  with 
Tk-  Now  if  T  =  Tk,  then  Tk  is  minimum  (which  is  what  we  want  to  show); 
otherwise,  we  apply  to  T  and  Tk  the  procedure  just  applied  to  Tm  and  Tk  and 
obtain  another  minimum  spanning  tree  with  yet  another  edge  in  common  with  Tk- 
Continuing  in  this  way,  we  eventually  arrive  at  a  minimum  spanning  tree  with 
edges  e\ , . . .  ,  e„-i  in  common  with  Tk  (and  so  equal  to  Tk),  proving  that  Tk  is 
indeed  a  minimum  spanning  tree. 

What  is  the  complexity  of  Kruskal’s  algorithm,  in  terms  of  comparisons?  To 
estimate  the  number  of  comparisons,  we  must  first  decide  how  we  are  going  to 
determine,  at  a  particular  stage,  whether  or  not  a  given  edge  forms  a  circuit  with 
some  of  the  edges  already  selected. 
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At  each  stage,  Kruskal’s  algorithm  has  selected  certain  edges.  Think  of  the 
subgraph  determined  by  these  edges  and  all  n  vertices  of  Q.  This  subgraph  has 
a  certain  number  of  connected  components,  each  of  which  is  a  tree.  The  first 
such  subgraph  has  n  components;  the  final  minimum  spanning  tree  has  just  one 
component.  To  begin,  assign  the  labels  1,2,...  ,  n  to  the  vertices  of  Q.  If  the 
first  edge  selected  is  uv,  with  u  labeled  i,  v  labeled  j,  and  i  <  j,  change  the 
label  of  u  to  i.  The  subgraph  consisting  of  this  edge  and  all  n  vertices  has  n  —  1 
components  and,  as  at  the  outset,  all  the  vertices  in  each  component  have  the  same 
label.  If  we  could  preserve  this  feature  of  our  labeling,  then  the  edges  eligible 
for  selection  at  a  certain  stage  (that  is,  those  edges  not  forming  a  circuit  with 
edges  already  chosen)  would  be  simply  those  whose  end  vertices  have  different 
labels. 

Suppose,  at  a  certain  stage,  vertices  have  the  same  label  if  and  only  if  they  lie 
in  the  same  component  of  the  subgraph  whose  vertices  are  all  those  of  Q  and  whose 
edges  are  those  the  algorithm  has  chosen  so  far,  and  that  each  such  component  is 
a  tree.  We  seek  an  edge  which  does  not  complete  a  circuit  with  previously  chosen 
edges,  hence  an  edge  with  end  vertices  lying  in  different  components.  Select  an 
edge  e  =  xy  whose  end  vertices  x  and  y  have  different  labels  and  which  has 
least  weight  among  such  edges.  If  x  is  labeled  k  and  y  is  labeled  £  and  k  <  £, 
then  x  belongs  to  a  component  all  of  whose  vertices  are  labeled  k  and  y  belongs 
to  a  component  all  of  whose  vertices  are  labeled  l.  By  changing  to  k  the  labels 
on  all  the  vertices  labeled  £,  the  next  subgraph  once  again  has  the  property  that 
the  vertices  have  the  same  label  if  and  only  if  they  lie  in  the  same  connected 
component.  Moreover,  each  new  component  is  again  a  tree  since  a  circuit  cannot 
be  produced  by  joining  two  vertices  in  different  components. 

Describe  the  labels  on  the  final  minimum  spanning  tree. 

This  is  our  suggested  procedure  for  ensuring  no  circuits.  How  many  compar¬ 
isons  does  it  require?  First  sort  the  edges  f\,  fi,  ■  ■  ■  ,  //v  of  Q  so  that  w(f\)  < 
w(f2)  <  •  •  •  <  w(fN).  Using  an  efficient  algorithm  such  as  the  merge  sort  dis¬ 
cussed  in  Section  8.3,  this  can  be  accomplished  with  0(N  log  N )  comparisons. 
The  algorithm  selects  e\  =  f\ ,  compares  the  labels  of  the  ends  of  e\ ,  and  changes 
one  of  these  labels.  Then  e2  =  fi  and,  after  one  more  comparison,  changes  some 
labels.  The  algorithm  then  looks  at  /)  and  compares  the  labels  of  its  end  vertices 
to  see  if  it  is  eligible  for  selection.  It  continues  successively  to  examine  f$,  f 5, .. . 
until  n  —  1  edges  have  been  selected.  Each  time  an  edge  is  examined,  one  com¬ 
parison  of  end  labels  is  required.  In  all,  this  part  of  the  algorithm  could  require  as 
many  as  N  comparisons.  Also,  for  each  of  the  n  —  1  edges  which  are  eventually 
selected,  it  is  necessary  to  check  the  labels  on  n  —  2  vertices  to  see  which  must  be 
changed.  This  contributes  a  further  ( n  —  1  ){n  —  2)  comparisons.  By  the  approach 
outlined  here,  Kruskal’s  algorithm  is  0(N  log  N  +  N  +  nr)  =  0(N  log  N  +  «2). 
In  fact,  the  rule  by  which  the  labels  of  vertices  are  changed  can  be  modified  so 
that  Kruskal's  algorithm  can  be  completed  with  just  0(N  log  N)  comparisons. 
(See  Exercise  12.) 

We  turn  now  our  attention  to  another  algorithm,  a  popular  one,  for  finding  a 
minimum  spanning  tree.  Prim’s  algorithm  is  like  Kruskal’s  insofar  as  it  selects  an 
edge  of  least  weight  at  each  stage.  This  time,  however,  instead  of  merely  guarding 
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against  circuits,  the  algorithm  requires  that  the  subgraph  determined  by  selected 
edges  is  always  a  tree. 


Figure  12.17 


An  application  of  Prim’s  algorithm  might  begin  with  vertex  K  and  then  select,  in  this  order, 
edges  KL,  LG,  CM,  MF,  FE,  DE,  KC,  BC,  JL,  AJ ,  JH,  AI. 


EXAMPLE  3 


12.4.2  PRIM'S 
ALGORITHM 


We  illustrate  Prim’s  approach  as  it  would  apply  to  the  graph  of  Fig  12.16,  which 
we  have  shown  again  in  Fig  12.17.  Start  at  any  vertex,  say  K,  and  select  the  edge 
incident  with  K  of  least  weight.  Thus,  we  select  edge  A'L  of  weight  3.  The  next 
edge  chosen  is  the  edge  of  least  weight  adjacent  to  KL.  Any  of  LG,  KM,  or 
KC  is  eligible.  Suppose  we  select  LG.  Then  we  select  the  edge  of  least  weight 
among  those  edges  forming  a  tree  with  the  two  edges  already  selected  ( GM ,  for 
instance).  We  continue  to  select,  in  succession,  the  edge  of  least  weight  among 
those  edges,  as  yet  unselected,  which  form  a  tree  with  those  already  chosen  and 
continue  until  we  have  a  spanning  tree.  You  should  verify  that  the  spanning  tree 
shown  in  Fig  12.17  is  one  which  Prim’s  algorithm  might  identify.  ▲ 


To  find  a  minimum  spanning  tree  in  a  connected  weighted  graph  with  n  >  1 
vertices,  proceed  as  follows. 


Step  1.  Choose  any  vertex  v  and  let  e\  be  an  edge  of  least  weight  incident  with 
v.  Set  it  =  1. 


Step  2.  While  k  <  n 

if  there  exists  a  vertex  which  is  not  in  the  subgraph  T  whose  edges  are 
ci ,  <?2, . . .  ,ek, 

•  let  ek+\  be  an  edge  of  least  weight  among  all  edges  of  the  form 
ux,  where  a  is  a  vertex  of  T  and  x  is  a  vertex  not  in  T ; 

•  replace  k  by  k  +  1 ; 

else  output  e\,e2,  ■  •  ■  ,e^  and  stop, 
end  while 


Perhaps  the  best  way  to  see  that  Prim’s  algorithm  succeeds  in  finding  a  min¬ 
imum  spanning  tree  in  a  connected  graph  Q  is  to  prove  that  at  each  stage  the 
algorithm  has  found  a  tree  which  is  a  subgraph  of  some  minimum  spanning 
tree.  This  is  obvious  at  the  beginning  since  any  vertex  is  a  tree  and  a  sub¬ 
graph  of  any  (minimum)  spanning  tree  of  Q.  Suppose  that  after  k  steps,  the 
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algorithm  has  produced  a  tree  T  that  is  a  subgraph  of  a  minimum  spanning 
tree  Tm  for  the  graph.  At  the  next  stage,  the  algorithm  produces  another  sub¬ 
graph  T\  by  adding  a  vertex  x  and  an  edge  ux  to  T,  where  the  weight  of 
ux  is  least  among  all  edges  joining  vertices  x  £  T  to  vertices  u  e  T .  Since 
T  contains  no  circuits  and  the  additional  vertex  x  is  incident  with  only  one 
edge  in  Tj,  the  subgraph  T\  also  contains  no  circuits.  It  is  also  connected  and 
hence  a  tree.  Why  is  it  contained  in  a  minimum  spanning  tree  for  Ql  This  is 
surely  the  case  if  it  is  contained  in  Tm.  If  it  is  not  contained  in  Tm,  then  the 
subgraph  Tm  with  edge  ux  adjoined  contains  a  circuit,  uxx\ X2  ■  ■  ■  xru.  Such  a 
circuit  must  contain  an  edge  of  the  form  yw,  with  y  £  T  and  w  e  T.6  It 
follows  that  w(ux)  <  w(yw),  since  yw  was  also  eligible  for  selection  by  the 
algorithm  when  ux  was  chosen.  Thus,  the  spanning  tree  (Tm  U  {ux))  \  {yut}, 
having  weight  not  exceeding  that  of  Tm,  is  also  a  minimum  spanning  tree,  and 
it  contains  T\.  By  the  Principle  of  Mathematical  Induction,  it  follows  that  each 
of  the  subgraphs  produced  by  Prim’s  algorithm  is  a  tree  contained  in  a  mini¬ 
mum  spanning  tree.  In  particular,  the  last  tree  selected  is  itself  a  minimum  span¬ 
ning  tree. 

We  now  show  how  to  implement  Prim’s  algorithm  with  complexity  0(n^) 
and  leave  to  the  exercises  an  implementation  which  is  0(n2).  (See  Exercise  13.) 
Recalling  that  Kruskal’s  algorithm  is  0(N  log  N  +  n2),  where  N  is  the  number  of 
edges,  we  see  that  the  choice  between  Kruskal’s  and  Prim’s  algorithms  depends 
primarily  on  the  relative  sizes  of  N  and  n. 

Suppose  Q  is  a  connected  graph  with  n  vertices.  In  Prim's  algorithm,  we 
select  a  vertex  v  and  find  the  minimum  weight  of  the  potentially  n  —  1  edges 
incident  with  v.  For  this,  at  most  n  —  2  comparisons  are  required.  Suppose  vu 
is  the  edge  selected.  Next,  we  find  the  edge  of  least  weight  among  those  which 
are  incident  with  u  or  with  v  (excluding  the  edge  vu).  In  the  worst  case,  in 
which  every  vertex  is  adjacent  to  each  of  these  vertices,  we  would  have  to  find 
the  minimum  of  2 (n  —  2)  numbers,  a  process  requiring  2 (n  —  2)  —  1  compar¬ 
isons.  After  k  edges  (and  k  +  1  vertices)  have  been  selected,  we  seek  the  edge 
of  least  weight  among  those  incident  with  one  of  the  vertices  already  selected. 
In  the  worst  case,  the  remaining  n  —  (k  +  1)  vertices  are  adjacent  to  each  of 
the  first  k  +  1  and  we  must  find  the  minimum  of  (k  +  1)(«  —  k  —  1)  num¬ 
bers.  This  would  require  (k  +  l)(n  —  k  —  1)  —  1  comparisons.  In  all  then,  we 
require 


n—2 


n—2 


y~^[(k  +  l)(n  —  k  —  1)  —  1]  =  ^(nk  +  n  —  —  2k  —  2) 


k= 0 

comparisons.  Recalling  that 
•y-'  j.  _  n(n  +  1) 
k= 0  2 


k= 0 


and  ^  k 2  = 


k=  0 


n(n  +  1 )  (2/i  +  1) 

6  ' 


6The  edge  yw  is  xx\  it'  .vj  e  T:  otherwise,  x |  X2  if  X2  €  T;  and  so  on.  This  process  eventually  leads 
to  edge  yw,  as  described,  because  the  last  vertex  u  is  in  T. 
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we  find  that  the  number  of  comparisons  is 

(n  —  2){n  —  1)  (n  —  2){n  —  1)(2«  —  3) 

n - - - 1 -n(n  -  1) - - - 


-2 


(n 


2 )(n  -  l) 
2 


—  2  (n 


1) 


7 

—  -n  +  1. 
6 


The  procedure  is  0(n3). 

A  Bound  for  a  Minimum  Hamiltonian  Cycle 

There  is  an  interesting  connection  between  the  Traveling  Salesman’s  Problem 
(Section  10.4)  and  minimum  spanning  trees  in  a  weighted  graph.  Recall  that  the 
Traveling  Salesman’s  Problem  is  to  find  a  Hamiltonian  cycle  of  minimum  weight 
in  a  weighted  Hamiltonian  graph.  Our  ability  to  find  minimum  spanning  trees  gives 
us  a  way  to  determine  lower  bounds  for  the  weight  of  a  minimum  Hamiltonian 
cycle. 

Suppose  Q  is  a  weighted  Hamiltonian  graph  with  Hamiltonian  cycle  H. 
Removing  any  vertex  v  of  Q  (and  the  edges  with  which  it  is  incident)  gives 
us  a  subgraph  Q'  of  Q  and  a  subgraph  H'  of  7i  which  is  a  spanning  tree  for  in 
fact,  H'  is  a  path  through  all  vertices  of  Q  except  v.  It  follows  that  the  weight  of 
H  is  the  weight  of  TC  plus  the  sum  of  the  weights  of  the  two  edges  in  H  incident 
with  v.  Hence, 


wCH)  >  u>(minimum  spanning  tree  for  Q') 

+  smallest  sum  of  weights  of  two  edges  incident  with  v. 


By  choosing  several  different  vertices  v,  we  can  obtain  various  lower  bounds  for 
the  weight  of  the  Hamiltonian  cycle  H  and  so,  in  particular,  for  the  weight  of  a 
minimum  Hamiltonian  cycle. 

Considering  again  the  graph  in  Fig  12.16  (which  is  Hamiltonian),  if  we 
remove  vertex  H  (and  the  three  edges  with  which  it  is  incident),  it  is  easily 
checked  that  all  the  edges  of  the  spanning  tree  indicated  in  Fig  12.16,  except  for 
H  J,  comprise  a  minimum  spanning  tree  for  the  subgraph,  of  weight  39  —  3  =  36. 
The  smallest  two  weights  of  edges  incident  with  H  are  3  and  5.  Therefore,  we 
obtain  36  +  3  +  5  =  44  as  a  lower  bound  for  the  weight  of  any  Hamiltonian  cycle. 

Suppose  we  remove  K  instead  of  H.  This  time,  the  heavy  edges  in  Fig  12.18 
form  a  minimum  spanning  tree  in  the  subgraph,  of  weight  38.  The  two  smallest 
weights  of  edges  incident  with  K  are  3  and  4,  so  we  obtain  38  +  3  +  4  =  45 
as  a  lower  bound  for  the  weight  of  any  Hamiltonian  cycle,  a  better  estimate  than 
before. 


Find  a  Hamiltonian  cycle  in  the  graph  of  Fig  12.16.  What  is  its  weight? 


Answers  to  Pauses 


11.  Each  time  an  edge  is  added,  we  changed  the  labels  on  its  end  vertices  to  the 
lower  of  the  two  previous  labels  (and  all  the  vertices  in  a  certain  component 
to  the  lower  as  well).  All  vertices  are  eventually  labeled  1. 
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Figure  12.18  A  graph  Q  and  a  spanning  tree  for  Q  \  {A-)  with  weight  38. 

12.  / H  J ABCDEFMKLGI  is  a  Hamiltonian  cycle  of  weight  75.  Note  how  the 
path  J  A  ■■■  GI  which  results  when  H  is  removed  is  a  spanning  tree  for  the 
subgraph  without  H,  and  how  the  path  LG  ■  ■  ■  FM  which  results  when  K  is 
removed  is  a  spanning  tree  for  the  subgraph  without  K. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Use  Kruskal’s  algorithm  to  find  a  spanning  tree  of  min¬ 
imum  total  weight  in  each  of  the  graphs  in  Fig  12.19. 
Give  the  weight  of  your  minimum  tree  and  show  your 
steps. 

2.  [BB;  (a)]  For  each  of  the  graphs  of  Exercise  1,  apply 
Prim's  algorithm  to  find  a  minimum  spanning  tree.  Try 
to  find  a  tree  different  from  the  one  found  before.  Start 
at  vertex  E  in  each  graph,  explain  your  reasoning,  and 
draw  the  tree. 

3.  [BB]  Find  a  minimum  spanning  tree  for  the  graph  in 
Fig  12.13.  What  is  the  smallest  length  of  pavement  re¬ 
quired  to  connect  the  towns  in  this  graph? 

4.  (a)  Explain  how  Kruskal’s  algorithm  could  be  modified 

so  that  it  finds  a  spanning  tree  of  maximum  weight 
in  a  connected  graph. 

(b)  Apply  the  modified  algorithm  described  in  (a)  to  find 
a  maximum  spanning  tree  for  each  of  the  following 
graphs  and  give  the  maximum  weight  of  each. 

i.  [BB]  the  graph  in  Exercise  1(a) 

ii.  the  graph  in  Exercise  1  (b) 

iii.  the  graph  in  Exercise  1(c) 

iv.  the  graph  in  Exercise  1(d) 

v.  the  graph  in  Fig  12.16 


5.  Repeat  Exercise  4  for  Prim’s  algorithm.  For  each  part 
of  (b),  start  at  vertex  A. 

6.  [BB]  Explain  how  a  minimum  spanning  tree  algorithm 
can  be  used  to  find  a  spanning  tree  in  a  connected  graph 
where  the  edges  are  not  weighted. 

7.  How  could  a  minimum  spanning  tree  algorithm  be  used 
to  find  a  spanning  tree  in  an  unweighted  graph  which 
excludes  a  given  edge,  assuming  such  spanning  trees 
exist? 

8.  (a)  [BB]  Suppose  we  have  a  connected  graph  Q  and 

we  want  to  find  a  spanning  tree  for  Q  which  con¬ 
tains  a  given  edge  e.  How  could  Kruskal’s  algorithm 
be  used  to  do  this?  Discuss  both  the  weighted  and 
unweighted  cases. 

(b)  Use  Kruskal's  algorithm  to  show  that  if  Q  is  a  con¬ 
nected  graph,  then  any  (not  necessarily  connected) 
subgraph  which  contains  no  circuits  is  part  of  some 
spanning  tree  for  Q.  Consider  both  the  weighted  and 
unweighted  cases. 

9.  Answer  Exercise  8  using  Prim’s  algorithm  instead  of 
Kruskal’s. 

10.  [BB]  Prove  that  at  each  vertex  v  of  a  weighted  connected 
graph,  Kruskal’s  algorithm  always  includes  an  edge  of 
lowest  weight  incident  with  v. 
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(a)  [BB]  (b) 

4 


Figure  12.19  Graphs  for  Exercise  1 . 


A 


11.  (a)  Prove  that  any  minimum  weight  spanning  tree  in 

a  connected  weighted  graph  can  be  obtained  by 
Kruskal’s  algorithm.  [Hint'.  Look  at  the  argument 
on  p.  384  showing  that  Kruskal’s  algorithm  selects 
a  minimum  spanning  tree.] 

(b)  [BB]  If  all  the  weights  in  a  connected  weighted 
graph  are  distinct,  show  that  the  graph  has  a  unique 
spanning  tree  of  minimum  weight. 

12.  In  our  discussion  of  the  complexity  of  Kruskal's  algo¬ 
rithm,  we  suggested  a  certain  system  of  labeling  vertices 
(p.  385).  Consider  the  following  modification. 

Suppose  that  vertex  x  is  labeled  k,  vertex  y  is  labeled 
£,  and  that  we  have  just  selected  edge  xy.  Compare 
the  relative  sizes  of  the  components  to  which  x  and 
y  belong  and  change  the  labels  of  all  vertices  in  the 
smaller  of  these  components  to  the  label  of  the  larger.  (If 
the  components  have  equal  size,  label  with  the  smaller 
number.) 

With  reference  to  Example  2,  p.  383,  we  begin  with  13 
components  each  containing  a  single  vertex.  After  the 
selection  of  A  J,  component  1  contains  two  vertices — 
A  and  J — and  component  10  is  empty;  the  remaining 
components  are  unchanged.  Figure  12.20  shows  how  the 
components  change  as  the  edges  GM,  DE,  MF,  and 
FE  are  successively  selected. 

(a)  Continue  the  scheme  shown  in  this  figure  to  show 
the  relabeling  of  vertices  until  the  spanning  tree  of 
Fig  12.16  is  obtained. 


(b)  [BB]  Show  that  in  the  general  case  the  number  of 
times  the  label  on  any  vertex  is  changed  is  at  most 
log2  n,  n  the  number  of  vertices  of  Q. 

(c)  [BB]  Show  that  Kruskal’s  algorithm  can  be  imple¬ 
mented  with  a  number  of  comparisons  which  is 
0(N  log  AO,  N  the  number  of  edges  of  Q. 

13.  In  the  implementation  of  Prim’s  algorithm  discussed  on 
p.  387  it  is  not  really  necessary  at  each  stage  to  find  the 
minimum  of  so  many  numbers.  Consider  the  following 
alternative.  Select  any  vertex  iq  and  label  every  other 
vertex  v  with  the  weight  of  edge  vv\  if  vv\  is  an  edge 
and  oo  otherwise.  Then  select  edge  v\V2  such  that  the 
label  on  V2  is  a  minimum.  For  each  vertex  v  adjacent  to 
V2,  change  the  label  of  v  to  the  minimum  of  its  old  label 
and  the  weight  of  edge  W2 .  Select  1)3  with  minimum 
label  and  add  the  corresponding  edge  (vyvi  or  1131)2). 
The  first  graph  shows  the  labels  on  the  vertices  of  the 
graph  of  Fig  12.16  just  after  v\  =  K  has  been  selected; 
the  second  graph  depicts  the  situation  just  after  V2  has 
been  selected. 
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Figure  12.20  Components  not  listed  are  empty. 


(a)  Show  the  labels  just  after  V3  =  G  has  been  selected. 

(b)  Explain  carefully  how  the  process  started  here 
should  be  continued  to  produce  the  next  edge.  Illus¬ 
trate  by  showing  the  labels  on  the  graph  just  after 
V4  has  been  chosen. 

(c)  Show  that  the  implementation  of  Prim’s  algorithm 
described  here  is  0(n2)  (in  terms  of  comparisons). 


14.  By  considering  the  subgraphs  determined  by  deleting 
each  of  the  vertices,  find  the  best  (greatest)  lower  bound 
for  the  weight  of  a  minimum  Hamiltonian  cycle.  Find  a 
minimum  Hamiltonian  cycle. 


15.  This  exercise  concerns  the  graph  in  Exercise  1(a). 

(a)  [BB]  Try  to  find  a  Hamiltonian  cycle  of  lowest 
weight.  What  is  the  weight? 
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(b)  [BB]  By  removing  vertex  A,  use  the  method  dis¬ 
cussed  in  this  section  to  find  a  lower  bound  for  the 
weight  of  any  Hamiltonian  cycle. 

(c)  [BB]  Repeat  part  (b),  removing  vertex  B  instead 
of  A. 

(d)  Try  to  improve  (increase)  the  lower  bounds  found 
in  parts  (b)  and  (c)  by  removing  some  other  vertex. 


16.  Repeat  Exercise  15  for  the  graph  in  Exercise  Kb).  Use 
vertices  D  and  F  in  parts  (b)  and  (c). 

17.  Repeat  Exercise  15  for  the  graph  in  Exercise  1(c).  Use 
vertices  B  and  T  in  parts  (b)  and  (c). 

18.  Repeat  Exercise  15  for  the  graph  in  Exercise  1(d).  Use 
vertices  E  and  G  in  parts  (b)  and  (c). 


12.5  Acyclic  Digraphs  and  Bellman's  Algorithm 

A  (directed)  graph  is  acyclic  if  it  contains  no  (directed)  cycles.  For  example,  the 
digraph  on  the  left  of  Fig  12.21  is  acyclic,  but  the  one  on  the  right  is  not — there 
is  a  cycle  around  the  outer  vertices.  Is  there  any  easy  way  to  see  that  there  are 
no  cycles  in  Q,  on  the  left?  This  is  where  the  vertex  labels  help.  The  digraph  has 
been  labeled  so  that  VjVj  is  an  arc  only  when  i  <  j .  For  example,  there  are  no 
arcs  of  the  form  xvo,  the  only  arc  jcvi  is  i!q v \ ,  the  only  arcs  XV2  are  with  x  =  vq 
and  x  =  v\,  and  so  on.  So  it  is  clear  why  there  is  no  cycle:  In  any  path,  the 
subscripts  on  the  vertices  v,-  must  increase,  so  there  is  never  an  arc  on  which  to 
return  to  the  first  vertex.  A  labeling  of  the  vertices  such  as  that  shown  for  Q  has 
a  special  name. 


v2  V\ 


Figure  12.21 


The  digraph  on  the  left  is  acyclic;  the  digraph  on  the  right  is  not. 


12.5.1  DEFINITION 


A  labeling  vo,  v\,  . . .  ,  vn~\  of  the  vertices  of  a  digraph  is  called  canonical  if 
every  arc  has  the  form  vivj  with  i  <  j.  We  also  refer  to  the  list  i>o,  iq, . . .  ,  vn-\ 
as  a  canonical  ordering  of  the  vertices. 


Here’s  why  a  digraph  with  a  canonical  labeling  of  vertices  must  be  acyclic:  If 
v, ,  Vj2  ■  ■  ■  Vjk  is  a  walk,  then  i \  <  /T  <  •  •  •  <  4>  s°  such  a  walk  cannot  be  closed.  It 
is  interesting  that  the  converse  of  this  statement  is  also  true.  Any  acyclic  digraph 
has  a  canonical  labeling  of  vertices.  To  prove  this,  we  require  a  preliminary 
result. 


12.5.2  LEMMA 


Let  S  be  a  nonempty  set  of  vertices  of  an  acyclic  digraph  Q.  Then  there  exists 
v  G  S  such  that  Sv  —  {.r  €  S  \  xv  is  an  arc]  =  0. 


Proof  We  establish  this  result  by  contradiction.  Assume  the  result  is  false.  Thus,  for 
each  v  e  S,  there  is  an  arc  of  the  form  xv  with  x  e  S.  Let  i>o  be  any  vertex  in  S 
and  let  v\Vq  be  an  arc,  iq  e  S.  There  exists  an  arc  V2V\,  V2  e  S,  and,  in  general, 
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having  found  vertices  v\,V2, ...  ,  v*  in  S  with  u,+ iiy  an  arc,  i  =  1,2,...  ,  k  —  1, 
there  exists  an  arc  Vk+\Vk  with  Vk+\  €  <S.  Since  £/  has  only  a  finite  number  of 
vertices,  eventually,  v^+i  e  {tq,  v%, ...  ,  iq},  say  Vk+\  =  v,- ,  1  <  i  <  k.  If  k  is 
minimal  with  this  occurring,  t^+iiq-tq-i  ■  ■  ■  v,  is  a  cycle,  contradicting  the  fact 
that  Q  is  acyclic.  I 

This  is  key  to  the  proof  of  a  theorem  which  characterizes  acyclic  digraphs. 


12.5.3  THEOREM 


A  digraph  is  acyclic  if  and  only  if  it  has  a  canonical  labeling  of  vertices. 


Proof  We  gave  the  proof  of  sufficiency  prior  to  the  statement  of  Lemma  12.5.2.  For 
necessity,  suppose  Q  is  an  acyclic  digraph  with  n  >  1  vertices  and  vertex  set 
V.  Applying  the  lemma  to  S  =  V,  we  obtain  a  vertex  vq  with  the  property  that 
there  are  no  arcs  of  the  form  Jtwo-  Applying  Lemma  12.5.2  to  S  =  V  \  {i>o},  we 
obtain  a  vertex  v\  with  the  property  that  there  are  no  arcs  of  the  form  xv\  with 
x  e  S;  that  is,  the  only  possible  arc  xv\  is  with  x  =  vq-  Now  apply  the  lemma  to 
S  =  V  \  { I'o ,  v  i ) .  We  obtain  a  vertex  V2  with  the  property  that  the  only  possible 
arcs  of  the  form  XV2  are  vqV2  and  v\V2-  Continuing  in  this  manner,  we  obtain 
vertices  no,  v\, . . .  ,  vn-  >  with  the  property  that  for  any  j,  the  only  arcs  xvj  are 
with  x  —  Vi  and  i  <  j.  Thus  vq,  vi,  . . .  ,  vn-\  is  a  canonical  labeling.  I 

We  want  to  discuss  a  second  algorithm  to  which  the  name  of  Richard  Bellman 
is  attached. 

^ |  Name  the  first  algorithm  described  in  this  book  to  which  Bellman’s  name  is 
attached. 


The  algorithm  we  have  in  mind  takes  as  input  an  acyclic  digraph  whose 
vertices  have  been  labeled  canonically,  together  with  a  specified  vertex  v,  and 
outputs  the  lengths  of  shortest  paths  from  v  to  every  other  vertex.  The  algorithm 
also  describes  the  shortest  path  from  v  to  any  other  vertex  by  means  of  a  rooted 
spanning  tree.  Remember  that  a  root  in  a  tree  is  simply  a  designated  vertex.  In  a 
digraph,  the  concept  is  stronger.  There  is  only  one  possible  root. 


12.5.4  DEFINITION 


A  digraph  T  is  a  rooted  tree  with  vertex  v  as  the  root  if  the  unoriented  graph 
(ignoring  orientations  of  arcs)  is  a  tree,  if  v  has  indegree  0,  and  v  is  the  only  such 
vertex  of  T. 


For  example,  if  the  edges  of  the  tree  shown  in 
Fig  12.2  are  directed  down,  then  we  have  a  rooted 
tree  with  “start”  as  root.  On  the  other  hand,  the 
digraph  to  the  right  is  not  a  rooted  tree  because 
there  is  more  than  one  vertex  of  indegree  0. 

In  Proposition  12.1.2,  we  showed  that  any  two  vertices  of  a  tree  are  joined 
by  a  unique  path.  This  is  clearly  not  the  case  if  the  edges  of  the  tree  are  oriented 
(and  we  require  paths  to  respect  orientation).  There  is.  however,  a  unique  path 
from  the  root  to  every  other  vertex. 
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12.5.5  PROPOSITION 


Proof 


12.5.6  COROLLARY 


Proof 


12.5.7  BELLMAN'S 
ALGORITHM 


In  a  rooted  (directed)  tree  with  root  v,  there  is  a  unique  path  from  v  to  every  other 
vertex. 

Uniqueness  is  straightforward:  If  there  were  two  different  directed  paths  V\,  V2 
from  v  to  some  vertex  u,  then,  ignoring  orientations,  we  would  have  two  different 
paths  from  n  to  u,  contradicting  the  fact  that  the  undirected  graph  T  is  a  tree.  We 
now  show  existence  of  a  directed  path  from  v  to  any  other  vertex.  So,  let  u  be 
a  vertex  of  T  different  from  v.  Since  indeg u  ^  0,  there  is  a  directed  edge  u\u. 
If  ni  =  v,  we  have  a  path  from  v  to  «;  otherwise,  since  indeg u\  ^  0,  we  have 
a  directed  edge  K2«i-  Note  that  1J2  7^  u  since  the  arc  between  u  and  u\  is  u\u. 
Thus  U2U[U  is  a  path  and  if  112  =  v  we  are  done.  In  general,  suppose  k  >  2  and 
we  have  found  a  path  UkUk- 1  ■  ■  •  u\u  with  no  n,  =  v.  Since  indeg w*  7^  0,  there 
is  a  directed  arc  Uk+\Uk-  Also,  Uk+ 1  £  {u,  u\,  112, . . .  ,  Uk )  because  the  undirected 
graph  T  has  no  cycles.  Thus  Uk+\Uk  •  •  •  u\u  is  a  path  and,  if  Uk+\  =  v ,  we  have 
a  directed  path  from  v  to  u.  Since  our  tree  is  finite,  this  process  cannot  continue 
indefinitely,  so  eventually  some  Uk  =  v.  I 


In  a  rooted  (directed)  tree,  every  vertex  other  than  the  root  has  indegree  1 . 

Let  T  be  a  rooted  tree  with  root  v  and  suppose  u  ^  v  is  a  vertex  with  indeg  «  /  1 . 
Since  indeg u  ^  0,  we  have  indeg u  >  2,  so  there  are  distinct  arcs  u\it  and  u\u. 
The  proposition  says  there  is  a  path  V  from  v  to  u\  and  also  a  path  V  from  v  to 
u\.  Thus  we  obtain  two  different  paths  from  v  to  u  (with  final  arcs  u\u  and  u\u). 
This  contradicts  uniqueness  of  a  directed  path  from  v  to  u.  I 


Given  a  weighted  acyclic  digraph  Q  with  nonnegative  weights  and  a  canonical 
labeling  vo,  vi, .. .  ,  v„  1,  to  find  the  shortest  paths  from  i>o  to  every  other  vertex, 

Step  1.  Set  do  =  0  and  d,  =  00  for  /  =  1,  2, . . .  ,  n  —  1 . 

Set  pi  =  —  1  for  i  =  0,  1 , . . .  ,  n  —  1 . 

Step  2.  For  t  from  1  to  n  —  1, 

let  dt  =  min {dj  +  w(Vj,vt)  \  j  =  0, . . .  ,  t  —  1 }  and  let  p,  be  a  j  which  gives 
this  minimum. 

As  with  previous  shortest  path  algorithms,  a  final  value  dt  =  00  simply 
indicates  that  there  is  no  directed  path  from  no  to  v, . 

Consider  the  acyclic  canonically  labeled  graph  Q  with  arcs  weighted  as  shown 
on  the  left  in  Fig  12.22.  We  show  the  values  of  the  d \  and  p,  (p  for  predecessor) 
at  the  beginning,  and  after  each  iteration  of  the  loop  in  Step  2.  The  final  values 
of  d\  and  p\  are  2  and  0,  respectively.  Thus  the  shortest  distance  to  iq  has  length 
2,  and  the  last  arc  on  a  shortest  path  is  vqv\.  The  final  values  of  do  and  p2  are 
4  and  1,  respectively,  showing  that  the  shortest  path  to  vo  has  length  4,  and  the 
last  arc  on  the  corresponding  path  is  v\V2-  The  final  values  of  do,  and  773  are  2 
and  0,  respectively,  asserting  the  facts  that  a  shortest  path  to  U3  has  length  2  and 
V0V3  is  the  last  arc  on  such  a  path.  On  the  right  in  Fig  12.22,  we  show  just  those 
arcs  which  are  used  last  on  shortest  paths  from  vq  to  each  vertex.  It  is  not  a 
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Figure  12.22 
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Answers  to  Pauses 


coincidence  that  these  form  a  spanning  tree  rooted  at  t>o-  In  the  Exercises,  we  ask 
you  to  explain  why. 

13.  In  Section  11.2,  we  met  the  Bellman-Ford  shortest  path  algorithm. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  For  each  of  the  digraphs  in  Fig  12.23,  either  show  that 
the  digraph  is  acyclic  by  finding  a  canonical  labeling  of 
vertices  or  exhibit  a  cycle. 

2.  Suppose  vo.  v\ . v„-i  is  a  canonical  labeling  of  a 

digraph  Q.  What  can  be  said  about  the  indegree  and 
outdegree  of  uq,  if  anything?  Explain. 

3.  [BB]  Find  an  0(n1 2 3 4 5)  algorithm  which  computes  the  inde¬ 
grees  of  a  digraph  with  n  vertices.  (The  addition  of  two 
numbers  is  the  basic  operation.) 

4.  (a)  Describe  an  algorithm  which  finds  a  canonical 

labeling  for  the  vertices  of  an  acyclic  digraph. 

(b)  Find  a  Big  Oh  estimate  for  your  algorithm  in  terms 
of  some  reasonable  basic  operation  (or  operations). 

(c)  Write  a  computer  program  which  implements  your 
algorithm. 

5.  The  algorithm  described  in  the  proof  of  Theorem  12.5.3 
can  in  fact  be  adapted  to  find  a  cycle  in  a  digraph  which 
is  not  acyclic. 


(a)  [BB ]  Explain  how  this  can  be  accomplished. 

(b)  Describe  an  algorithm  which  implements  your  idea. 

(c)  Write  computer  code  which  implements  your  algo¬ 
rithm. 

6.  How  many  shortest  path  algorithms  can  you  name?  How 
many  of  these  can  you  describe? 

7.  [BB]  Suppose  T  is  a  rooted  (directed)  tree  with  n  ver¬ 
tices.  How  many  arcs  does  T  have?  Why? 

8.  Let  Q  be  a  connected  graph  and  suppose  we  orient  the 
edges  in  such  a  way  that  the  digraph  we  obtain  has 
a  unique  vertex  of  indegree  0.  Must  this  digraph  be  a 
rooted  tree? 

9.  The  following  digraphs  are  acyclic,  and  canonical  label¬ 
ings  are  shown.  Apply  Bellman's  algorithm  to  each 
digraph  in  order  to  find  the  lengths  of  shortest  paths 
from  vq  to  each  other  vertex.  Find  a  shortest  path  to  v, 
and  the  predecessor  vertex  vPt . 
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Figure  12.23  Digraphs  for  Exercise  1. 


10.  Show  that  Bellman’s  algorithm  works;  that  is,  if  Q  is  an 

acyclic  digraph  with  vertices  vo,  vj . v„-\  canoni¬ 

cally  labeled,  then  the  final  value  of  dt  is  the  length  of 
a  shortest  path  from  vq  to  v,  and  there  is  a  path  of  this 
length  whose  final  arc  is  v{>lVi. 

11.  [BBJ  Suppose  we  apply  Bellman’s  algorithm  to  an  acyc¬ 
lic  digraph  with  a  canonical  ordering  of  vertices.  Explain 
why  the  arcs  vPl  v,  produce  a  spanning  tree  rooted 
at  v0. 

12.  Find  the  complexity  of  Bellman's  algorithm  (in  terms 
of  additions  and  comparisons). 

13.  [BB]  Explain  how  Bellman’s  algorithm  can  be  mod¬ 
ified  to  find  the  length  of  a  shortest  path  to  each  v, 
from  a  given  vertex  Vk  which  is  not  necessa¬ 
rily  vo- 

14.  [BB;  (a)]  For  each  of  the  digraphs  which  accompany 
Exercise  9,  find  the  shortest  distance  to  each  v,  and  the 
corresponding  value  of  p,. 

i.  when  paths  start  at  vi  and, 

ii.  when  paths  start  at  i>2. 
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REVIEW  EXERCISES  FOR  CHAPTER  12 


1.  Make  a  tree  which  displays  all  monotonically  increas¬ 
ing  subsequences  of  4.  7,  3,  9,  5,  1,8.  How  many  are 
there? 

2.  Let  Q  be  a  weighted  graph  with  vertices  W\,W2, ...  ,W(, 
such  that  WjWk  is  an  edge  if  and  only  if  j  +  k  is 
odd,  in  which  case  the  weight  of  the  edge  WjWt  is 
(max{y,  k})2. 

(a)  Identify  the  unweighted  graph  described. 

(b)  Using  a  tree,  solve  the  Traveling  Salesman’s  Prob¬ 
lem  for  Q. 

3.  (a)  Is  it  possible  for  a  tree  to  have  nine  vertices  and 

degree  sequence  4,  2,  2,  2,  2,  2,  2,  1,  1  ? 

(h)  Is  it  possible  for  a  tree  to  have  nine  vertices,  two  of 
which  have  degree  5? 

4.  Prove  that  the  following  conditions  are  equivalent  for  a 
tree  T  with  n  >2  vertices. 

i.  T  has  an  Eulerian  trail. 

ii.  T  has  exactly  two  vertices  of  degree  1 . 

iii.  T  has  exactly  n  —  2  vertices  of  degree  2. 

5.  (a)  Prove  that  up  to  isomorphism  there  is  only  one  unla- 

beled  tree  on  seven  vertices  with  a  vertex  of  degree 
5  and  another  of  degree  2.  Draw  a  picture  of  this 
tree. 

(b)  Label  the  tree  obtained  in  (a)  in  two  different  ways 
so  as  to  produce  nonisomorphic  labeled  trees. 

6.  How  many  spanning  trees  has  ACs?  Why? 

7.  Use  Kirchoff  s  Theorem  to  find  the  number  of  spanning 
trees.  Draw  pictures  of  them  all. 


8.  Is  it  possible  for  a  graph  to  have  exactly  two  spanning 
trees?  Explain. 

9.  In  Exercise  8  of  Section  12.3,  you  proved  that  the  num¬ 
ber  of  spanning  trees  of  the  complete  bipartite  graph 
K-2,n  is  n2"-1.  If  e  is  an  edge  of  /C2./2,  how  many  of 
these  spanning  trees  contain  e ?  Explain. 

10.  Use  Kruskal’s  algorithm  to  find  a  minimum  spanning 
tree  of  the  weighted  graph  shown.  What  is  the  weight 
of  a  minimum  spanning  tree? 


11.  Repeat  Exercise  10  using  Prim’s  algorithm. 

12.  Let  v\.  ih,  ,  v„  denote  the  vertices  of  the  complete 
graph  AC„,  n  >  3,  and  give  edge  v,Vj  weight  i  +  j.  Find 
a  minimum  spanning  tree  for  this  weighted  graph  and 
the  weight  of  such  a  tree. 

13.  Determine  whether  or  not  each  of  the  digraphs  shown 
is  acyclic  by  finding  a  canonical  labeling  of  vertices  or 
exhibiting  a  cycle. 


14.  Change  the  direction  of  the  arrows  on  J F  and  FG  in 
each  digraph  of  Exercise  1 3  and  answer  the  questions 
again. 

15.  Can  an  acyclic  digraph  have  two  different  canonical 
orderings  of  vertices?  Explain. 
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16.  Given  that  the  digraph  is  acyclic,  find  a  canonical  order¬ 
ing  vq,  Vi .  of  the  vertices.  Then  find  the  lengths  of 
shortest  paths  from  the  root  to  all  other  vertices,  spec¬ 
ifying  the  predecessor  vertex  (as  defined  in  Bellman’s 
algorithm)  in  each  case  when 

(a)  the  root  is  i>o, 

(b)  the  root  is  i>2. 


17.  Repeat  Exercise  16  using  the  digraph  shown  and  root 
vertices  vq  and  v\. 
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13.1  Depth-First  Search 


13.1.1  THE 
DEPTH-FIRST  SEARCH 
ALGORITHM 


In  this  short  chapter,  we  describe  a  procedure  called  depth-first  search ,  which  is 
simple,  efficient,  and  the  basis  for  a  number  of  important  computer  algorithms  in 
graphs.  It  quickly  tests  whether  or  not  a  graph  is  connected,  it  produces  a  spanning 
tree  in  the  connected  case,  and  it  is  the  key  to  solving  a  problem  about  turning 
graphs  into  digraphs. 

Let  Q  be  a  graph  with  n  vertices. 


Step  1.  Choose  any  vertex,  label  it  1 .  Set  k  =  1 . 


Step  2.  While  there  are  unlabeled  vertices 

if  there  exists  an  unlabeled  vertex  adjacent  to  k ,  assign  to  it  the 
smallest  unused  label  l  from  the  set  { 1 , 2, ....  «}  and  set  k  =  i. 
else  if  k  =  1,  stop; 

else  backtrack  to  the  vertex  l  from  which  k  was  labeled  and  set 
k  =  i. 
end  while 


mmZmM  Suggest  a  way  that  a  machine  could  keep  track  of  the  labels  1,2,...  ,n  that  have 
been  used  as  this  algorithm  proceeds. 

When  the  algorithm  terminates,  there  is  a  path  from  vertex  1  to  each  labeled 
vertex  which  uses  only  edges  required  by  the  algorithm  (Exercise  3).  Thus,  if  the 
algorithm  successfully  labels  all  n  vertices  of  the  graph  Q.  the  graph  must  have 
been  connected.  The  converse  is  also  true  (Exercise  4).  Thus,  depth-first  search  is 
an  algorithm  which  tests  connectedness  in  a  graph. 


EXAMPLE  1 


We  consider  how  the  depth-first  search  algorithm  might  proceed  with  the  graph 
in  Fig  13.1(a).  Starting  at  vertex  1,  one  possible  forward  procedure  in  the 
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Figure  13.1  A  depth-first  search  and  the  corresponding  spanning  tree. 

depth-first  search  yields  the  labels  shown  in  Fig  13.1(a).  All  the  vertices  adja¬ 
cent  to  vertex  4  have  been  labeled  and  k  =  4  ^  1 ,  so  the  algorithm  backtracks 
first  to  3,  the  vertex  from  which  4  acquired  its  label.  All  vertices  adjacent  to  3 
have  also  been  labeled,  so  it  backtracks  to  2.  Vertex  2  has  an  unlabeled  adjacent 
vertex.  The  smallest  unused  label  to  date  is  5,  so  the  algorithm  would  label  5 
some  vertex  adjacent  to  2,  say  the  one  shown.  Vertex  5  has  an  adjacent  unla¬ 
beled  vertex,  so  this  gets  labeled  6.  The  current  assignment  of  labels  is  depicted 
in  Fig  13.1(b).  The  algorithm  now  backtracks  until  it  finds  a  vertex  with  an 
adjacent  unlabeled  vertex.  It  finds  vertex  1  in  this  way  and,  since  this  vertex 
has  adjacent  unlabeled  vertices,  the  algorithm  could  conceivably  assign  labels 
as  far  as  11,  as  shown  in  Fig  13.1(c).  Since  all  vertices  adjacent  to  11  have 
been  labeled,  the  algorithm  backtracks  through  vertex  10  to  vertex  9  and  then 
assigns  labels  12  and  13.  From  vertex  13,  the  algorithm  backtracks  to  12,  then 
to  9  (from  which  12  received  its  label),  then  to  8,  to  7  and  to  1,  after  which 
vertices  14  and  15  are  labeled.  Finally,  the  algorithm  backtracks  to  14,  then  to 
1,  and  stops.  The  final  labels  are  shown  in  Fig  13.1(d).  Since  15  labels  were 
used  and  the  graph  had  15  vertices,  the  graph  was  connected,  a  fact  admittedly 
obvious  by  sight,  but  not  to  a  computer.  A 

In  addition  to  testing  for  connectedness,  depth-first  search  has  another  use.  If 
the  graph  is  connected,  those  edges  which  were  used  in  the  search  form  a  spanning 
tree  for  the  graph  (see  Exercise  5).  Applied  to  the  graph  in  the  preceding  example, 
the  algorithm  produces  the  spanning  tree  shown  in  Fig  13.1(e). 
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For  the  graph  given  in  Fig  13.1(a),  carry  out  a  depth-first  search  different  from 
the  one  in  the  text  and  draw  the  corresponding  spanning  tree.  Start  at  vertex  1  as 
before. 

Show  that  the  complexity  function  for  depth-first  search  applied  to  a  graph  with 
n  vertices  is  0(n2).  [Hint:  Think  of  edges  instead  of  vertices.] 

One  critical  advantage  that  a  person  has  over  a  computer  in  solving  elementary 
graph  problems  is  sight.  The  reader  with  little  experience  writing  computer  pro¬ 
grams  may  well  wonder  from  time  to  time  at  the  rather  detailed  way  we  describe 
some  “obvious”  procedures.  When  a  person,  for  instance,  applies  the  depth-first 
search  algorithm  to  a  simple  graph  which  he  or  she  can  see,  some  ways  of  labeling 
the  vertices  are  much  more  sensible  than  others.  Furthermore,  a  person  can  see 
when  all  the  vertices  are  labeled  and  hence  stop  without  backtracking  to  vertex  1, 
as  required  by  the  algorithm. 

This  section  concludes  by  considering  a  problem  which  some  readers  may 
have  seen  previously  in  a  different  context.  Our  solution  involves  moving  without 
thought  around  the  vertices  of  a  graph  that  is  never  fully  seen,  just  the  way  a 
computer  does.  On  the  other  hand,  when  we  stop,  we  can  be  confident  that  we 
have  indeed  explored  every  vertex. 


PROBLEM  2.  An  innkeeper  has  a  full  eight-pint  flagon  of  wine  and  empty 
flagons  that  hold  five  and  three  pints,  respectively.  The  three  flagons  are  unmarked 
and  no  other  measuring  devices  are  available.  Explain  how  the  innkeeper  can 
divide  the  wine  into  two  equal  amounts  in  the  fewest  number  of  steps. 

Solution.  We  imagine  the  graph  whose  vertices  and  edges  are  as  follows:  There 
is  a  vertex  labeled  (a.  b,  c)  if  and  only  if  it  is  possible  for  the  innkeeper  to  hold, 
with  certain  knowledge,  a  pints  of  wine  in  the  eight-pint  flagon,  b  pints  in  the  five- 
pint  flagon,  and  c  pints  in  the  three-pint  flagon.  Thus,  vertex  (8.  0,  0)  corresponds 
to  the  initial  state.  There  is  an  edge  joining  (a,  b,  c)  and  ( a ',  b' ,  c')  if  it  is  possible 
to  reach  the  state  (a1,  b' ,  c')  from  ( a ,  b,  c),  or  vice  versa,  by  pouring  all  the  wine 
from  one  of  the  three  flagons  into  another.  For  example,  from  the  initial  state, 
the  innkeeper  can  reach  (3,  5,  0)  by  emptying  the  eight-pint  flagon  into  the  five- 
pint  flagon;  thus,  there  is  an  edge  between  (8,  0,  0)  and  (3,  5,  0).  We  explore  the 
vertices  of  this  graph  with  a  procedure  much  like  depth-first  search.  Continue 
from  (3,  5,  0).  Emptying  the  remainder  of  the  eight-pint  flagon  into  the  three-pint 
flagon  takes  us  to  (0,  5,  3),  emptying  the  five-pint  flagon  into  the  eight-pint  flagon 
takes  us  to  (5,  0,  3),  then  emptying  the  three -pint  flagon  into  the  five-pint  flagon 
takes  us  to  (5,  3,0).  (See  Fig  13.2.) 

We  continue  to  visit,  in  order,  vertices  (2,  3,  3),  (2,  5,  1),  (7,  0,  1),  (7,  1, 0), 
(4,  1, 3),  and  finally  (4,  4,  0).  This  state  is  what  the  innkeeper  sought;  it  has  been 
reached  in  ten  steps. 

Now  we  backtrack,  first  to  (4,  1,  3).  Emptying  the  contents  of  the  eight-pint 
flagon  into  the  five-pint  flagon  takes  us  to  a  previously  labeled  vertex  on  the  path 
to  (4,  1, 3),  namely,  (0,  5,  3).  If  we  could  get  from  (0,  5,  3)  to  (4,  1,3)  in  fewer 
than  seven  steps,  then  the  path  to  (4, 4,  0)  could  be  shortened,  but  that  issue 
will  be  decided  when  we  have  backtracked  to  (0,  5,  3)  and  can  be  ignored  for 
now.  Note,  however,  that  we  must  be  careful  because  an  edge  in  our  graph  only 
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Figure  13.2 


Answers  to  Pauses 


(7.0,  1) 


(4,  4.  0)  © 
(1,4,  3)  o 
0.5,  2)» 
(6,  0,  2)  o 
(6,  2,0)  o 
(3,2,3) 


'  (2,  5,  1) 

<>(2,3,3) 
o(5,  3,0) 

<>(5,0,  3) 

<f>  (0,  5,  3) 

>  (3,  5,  0) 

A  (8,  0,  0) 

The  spanning  tree  for  a  problem  of  flagons  and  wine. 


signifies  that  you  can  go  from  one  of  the  end  vertices  to  the  other,  not  necessarily 
in  both  directions  (see  Exercise  9).  We  continue  to  investigate  vertices  adjacent  to 
(4,  1,3).  Emptying  the  one  pint  remaining  in  the  five-pint  flagon  into  the  eight- 
pint  flagon  leads  to  (5,  0,  3),  again  a  vertex  already  labeled.  There  are  no  other 
vertices  adjacent  to  (4,  1,3),  so  we  backtrack  to  (7,  1,0).  We  leave  it  to  you 
to  confirm  that  all  vertices  adjacent  to  those  on  the  route  from  (7,  1,0)  back 
to  (3,  5,  0)  have  already  been  labeled  lower  on  the  path.  The  vertex  (3,  5,  0), 
however,  has  an  adjacent  vertex  not  yet  labeled,  namely,  (3,  2,  3).  From  this,  we 
move  in  order  to  (6,  2,  0),  (6,  0,  2),  (1, 5,  2),  (1, 4,  3),  and  then  (4,  4,  0).  We  have 
found  a  route  to  the  desired  state  which  requires  just  seven  steps.  Backtracking 
again  from  (4,  4.  0),  we  come  first  to  (1, 4,  3)  and  find  two  adjacent  vertices  on 
the  first  path — (5,  0,  3)  and  (0,  5.  3) — and  hence  two  more  routes  to  (4.  4.  0);  for 
example, 

(8,0,0)  (3,5,0)  ^  (3,2,3) 

(1,4,3)  ^  (5,0,  3)  -*  (5,3,0)  -* - *  (4,4,0). 

Since  each  of  these  routes  is  (much)  longer  than  the  current  shortest  route,  we  do 
not  display  them  on  the  graph.  Continuing  to  backtrack  from  (1,4,  3),  we  find 
that  (1, 5,  2)  is  adjacent  to  (0,  5,  3)  and  (6,  0,  2)  is  adjacent  to  (5,  0,  3),  but,  as 
before,  we  do  not  display  these  adjacencies  since  the  routes  they  give  to  (4,  4,  0) 
are  longer  than  the  current  shortest  route.  Eventually  we  reach  the  initial  vertex 
(8,  0,  0)  and  discover  that  all  adjacent  vertices  have  been  labeled,  so  we  know 
that  all  vertices  of  the  graph  have  been  visited.  We  also  discover  that  we  can 
pass  from  (8,  0,  0)  to  (5,  0,  3)  in  one  step,  giving  yet  another  route  to  the  desired 
vertex  (4,  4,  0),  a  route  requiring  eight  steps.  The  shortest  route,  however,  remains 
at  seven  steps.  I 


1.  Set  up  a  one-dimensional  array  of  length  n  (that  is,  a  1  x  n  matrix)  with 
all  entries  initially  0.  When  label  k  is  selected,  change  component  k  of  this 
vector  to  1 . 
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2.  One  possible  answer  is  given,  along  with  the  associated  spanning  tree. 


3.  Depth-first  search  proceeds  by  deciding  at  each  stage  either  to  proceed  along 
a  particular  edge  to  the  next  unlabeled  vertex  or  not  to  proceed  (because  the 
edge  joins  two  labeled  vertices).  If  all  edges  incident  with  a  given  vertex 
are  rejected,  then  the  algorithm  backtracks.  Hence,  focusing  our  attention  on 
the  edges,  each  edge  need  be  considered  at  most  twice.  (Note  that  once  an 
edge  has  been  rejected  or  used  in  backtracking,  it  is  never  considered  again.) 
Hence,  the  total  number  of  steps  is  at  most  twice  the  number  of  edges,  this 
number  being  at  most  (")  =  \n(n  —  1).  So  the  complexity  of  depth-first  search 
is  0(n2). 


EXERCISES 


The  symbol  [BB  j  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  For  each  of  the  following  graphs,  carry  out  a  depth-first 
search  starting  at  vertex  a.  Show  all  labels  and  list  those 
used,  in  order,  on  the  final  backtracking.  Highlight  any 
resulting  spanning  tree. 


2.  In  each  of  the  graphs  shown  in  Fig  13.3,  a  depth-first 
search  has  labeled  several  vertices.  Show  all  labels,  list 
in  order  those  used  on  the  final  backtracking,  and  show 
any  resulting  spanning  tree. 


Figure  13.3 


3.  Use  mathematical  induction  to  show  that  when  the  de¬ 
pth-first  search  algorithm  terminates,  there  is  a  path  from 
vertex  1  to  each  labeled  vertex  which  uses  only  edges 
required  by  the  algorithm. 
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4.  (a)  [BB]  Let  wbea  vertex  in  a  graph  Q  which  is  labeled 

by  the  depth-tirst  search  algorithm.  Prove  that  the 
algorithm  labels  all  the  vertices  adjacent  to  v. 

(b)  Prove  that  if  Q  is  connected,  the  depth-first  algo¬ 
rithm  labels  every  vertex  of  Q. 

5.  Suppose  the  depth-first  algorithm  is  applied  to  a  con¬ 
nected  graph  Q.  Prove  that  the  vertices  of  Q  together 
with  the  edges  used  by  the  algorithm  form  a  spanning 
tree.  [Hint:  Exercises  3  and  4.] 

6.  [BB]  Give  an  example  of  a  connected  graph  Q  which  has 
a  spanning  tree  not  obtainable  as  a  depth-first  spanning 
tree  for  Q. 

7.  (a)  [BB]  Explain  how  to  change  the  depth-first  search 

algorithm  so  that  it  can  count  the  number  of  con¬ 
nected  components  of  a  graph. 

(b)  Table  13.1  shows  the  adjacencies  in  a  graph  with 
vertices  v\,V2,  ■ . .  ,v\j.  Without  drawing  the 
graph,  apply  a  depth-first  search  and  use  the  method 
discovered  in  part  (a)  to  determine  the  number 
of  components.  Give  the  labels  which  the  search 
assigns  to  the  vertices. 

8.  Given  a  full  12-ounce  flagon  of  wine  and  two  empty 
flagons  holding  eight  and  five  ounces  respectively. 


explain  how  to  divide  the  wine  into  two  equal  six-ounce 
portions  in  the  fewest  number  of  steps.  The  flagons  are 
unmarked.  No  other  measuring  devices  are  at  hand. 

9.  [BB]  In  our  wine-pouring  Problem  2,  if  it  is  possible 
to  move  from  ( a,b,c )  to  ( a',b',c ')  in  a  single  step, 
is  it  always  possible  to  move  in  the  other  direction, 
(a',  b ',  c ')  to  (a,  b,  c),  in  a  single  step? 

10.  [BB]  What  might  the  phrase  breadth-first  search  mean? 
Design  such  a  procedure. 

11.  [BB;  (a)]  Apply  a  breadth-first  search  to  each  of  the 
graphs  shown  in  Exercise  1  (starting  at  vertex  a  in  each 
case). 

12.  Suppose  the  breadth-first  search  algorithm  is  applied  to 
an  arbitrary  graph  Q. 

(a)  Prove  that  when  the  algorithm  terminates,  there  is 
a  path  from  vertex  1  to  every  other  labeled  vertex 
using  only  edges  required  by  the  algorithm. 

(b)  Prove  that  if  Q  is  connected,  every  vertex  is  labeled. 

(c)  Prove  that  if  Q  is  connected,  then  the  vertices  of  Q 
together  with  the  edges  used  by  the  algorithm  form 
a  spanning  tree. 
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13.  Let  Q  be  a  connected  graph  with  n  >2  vertices,  one 
of  which  is  u.  Show  that  Q  has  a  spanning  tree  with 
the  property  that,  for  any  vertex  v,  the  edges  of  the 
tree  which  connect  u  and  v  define  a  shortest  path 
between  u  and  v.  [Hint:  Think  of  the  spanning  tree 


produced  by  the  breadth-first  search  algorithm.  See 
Exercise  10.] 

14.  Prove  that  the  complete  bipartite  graph  Ki,n  has  the 
property  that  every  spanning  tree  is  obtainable  as  a  de¬ 
pth-first  search  spanning  tree  if  and  only  if  n  =  2,  3,  4. 


13.2  The  One-Way  Street  Problem 

Over  the  years,  the  amount  of  traffic  on  the  streets  of  a  small  town  has  increased 
enormously  and  the  town  planners  are  trying  to  decide  what  should  be  done.  One 
suggestion,  looked  upon  favorably  by  certain  members  of  the  town  council,  is  that 
all  streets  be  made  one-way  in  order  to  simplify  traffic  flow.  Certain  minimum 
requirements  must  be  met,  however.  For  example,  it  must  remain  possible  to  travel 
from  any  point  in  town  to  any  other  point  along  one-way  streets.  Is  there  a  way 
of  putting  a  direction  on  each  of  the  streets  so  that  this  requirement  is  met?  In 
the  language  of  graph  theory,  the  one-way  street  problem  asks,  “Does  a  given 
connected  graph  have  a  strongly  connected  orientation?” 

Remember  that  a  digraph  is  strongly  connected  if  it  is  possible  to  move 
from  any  vertex  to  any  other  vertex  along  arcs,  in  the  proper  direction.  (See 
Section  1 1.2.)  For  example,  the  digraph  shown  on  the  right  in  Fig  13.4  is  strongly 
connected. 


;;  ,  i 
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Figure  13.4  A  graph  and  a  solution  to  the  One-Way  Street  Problem. 


13.2.1  DEFINITIONS 


To  orient  or  to  assign  an  orientation  to  an  edge  in  a  graph  is  to  assign  a  direction 
to  that  edge.  To  orient  or  assign  an  orientation  to  a  graph  is  to  orient  every  edge  in 
the  graph.  A  graph  has  a  strongly  connected  orientation  if  it  is  possible  to  orient 
it  in  such  a  way  that  the  resulting  digraph  is  strongly  connected. 


Figure  13.5  A  graph  for  which  the  One-Way  Street  Problem  cannot  be  solved. 

The  graph  on  the  left  in  Fig  13.4  has  a  strongly  connected  orientation  (shown 
beside  it).  On  the  other  hand.  Fig  13.5  displays  a  graph  which  does  not  have  a 
strongly  connected  orientation.  The  trouble  is  the  middle  edge;  once  this  is  given 
a  direction,  flow  will  only  be  permitted  from  one  side  of  that  edge  to  the  other, 
but  not  in  reverse.  Such  an  edge  is  called  a  bridge  or  cut  edge. 


o 

i. 
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An  edge  e  of  a  connected  graph  Q  is  called  a  bridge  or  a  cut  edge  if  the  subgraph 
Q  \  { e }  is  not  connected. 

Let  e  be  an  edge  which  is  not  a  bridge  in  a  connected  graph  Q.  Show  that  e  is 
part  of  a  circuit. 


We  shall  show  that  the  presence  of  a  bridge  in  a  connected  graph  Q  is  exactly 
what  prevents  Q  from  having  a  strongly  connected  orientation.  First,  however,  we 
illustrate  a  technique  that  will  be  useful  in  our  proof. 


PROBLEM  3.  Find  a  strongly  connected  orientation  for  the  graph  on  the  left  of 
Fig  13.6. 


W5  W\ 


Figure  13.6  A  graph  and  a  strongly  connected  orientation  for  it. 


Solution.  We  begin  by  finding  a  circuit — for  example,  u\ii2Uju\ — and  orienting 
the  edges  u  \  — >  in  ->  m?  -*■  u\  as  shown  (by  single  arrows)  on  the  right  of 
Fig  13.6. 

Next,  we  look  for  a  vertex  which  is  not  in  but  which  is  adjacent  to  some 
vertex  of  this  circuit;  v\  is  adjacent  to  «i,  for  instance.  Now  we  look  for  a  circuit 
containing  v\U\,  say,  v\V2{=  U\)vi(=  M3)ih(=  U2)vsv(,viv%v \.  We  would  like 
to  orient  the  edges  of  this  circuit  tq  — ►  V2  — »  •  •  •  — >  vs  -*■  tq  as  before,  but 
we  cannot  because  edges  V2V3  and  V3V4  have  already  been  oriented  (and  in  the 
opposite  direction).  So  we  do  the  best  we  can,  orienting  those  edges  of  this  circuit 
not  already  assigned  directions;  we  orient  tq  — ►  V2,  V4  — >  1)5,  V5  — »  i>6,  V(,  — >  tq, 
vq  — >  us,  vg  -*  i1 1  (with  double  arrows).  So  far,  our  procedure  has  identified  eight 
vertices  and  oriented  all  but  one  of  the  edges  between  pairs  of  these.  We  orient 
the  omitted  edge  v%U2  arbitrarily,  say  uq  — >  vg.  It  is  critical  to  observe  (and  we 
leave  it  for  you  to  check)  that,  at  this  stage,  the  subgraph  whose  vertices  are  the 
u,  and  vj  and  whose  edges  are  all  edges  of  the  given  graph  among  these  vertices 
is  strongly  connected;  you  can  get  from  any  w,  or  vj  to  any  other  of  these  vertices 
following  edges  in  the  direction  of  arrows. 

Again,  we  next  look  for  a  vertex  which  is  not  in  but  which  is  adjacent  to 
some  vertex  in  this  subgraph;  w\  is  adjacent  to  iq,  for  instance.  We  find  a  circuit 
which  includes  the  edge  uqtq,  for  example,  w\W2(—  v\)wi(=  vs)w4(—  vq)w^w\, 
and  orient  the  edges  of  this  circuit  which  have  not  so  far  been  oriented  in  the 
direction  this  sequence  of  vertices  indicates.  We  orient  uq  — *  uq,  W4  — >  1V5 
and  u>5  —*■  uq  as  shown  (with  triple  arrows).  You  should  check  that  a  strongly 
connected  orientation  has  now  been  assigned  to  the  given  graph.  I 


13.2.3  THEOREM 


Proof 


Figure  13.7 
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A  graph  has  a  strongly  connected  orientation  if  and  only  if  it  is  connected  and 
has  no  bridges. 

(— »■)  If  a  graph  Q  has  a  strongly  connected  orientation,  then  Q  is  surely  connected. 
Let  e  be  an  edge,  say  with  end  vertices  u  and  v.  If  e  is  oriented  in  the  direc¬ 
tion  u  — >  v  when  Q  is  assigned  its  strongly  connected  orientation,  then  in  this 
strongly  connected  digraph,  there  is  a  directed  path  from  v  to  u.  In  particular, 
there  is  a  path  V  in  Q  from  v  to  u  which  does  not  use  e  (definition  of  path). 
So  any  walk  in  Q  which  involves  e  can  be  replaced  by  a  walk  which  avoids  e, 
simply  by  replacing  e  with  V.  It  follows  that  Q  \  { e }  is  connected,  so  e  is  not  a 
bridge. 

(<— )  Conversely,  suppose  Q  is  connected  and  has  no  bridges.  We  first  remark 
that  every  edge  of  Q  must  be  part  of  a  circuit.  (See  Pause  4.)  Now  we  show  that  Q 
can  be  assigned  a  strongly  connected  orientation  by  mimicking  the  approach  we 
used  to  solve  Problem  3.  Let  C :  u\ui  ■  •  -unU]  be  a  circuit  in  Q.  (As  noted,  every 
edge  is  part  of  a  circuit,  so  some  such  circuit  certainly  exists.)  For  1  <  /  <  n  —  1, 
assign  edge  w,m/  +  i  the  orientation  Uj  — >  w;+|.  Assign  edge  unu\  the  orientation 
un  — *  u ) .  Then  orient  any  other  edges  between  vertices  of  C  arbitrarily. 

If  C  contains  all  the  vertices  of  Q,  then  clearly  we  have  an  orientation  of  Q 
which  is  strongly  connected,  since  C  was  strongly  connected.  Suppose,  on  the  other 
hand,  that  C  does  not  contain  all  the  vertices  of  Q.  Since  Q  is  connected,  there  must 
exist  a  vertex  v\  not  in  C  such  that  v\Uj  is  an  edge  for  some  uj.  Since  every  edge 
in  Q  is  part  of  a  circuit,  v\uj  is  part  of  a  circuit  v\Uj(=  •  •  •  vmv\.  Assign 
the  direction  v\  —>  V2  and  vmv\  the  direction  vm  — >  v\.  Leave  unchanged 
the  orientation  of  edges  on  the  circuit  which  have  already  been  oriented  (that  is, 
those  which  are  part  of  C),  but  orient  any  u/u,-+ 1  not  yet  oriented  in  the  direction 
Vj  ->  Vj + 1 .  Finally,  assign  an  arbitrary  orientation  to  any  remaining  edges  among 
vertices  considered  to  this  point.  We  have  drawn  two  possibilities  for  the  oriented 
subgraph  whose  vertices  are  u\, . . .  ,  v\, . . .  ,  vm  in  Fig  13.7.  The  subgraph  on 

the  right  in  this  figure  is  intended  to  serve  as  a  caution  that,  in  addition  to  V2,  some 
other  of  the  vertices  vj, . . .  ,vm  may  coincide  with  vertices  Uj.  For  example,  in 
the  rightmost  subgraph,  if  you  wanted  to  get  from  Vk+i  to  Vk,  you  would  have  to 
go  first  to  v\,  then  to  uj  and  around  C  to  Vk- 


In  Pause  5,  we  ask  you  to  verify  the  following  crucial  fact:  The  subgraph  of 
vertices  and  edges  identified  so  far  is  strongly  connected.  Thus,  if  this  subgraph  is 
the  entire  graph,  we  are  finished.  Otherwise  we  find  a  vertex  w \  which  is  not  part 
of  but  which  is  adjacent  to  some  vertex  in  this  subgraph,  find  a  circuit  containing 
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this  vertex,  and  orient  edges  as  before.  This  procedure  must  terminate,  and  with 
a  strongly  connected  orientation.  I 


13.2.4  AN  ALGORITHM 
FOR  A  STRONGLY 
CONNECTED 
ORIENTATION 


Establish  the  fact  mentioned  toward  the  end  of  the  preceding  proof. 

Since  the  identification  of  circuits  in  a  graph  is  difficult  without  the  advantage 
of  sight,  the  following  algorithm,  based  upon  depth-first  search,  is  important. 

Suppose  Q  is  a  connected  graph  without  bridges.  To  produce  a  strongly  connected 
orientation,  proceed  as  follows. 


Step  1.  Carry  out  a  depth-first  search  in  Q  and  let  T  be  the  spanning  tree  which 
this  produces. 

Step  2.  For  each  edge  e  of  Q,  consider  the  labels  /  and  j  of  its  ends.  Suppose 

i  <  j- 

If  e  is  in  T,  orient  it  /  — >  j; 
else  orient  it  j  -*■  i. 


9  10 


Figure  13.8  A  depth-first  search,  a  spanning  tree,  and  a  strongly  connected  orientation. 


We  apply  this  algorithm  to  the  graph  in  Fig  13.6,  show  in  Fig  13.8  the  labels 
which  a  depth-first  search  might  yield,  and,  with  heavy  lines,  show  the  corre¬ 
sponding  spanning  tree  as  well.  For  each  edge  ij  with  i  <  j,  the  algorithm 
orients  i  -*■  j  if  the  edge  is  part  of  the  tree  and  otherwise  j  ->  i.  Edge  45,  for 
example,  is  in  the  tree,  so  it  is  oriented  4  — »  5;  on  the  other  hand,  edge  18  is  not 
in  the  tree,  so  it  is  oriented  8  — »  1 . 

Use  Algorithm  13.2.4  to  exhibit  a  strongly  connected  orientation  for  the  graph  in 
Fig  13.1  based  upon  the  depth-first  search  shown  in  that  figure. 


Suppose  Q  is  a  connected  graph  without  bridges.  To  show  that  our  algorithm 
actually  works,  it  suffices  to  show  that  it  orients  the  edges  of  Q  in  such  a  way 
that  there  is  a  directed  path  between  1  and  k  (in  each  direction)  for  all  k.  For  this, 
we  use  the  strong  form  of  mathematical  induction  on  k. 

The  result  is  immediate  if  k  =  1,  so  assume  that  k  >  1  and  that  for  all 
i,  1  <  t  <  k,  there  is  a  path  from  1  to  l  and  from  i  to  1  which  respects  the 
orientation  of  edges.  We  must  show  that  such  paths  exist  between  1  and  k. 

Vertex  k  acquired  its  label  because  it  was  adjacent  to  one  which  was  already 
labeled,  say  l.  Since  the  depth-first  search  algorithm  always  chooses  the  smallest 
available  label,  we  have  i  <  k.  By  the  induction  hypothesis,  there  is  a  directed 
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path  from  1  to  i.  Since  edge  ik  is  part  of  the  depth-first  search  spanning  tree  and 
since  l  <  k,  the  edge  incident  with  l  and  k  is  oriented  l  —*  k;  thus,  there  is  a 
directed  path  from  1  to  k.  It  is  more  difficult  to  show  that  there  is  also  a  directed 
path  from  k  to  1 . 

First  assume  the  depth-first  search  algorithm  starts  backtracking  immediately 
after  labeling  k  (because  all  vertices  adjacent  to  k  have  already  been  labeled). 
Note  that  there  must  be  some  vertex  adjacent  to  k  in  addition  to  £;  otherwise,  Ik 
would  be  a  bridge.  Suppose  this  other  vertex  has  been  labeled  t  where,  necessarily, 
t  <  k.  Since  edge  tk  is  not  part  of  the  spanning  tree.  Algorithm  13.2.4  orients  it 
k  —*■  t.  By  the  induction  hypothesis,  there  is  a  directed  path  from  t  to  1.  Together 
with  kt,  we  obtain  the  desired  directed  path  from  it  to  1. 

On  the  other  hand,  suppose  the  algorithm 
does  not  backtrack  after  labeling  k.  Then  it  will 
proceed  from  k  to  an  adjacent  vertex,  which  will 
be  labeled  k  +  1,  and,  if  there  is  an  unlabeled 
vertex  adjacent  to  k  +  1,  to  k  +  2.  Eventually, 
the  algorithm  backtracks  from  a  vertex  labeled 
k+s,  s  >  1,  to  k,  all  vertices  adjacent  to  vertices 
on  this  path  already  labeled.  It  is  important  to 
note  that  there  is  a  directed  path  from  k  to  each 
vertex  k  +  /,  0  <  i  <  s.  Thus,  if  any  of  these 
vertices  is  adjacent  to  a  vertex  labeled  t  with  t  <  k,  the  induction  hypothesis 
makes  it  easy  to  establish  the  existence  of  a  directed  path  from  k  to  1.  On  the 
other  hand,  if  the  only  vertices  adjacent  to  vertices  of  the  form  k  +  i  are  also  of 
this  form,  then  there  is  no  path  in  Q  from  k  +  s  to  f  which  avoids  the  edge  AT,1 
so  kl  would  be  a  bridge,  a  contradiction. 

As  a  concluding  remark,  we  mention  in  passing  that  Algorithm  13.2.4  can 
be  applied  to  give  an  orientation  to  any  connected  graph  Q  although,  if  Q  has 
bridges,  the  result  will  not  be  strongly  connected. 


Answers  to  Pauses 


4.  Let  u  and  v  denote  the  end  vertices  of  e.  Since  G  \  {e}  is  connected,  there 
is  a  path  in  Q  \  [e]  from  v  to  u.  This  path,  followed  by  uv,  is  a  circuit  in  Q 
containing  e. 

5.  We  must  show  that  it  is  possible  to  get  from  any  u,  or  Vj  to  any  other  w,-  or 
Vj  following  edges  in  the  assigned  directions.  We  can  get  from  any  n,  to  any 
other  u  j  along  the  circuit  C.  Next,  we  show  that  there  is  a  directed  path  from 
each  Vi  to  u,  +  i  and  so,  since  vmv\  is  oriented  vm  — >  t>i,  there  is  a  directed  path 
from  any  v,  to  any  vj.  The  orientation  v\  ->  V2  provides  a  directed  path  from 
v\  to  ib  and,  for  any  i  =  2,  3, . . .  ,  m  —  1,  if  the  orientation  is  not  vt  -*  u(+i , 
then  both  the  vertices  u,  and  u;+i  are  part  of  the  circuit  C,  which  we  can  use  to 
find  a  directed  path  from  n,  to  u(+i .  In  any  case,  there  is  always  some  directed 
path  from  each  u,  to  Vj+\  as  asserted.  This  also  shows  that  we  can  pass  from 
C  to  any  u,  via  the  directed  edge  uj(=  V2)v^.  Since  we  can  move  in  the  other 
direction  via  i>iU2(=  Uj),  it  follows  that  the  subgraph  is  strongly  connected. 


'The  second  last  vertex  on  a  path  to  t  avoiding  kt  is  k  +  j  and,  by  assumption,  t  is  not  adjacent  to 
k  +  j  since  i  <  k  +  j. 
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6.  For  each  edge  ij  with  i  <  j,  we 
orient  i  — >  j  if  the  edge  is  in  the 
spanning  tree  and  otherwise 

j  i- 
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EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB;  (a)]  For  each  of  the  graphs  in  Exercise  1  of 
Section  13.1,  assign  an  orientation  based  on  your  depth- 
first  search  and  state  whether  or  not  this  orientation  is 
strongly  connected. 

2.  Find  strongly  connected  orientations  of  the  graphs  corre¬ 
sponding  to  the  five  so-called  Platonic  solids :  the  cube, 
tetrahedron  [BB],  octahedron,  icosahedron,  and  dodeca¬ 
hedron.  (See  pp.  314,  317,  and  415.) 

3.  (a)  [BB]  Does  there  exist  a  strongly  connected  orienta¬ 

tion  for  the  Petersen  graph?  (See  p.  312.)  If  so,  find 
one. 

(b)  Must  a  Hamiltonian  graph  have  a  strongly  connected 
orientation?  Explain. 

(c)  What  about  the  converse  of  (b)?  Explain. 

4.  How  many  bridges  does  a  tree  with  n  vertices  have? 
Explain. 

5.  [BB]  Can  an  Eulerian  graph  have  bridges?  Explain. 

6.  [BB;  (->)]  Prove  that  an  edge  e  in  a  connected  graph  Q 
is  a  bridge  if  and  only  if  there  are  vertices  u  and  v  in  Q 
such  that  every  path  from  it  to  v  requires  the  edge  e. 

7.  (a)  [BB]  Show  that  any  connected  graph  can  be  ori¬ 

ented  so  that  the  resulting  digraph  is  not  strongly 
connected. 

(b)  Let  Q  be  a  connected  graph  with  all  vertices  of 
even  degree.  Can  Q  be  oriented  so  that  the  resulting 
digraph  is  strongly  connected?  Explain. 

8.  [BB]  Let  Q  be  a  graph  with  a  strongly  connected  ori¬ 
entation  assigned  to  it.  Suppose  that  the  direction  of  the 


arrow  on  each  edge  of  Q  is  reversed.  Is  the  new  digraph 
still  strongly  connected?  Explain. 

9.  Answer  true  or  false  and  explain  your  answers. 

(a)  [BB]  A  graph  has  a  strongly  connected  orientation 
if  and  only  if  every  edge  is  part  of  a  circuit. 

(b)  A  graph  with  a  strongly  connected  orientation  has 
no  vertices  of  degree  1 . 

(c)  A  connected  graph  with  no  odd  vertices  has  a  stron¬ 
gly  connected  orientation. 

10.  Let  e  =  uv  be  a  bridge  in  a  connected  graph  Q.  Let 
Q\  be  the  subgraph  of  Q  \  \e)  whose  vertices  are  those 
from  which  there  is  a  path  to  u  and  whose  edges  are  all 
the  edges  of  Q  among  these  vertices.  Let  Q2  be  defined 
analogously  interchanging  the  roles  of  u  and  v. 

(a)  [BB]  Prove  that  there  is  no  path  between  u  and  v 
in  G  \  { e ). 

(b)  Prove  that  Q\  and  Q2  have  no  vertices  in  common. 

(c)  Prove  that  Q \  is  a  (connected)  component  of  Q  \  {<?}, 
that  is,  a  maximal  connected  subgraph  of  Q  \  \e). 

(In  Fig  13.5,  Q\  and  Q2  are  the  two  triangles.  This 
exercise  shows  that  Fig  13.5  shows  what  happens  in 
general  when  there  is  a  bridge  in  a  graph.) 

11.  [BB]  Faced  with  the  undeniable  fact  that  his  town’s 
street  system  does  indeed  have  some  bridges,  Mayor 
Murphy  decides  that  all  bridges  should  remain  two-way 
streets,  but  all  other  streets  should  be  made  one-way. 
Can  this  be  done  so  as  to  allow  (legal)  travel  from  any 
intersection  to  any  other? 
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12.  Explain  why  the  depth-first  search  algorithm  can  be  used 
to  find  all  bridges  in  a  connected  graph.  Design  a  pro¬ 
cedure  to  find  all  such  bridges  which  is  £>(«3)  (where  n 
is  the  number  of  vertices). 

13.  Let  Q  be  a  graph  with  n  vertices  and  m  edges. 


(a)  Assuming  m  >  ( 1  )2('-~2 1  +  1,  show  that  Q  can  be 
given  a  strongly  connected  orientation.  (You  may 
assume  that  the  condition  implies  that  Q  is  con¬ 
nected.  See  Exercise  22  of  Section  10.1.) 

(b)  Need  Q  have  a  strongly  connected  orientation  if  m  > 

(n  — l)(n-2)  O 
2 


REVIEW  EXERCISES  FOR  CHAPTER  13 


1.  In  each  of  the  following  graphs,  a  depth-first  search  has 
started  and  several  vertices  have  been  labeled.  Finish 
each  search,  show  all  labels,  list  in  order  those  used  on 
the  final  backtracking,  and  show  the  resulting  spanning 
tree. 


(a)  (b) 


1  2 
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2.  Let  v  be  an  arbitrary  vertex  and  e  an  arbitrary  edge  of 
a  connected  graph  Q.  Show  that  it  is  always  possible 
to  carry  out  a  depth-first  search  beginning  at  v  which 
includes  e  as  part  of  the  resulting  spanning  tree. 


3.  Assign  an  orientation  to  each  of  the  graphs  in  Exercise  1 
based  upon  your  depth-first  search  and  state  whether  or 
not  this  orientation  is  strongly  connected. 

4.  Which  graphs  have  the  property  that  it  is  possible  to  start 
at  some  vertex  v  and  carry  out  a  depth-first  search  which 
labels  all  vertices  without  ever  backtracking?  Prove  your 
answer. 

5.  If  a  connected  graph  Q  has  a  bridge  e ,  show  that  every 
spanning  tree  of  Q  must  contain  e. 

6.  Is  it  possible  for  a  Hamiltonian  graph  to  contain  a 
bridge?  Explain. 

7.  (a)  Is  the  orientation  which  appears  in  a  tournament 

always  strongly  connected?  Explain. 

(b)  Is  it  possible  to  change  the  directions  of  some  of 
the  arrows  in  a  tournament  and  achieve  a  strongly 
connected  orientation?  Explain.  Assume  the  graph 
has  at  least  three  vertices. 
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14.1  Planar  Graphs 

The  Three  Houses — Three  Utilities  Problem  posed  in  Chapter  9  asks  whether  or 
not  it  is  possible  to  draw  the  complete  bipartite  graph  /C33  without  any  crossovers 
of  edges.  (See  Fig  14.1.) 


Figure  14.1 


The  graph  K. 3  3,  representing  three  houses  and  three  utilities. 


Thinking  of  the  edges  of  this  graph  as  pieces  of  string  and  the  vertices  as 
knots,  the  question  is  whether  or  not  the  “net”  can  be  arranged  without  string 
crossovers.  In  the  language  of  graph  theory,  the  problem  asks  if  /C33  is  planar. 


14.1.1  DEFINITION 


A  graph  is  planar  if  it  can  be  drawn  in  the  plane  in  such  a  way  that  no  two  edges 
cross. 


In  this  chapter  (and  abusing  terminology)  we  find  it  convenient  to  refer  to  a 
picture  of  a  graph  in  which  there  are  no  crossovers  of  edges  as  a  plane  graph. 
Thus,  a  planar  graph  is  one  which  can  be  drawn  as  a  plane  graph.  There  are  five 
places  where  edges  in  the  graph  Q  of  Fig  14.2(a)  cross;  nevertheless,  Q  is  planar 
because  it  can  be  drawn  as  a  plane  graph,  as  shown  in  Fig  14.2(b). 

Figure  14.2(c)  shows  Q  with  all  edges  drawn  as  straight  line  segments.  In 
fact,  this  is  possible  for  any  planar  graph1  and  so  we  lose  no  generality  if  planar 
graphs  are  drawn  as  plane  graphs  with  straight  edges. 


1  This  result  is  due  to  I.  Fary,  “On  Straight  Line  Representation  of  Planar  Graphs,”  Acta  Scientiarum 
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Figure  14.2 


g  g  drawn  as  a  plane  graph  drawn  as  a  Plane  graPh 

with  straight  edges 

(a)  (b)  (c) 


A  plane  graph  divides  the  plane  into  various  connected  regions,  one  of  which 
is  called  the  exterior  region.  Every  region,  including  the  exterior,  is  bounded  by 
edges.  The  graph  in  Fig  14.2(b)  divides  the  plane  into  six  regions,  R(,  being  the 
exterior  region.  The  boundary  of  R2,  for  example,  consists  of  the  edges  v\vj, 
V3U4,  and  U4 u  1 .  The  boundary  of  R&  is  V2V4,  V4V5,  and  V5V2.  We  require  that 
every  edge  must  be  the  boundary  of  some  region.  Thus,  in  Fig  14.3(a),  the  edge 
labeled  e  is  part  of  the  boundary  of  region  R,  while  in  Fig  14.3(b)  it  is  part  of 
the  boundary  of  the  exterior  region. 


Figure  14.3  (a) 


A  tree  determines  just  one  region  in  the  plane,  the  exterior  region,  and  every 
edge  of  the  tree  is  part  of  the  boundary  of  this  region.  [See  Fig  14.3(c).] 

■KH  Consider  the  plane  graph  shown  on  the  left  of  Fig  14.4. 

(a)  How  many  regions  are  there? 

(b)  List  the  edges  which  form  the  boundary  of  each  region. 

(c)  Which  region  is  exterior? 

Answer  these  same  questions  for  the  graph  on  the  right  as  well. 

Planar  graphs  were  first  studied  by  Euler  because  of  their  connections  with 
polyhedra.  A  convex  regular  polyhedron  is  a  geometric  solid  all  of  whose  faces 
are  congruent.  There  are  in  all  just  five  of  these — the  cube,  the  tetrahedron, 
the  octahedron,  the  icosahedron,  and  the  dodecahedron — and  they  are  popularly 
known  as  the  Platonic  solids  because  they  were  regarded  by  Plato  as  symbolizing 
earth,  fire,  air,  water,  and  the  universe,  respectively. 


Mathematicarum  ( Szeged )  11  (1948),  229-233.  A  proof  also  appears  in  Combinatorial  Problems  and  Exer¬ 
cises  by  Laszlo  Lov£sz,  North-Holland  (1979)  (Problem  5.38). 
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Figure  14.4 
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When  we  discussed  Sir  William  Hamilton’s  “World  Tour”  in  Section  10.2, 
we  showed  how  to  associate  with  the  regular  dodecahedron  a  planar  graph  whose 
edges  and  vertices  correspond  to  the  edges  and  vertices  of  the  solid  and  whose 
regions  correspond  to  the  faces  of  the  polyhedron.  In  the  exercises  of  Section  10.2, 
we  introduced  planar  graphs  corresponding  to  the  cube  and  the  regular  icosahe¬ 
dron.  As  we  said  at  the  time,  we  can  obtain  the  planar  graph  associated  with  a 
particular  polyhedron  by  imagining  that  the  bottom  face  is  stretched  until  the  object 
collapses  flat.  Figure  14.5  shows  the  plane  graphs  associated  with  the  tetrahedron, 
the  cube,  and  the  octahedron. 


Figure  14.5 


The  tetrahedron,  cube,  and  octahedron,  and  their  corresponding  planar  graphs. 


In  1752,  Euler  published  the  remarkable  formula 

V  -  E  +  F  =  2, 

which  holds  for  any  convex  polyhedron  with  V  vertices,  E  edges,  and  F  faces.2 
(A  polygon  is  convex  if  the  line  joining  any  pair  of  nonadjacent  vertices  lies 
entirely  within  the  polygon.)  For  the  cube,  for  instance,  V  =  8,  E  =  12,  and 
F  =  6;  for  the  tetrahedron,  V  =  F  =  4  and  E  =  6.  Euler’s  formula  also  works 
for  any  plane  graph,  with  regions  replacing  faces.  For  example,  the  plane  graph 
in  Fig  14.2  has  V  =  5,  E  =  9,  R  =  6,  so,  V  -  E  +  R  =  5  -  9  +  6  =  2. 


14.1.2  THEOREM 


Let  Q  be  a  connected  plane  graph  with  V  vertices,  E  edges,  and  R  regions.  Then 
V  -  E  +  R  =  2. 


Proof  We  use  induction  on  E.  If  E  =  0,  then  V  =  R  =  1  (because  Q  is  connected)  and 
the  formula  is  true.  Now  assume  the  formula  holds  for  connected  plane  graphs 


2L.  Euler,  "Demonstratio  nonnullarum  insignium  proprietatum  quibus  solida  hedris  planis  inclusa  sunt 
praedita,”  Novi  Commentarii  Academiae  Scientarium  Petropolitanae  4  (1752/53),  140-160. 
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with  E  —  1  edges,  where  E  >  1,  and  that  Q  is  a  connected  plane  graph  with 

E  edges,  V  vertices,  and  R  regions.  We  must  show  that  V  —  E  +  R  =  2.  If 

Q  contains  no  circuits,  then  Q  is  a  tree  and  so  E  =  V  -  1  by  Theorem  12.2.3. 
Since  R  =  1,  we  have  V  —  E  +  R  =  V  —  (V  —  1)  -F  1  =  2  as  desired.  Suppose, 
on  the  other  hand,  that  G  does  contain  a  circuit  C.  Let  e  be  any  edge  of  C  and 
consider  the  subgraph  Q  \  {e},  which  is  a  plane  graph  and  still  connected.  The 
circuit  C  determines  a  region  of  Q  which  disappears  in  G  \  {e}.  So  G  \  {?}  contains 
R  —  1  regions,  all  V  vertices  of  Q,  and  E  —  1  edges.  By  the  induction  hypothesis, 
V  —  (E  —  1)  +  (R  —  1)  =  2;  that  is,  V  —  E  +  R  =  2.  I 

Show  that  this  theorem  is  not  necessarily  true  if  "connected”  is  omitted  from  its 

statement. 

We  are  now  ready  to  solve  the  Three  Houses — Three  Utilities  problem.  It 
turns  out  not  to  be  possible  to  connect  three  houses  to  three  utilities  without 
crossovers  of  connection  lines. 


14.1.3  COROLLARY 


K, 3,3  is  not  planar. 


Proof  We  provide  a  proof  by  contradiction.  The  graph  K, 3,3  has  six  vertices  and  nine  edges. 

If  it  is  planar,  it  can  be  drawn  as  a  plane  graph  with  R  regions.  Since  V  —  E  +  R  =  2, 

we  have  R  =  5.  Now  count  the  number  of  edges  on  the  boundary  of  each  region 
and  sum  over  all  regions.  Suppose  the  sum  is  N .  The  point  to  observe  about  £3,3 
is  that  it  contains  no  triangles  (every  edge  joins  a  house  to  a  utility;  there  are  no 
edges  from  house  to  house  or  utility  to  utility).  Therefore,  the  boundary  of  each 
region  contains  at  least  four  edges,  from  which  it  follows  that  N  >  4R  —  20.  On 
the  other  hand,  in  the  calculation  of  N,  each  edge  was  counted  at  most  twice,  so 
N  <  2E  =  18.  This  contradiction  establishes  the  corollary.  I 

(IIII^^O  Find  the  number  N  defined  in  this  proof  for  the  graph  on  the  left  of  Fig  14.4. 

Verify  that  N  <  2 E.  Give  an  example  of  an  edge  which  is  counted  just  once. 

Intuitively,  a  graph  with  a  lot  of  edges  cannot  be  planar  since  it  will  be 
impossible  to  avoid  crossovers  when  drawing  it.  Thus,  we  expect  planar  graphs  to 
have  relatively  few  edges.  The  next  theorem  makes  this  statement  more  precise. 

Let  G  be  a  planar  graph  with  V  >  3  vertices  and  E  edges.  Then  E  <  3  V  —  6. 


14.1.4  THEOREM 


Proof  We  give  an  argument  for  the  case  that  G  is  connected  and  ask  you  in  Exercise  19 
to  prove  the  general  case. 

If  V  =  3,  then  E  <  3  and  the  theorem  holds.  So  we  assume  that  V  >  3.  Thus, 
we  may  also  assume  that  E  >  3  (otherwise,  the  result  is  clearly  true).  Draw  Q  as  a 
plane  graph  with  R  regions.  Then,  as  in  the  proof  of  Corollary  14.1.3,  we  count  the 
number  of  edges  on  the  boundary  of  each  region,  sum  these  numbers,  and  denote 
the  sum  N.  As  before,  N  <  2 E.  Also,  since  each  boundary  contains  at  least  three 
edges,  N  >  3 R.  Hence,  3 R  <  2 E.  Theorem  14.1.2  says  that  V  —  E  +  R  =  2. 
Therefore,  6  =  3V  —  3E  4-  3R  <  3V  —  3E  +  2E  =  3 V  —  E  and  the  result 
follows.  I 
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14.1.5  COROLLARY 


/C5  is  not  planar. 


Proof  In  /C5,  V  —  5  and  £  =  (,)  =  10  (since  every  pair  of  vertices  is  joined  by  an 
edge).  Since  it  is  not  true  that  E  <  3V  —  6,  /C5  cannot  be  planar.  I 

In  a  graph  where  the  number  of  edges  is  not  too  large,  it  is  reasonable  to 
expect  some  restrictions  on  vertex  degrees.  In  Exercise  20,  we  ask  you  to  show 
that  a  planar  graph  with  at  least  four  vertices  has  at  least  four  vertices  of  degree 
d  <  5.  For  the  present,  we  content  ourselves  with  a  weaker  statement. 


14.1.6  COROLLARY 


Every  planar  graph  contains  at  least  one  vertex  of  degree  d  <  5. 


Proof  Suppose  degty  >  6  for  every  vertex  v;.  Since  ^degu,  =  2 E,  we  would  have 
2E  >  6V  and  hence  E  >  3V  >  3V  —  6.  This  contradicts  E  <  3V  —  6.  I 

It  was  the  Polish  mathematician  Kazimierz  Kuratowski  (1896-1980)  who 
discovered  the  crucial  role  played  by  /C3  3  and  /C5  in  determining  whether  or  not 
a  graph  is  planar.  (The  K.  in  /C„  and  tCmn  is  in  Kuratowski’s  honor.)  First,  since 
these  graphs  are  not  planar,  no  graph  which  contains  either  of  them  as  a  subgraph 
can  be  planar.  Second,  any  graph  obtained  from  either  /C3.3  or  1C 5  simply  by 
adding  more  vertices  to  edges  cannot  be  planar  either. 


14.1.7  DEFINITION 


Two  graphs  are  homeomorphic  if  and  only  if  each  can  be  obtained  from  the  same 
graph  by  adding  vertices  (necessarily  of  degree  2)  to  edges. 


EXAMPLE  1  The  graphs  Q \  and  Q2  in  Fig  14.6  are  homeomorphic  since  both  are  obtainable 
from  the  graph  Q  in  that  figure  by  adding  a  vertex  to  one  of  its  edges.  A 


Figure  14.6  Two  homeomorphic  graphs  obtained  from  Q  by  adding  vertices  to  edges. 

EXAMPLE  2  In  Fig  14.7,  we  show  two  homeomorphic  graphs,  each  obtained  from  /C5  by  adding 
vertices  to  edges  of  /C5.  (In  each  case,  the  vertices  of  1C 5  are  shown  with  solid 
dots.)  A 


Pause  4 


Any  two  cycles  are  homeomorphic.  Why? 


EXAMPLE  3  Two  graphs  are  homeomorphic  if  one  is  simply  obtained  from  the  other  by  adding 
vertices  to  edges;  that  is,  the  third  graph  mentioned  in  Definition  14.1.7  may  be 
one  of  the  two  given  graphs.  For  instance,  graphs  Q\  and  Q2  in  Fig  14.8  are 
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Figure  14.7 


Figure  14.8 


14.1.8  THEOREM 


EXAMPLE  4 


Figure  14.9 


Two  homeomorphic  graphs  obtained  from  /C5. 

homeomorphic  because  Qi  is  obtained  by  adding  two  vertices  (the  solid  dots)  to 
edges  of  Q\.  ▲ 

When  we  say  “adding  vertices  to  edges,”  we  do  not  include  the  possibility  of 
adding  a  vertex  where  edges  cross  over  each  other,  for  this  is  achieved  by  adding 
a  vertex  to  each  edge  and  then  joining  these  two  vertices  together.  Joining  vertices 
is  not  allowed.  For  example,  graphs  Q\  and  C/3  of  Fig  14.8  are  not  homeomorphic. 


Q\  O2  Si 


Q\  is  homeomorphic  to  Q2  but  not  to  £3. 

The  following  theorem  characterizes  planar  graphs  in  a  remarkably  simple 
way.  The  proof  in  one  direction  is  straightforward  and  was  given  earlier:  A  graph 
which  contains  fC 5  or  £3,3  cannot  be  made  planar  by  adding  vertices  to  edges. 
The  proof  of  the  converse  is  more  complicated  and  will  be  omitted. 

(KuratOWSki3)  A  graph  is  planar  if  and  only  if  it  has  no  subgraph  homeomorphic 
to  K, 5  or  £3,3. 

The  graph  Q  in  Fig  14.9  is  not  planar.  By  deleting  two  edges,  we  obtain  a  sub¬ 
graph  S  which  is  K*i,i  (the  bipartition  sets  are  the  sets  of  hollow  vertices  and 
solid  vertices)  except  for  one  vertex  (the  tiny  one)  added  to  an  edge.  Since  S  is 
homeomorphic  to  /C33,  Q  is  not  planar,  by  Kuratowski’s  Theorem.  ▲ 


Q 


A  nonplanar  graph  Q  and  a  subgraph  S  homeomorphic  to  /C3  3. 


K.  Kuratowski,  “Sur  le  Probleme  des  Courbes  Gauches  en  Topologie,”  Fundamenta  Mathematicae  15 
(1930),  271-283. 
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EXAMPLE  5  The  graph  on  the  left  in  Fig  14.10  is  not  planar  because  the  subgraph  shown  on 
the  right  is  homeomorphic  to  IC5.  Notice,  for  example,  that  A  is  adjacent  to  C 
and  D,  and,  except  for  intermediate  vertices  of  degree  2,  also  to  B  and  E.  ▲ 


A 


G  S 


Figure  14.10  A  nonplanar  graph  Q  and  a  subgraph  S  homeomorphic  to  X+. 


Answers  to  Pauses 


1.  The  graph  on  the  left  of  Fig  14.4  has  three  regions  whose  boundaries  are 
[d,  e,h],  {a,  b,  f,  g,  c},  and  {a,  b,  g,  c,  d,  e,  h};  the  last  region  is  exterior. 
The  graph  on  the  right  is  a  tree;  it  determines  only  one  region,  the  exterior 
one,  with  boundary  (a,  b,  c ,  d }. 

2.  In  the  graph  shown,  V  —  E  +  R  =  6  —  6  +  3  =  3.  c/^\ 

3.  The  boundaries  of  the  regions  are  given  in  the  answer  to  Pause  1:  N  = 
3  +  5  +  7=  15  <16  =  2  E.  Edge  /  is  counted  only  once. 

4.  Any  cycle  can  be  obtained  from  a  3-cycle  by  adding  vertices  to  edges. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  Show  that  the  graph  is  planar  by  drawing  an 
isomorphic  plane  graph  with  straight  edges. 


(b)  [BB]  Label  the  regions  defined  by  your  plane  graph 
and  list  the  edges  which  form  the  boundary  of  each 
region. 

(c)  [BB]  Verify  that  V  —  E  +  R  =  2,  N  <  2 E,  and 
E  <  3V  —  6  (where,  as  defined  in  the  proof  of 
Corollary  14.1.3,  N  is  the  sum  of  the  numbers  of 
edges  on  the  boundaries  of  all  regions). 

2.  Repeat  Exercise  1  for  the  complete  bipartite  graph 

+2,5- 


3.  [BB]  Verify  Euler’s  formula  V  —  E  +  F  =  2  for  each 
of  the  five  Platonic  solids. 

4.  One  of  the  two  graphs  is  planar;  the  other  is  not.  Which 
is  which?  Explain.  (Note  that  the  graph  on  the  right  is  the 
Petersen  Graph ,  which  was  introduced  in  Section  10.2.) 


5.  Determine  which  of  the  graphs  in  Fig  14.11  are  planar. 
In  each  case,  either  draw  a  plane  graph  and  a  plane 
graph  with  straight  edges  isomorphic  to  the  one  pre¬ 
sented  or  exhibit  a  subgraph  homeomorphic  to  £3,3 
or  1C  5. 
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Note  that  this  is  just  AC 3  3  less 
one  edge. 


(c) 


Figure  14.11  Graphs  for  Exercise  5. 

6.  Let  Q  be  a  connected  plane  graph  such  that  every  region 
of  Q  has  at  least  five  edges  on  its  boundary.  Prove  that 
3£  <  5L  —  10. 

7.  | BB]  If  Q  is  a  connected  plane  graph  with  V  >  3  vertices 
and  R  regions,  show  that  R  <  2V  —  4. 

8.  (a)  LBB]  Give  an  example  of  a  connected  planar  graph 

for  which  E  =  3V  —  6. 

(b)  Let  Q  be  a  connected  plane  graph  for  which  E  = 

3  V  —  6.  Show  that  every  region  of  Q  is  a  triangle. 

9.  (a)  If  Q  is  a  connected  plane  graph  with  at  least  three 

vertices  such  that  no  boundary  of  a  region  is  a  tri-  (£$ 
angle,  prove  that  E  <  2V  —  4. 

(b)  Let  Q  be  a  connected  planar  bipartite  graph  with  E 
edges  and  V  >  3  vertices.  Prove  that  E  < 

2V  -4. 

10.  (a)  [BB]  For  which  n  is  AC„  planar? 

(b)  For  which  m  and  n  is  Km.n  planar? 

11.  [BB]  Show  that  AC 2,2  is  homeomorphic  to  ACs. 

12.  Show  that  any  graph  homeomorphic  to  AC 5  or  to  AC.s.s  is 
obtainable  from  AC5  or  AC 3,3,  respectively,  by  addition  of 
vertices  to  edges. 

13.  Suppose  a  graph  Q\  with  V\  vertices  and  E\  edges  is 
homeomorphic  to  a  graph  Q2  with  V2  vertices  and  Ei 
edges.  Prove  that  £3  —  V2  =  E\  —  V) . 


14.  (a)  [BB]  Let  Q  be  a  connected  graph  with  Vi  vertices 

and  E\  edges  and  let  H  be  a  subgraph  with  V2 
vertices  and  £2  edges.  Show  that  E2  —  V2  <  E\—V\. 

(b)  Let  Q  be  a  connected  graph  with  V  vertices,  E 
edges,  and  E  <  V  +  2.  Show  that  Q  is  planar. 

(c)  Is  (b)  true  if  E  =  V  +  3?  Explain. 

15.  [BB]  Let  Q  be  a  graph  and  let  H  be  obtained  from  Q 
by  adjoining  a  new  vertex  of  degree  I  to  some  vertex 
of  Q.  Is  it  possible  for  Q  and  H  to  be  homeomorphic? 
Explain. 

16.  Discover  what  you  can  about  Kazimierz  Kuratowski  and 
write  a  short  biographical  note  about  this  famous  Polish 
mathematician  (in  good,  clear  English,  of  course). 

17.  Answer  true  or  false  and  explain. 

(a)  If  Q  is  Eulerian  and  Tt  is  homeomorphic  to  Q ,  then 
H  is  Eulerian. 

(b)  [BB]  If  Q  is  Hamiltonian  and  7 i.  is  homeomorphic 
to  Q ,  then  H  is  Hamiltonian. 

18.  (a)  Show  that  any  planar  graph  all  of  whose  vertices 

have  degree  at  least  5  must  have  at  least  1 2  vertices. 
[Hint:  It  suffices  to  prove  this  result  for  connected 
planar  graphs.  Why?] 

(b)  Find  a  planar  graph  each  of  whose  vertices  has 
degree  at  least  5. 
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19.  (a)  [BB]  Prove  that  if  Q  is  a  planar  graph  with  n  con¬ 

nected  components,  each  component  having  at  least 
three  vertices,  then  E  <  3V  —  6 n. 

(b)  Prove  that  if  Q  is  a  planar  graph  with  n  connected 
components,  then  it  is  always  true  that  E  <  3V  — 
3/t.  Deduce  that  Theorem  14.1.4  holds  for  arbitrary 
planar  graphs. 

20.  (a)  [BB]  Prove  that  every  planar  graph  with  V  >  2 

vertices  has  at  least  two  vertices  of  degree  d  <  5. 

(b)  Prove  that  every  planar  graph  with  V  >  3  vertices 
has  at  least  three  vertices  of  degree  d  <  5. 

(c)  Prove  that  every  planar  graph  with  V  >  4  vertices 
has  at  least  four  vertices  of  degree  d  <  5. 

21.  (a)  [BB]  A  connected  planar  graph  Q  has  20  vertices. 

Prove  that  Q  has  at  most  54  edges. 

(b)  A  connected  planar  graph  Q  has  20  vertices,  seven 
of  which  have  degree  1 .  Prove  that  Q  has  at  most 
40  edges. 

22.  (a)  [BB]  Suppose  Q  is  a  connected  planar  graph  in 

which  every  vertex  has  degree  at  least  3.  Prove  that 
at  least  two  regions  of  Q  have  at  most  five  edges  on 
their  boundaries. 


(b)  Establish  (a)  for  planar  graphs  which  are  not  con¬ 
nected. 

23.  Suppose  that  a  convex  polygon  with  n  vertices  is  tri¬ 
angulated ,  that  is,  partitioned  into  triangles,  possibly  by 
the  introduction  of  new  vertices.  The  graphs  illustrate 
two  triangulations  of  a  convex  polygon  with  seven  ver¬ 
tices.  In  the  graph  at  the  left,  four  edges  were  required 
and,  in  the  graph  at  the  right,  1 3.  Show  that  the  number 
of  edges  added  in  order  to  effect  a  triangulation  is  at 
least  n  —  3. 


24.  [BB]  Find  a  formula  for  V  —  E  +  R  which  applies  to 
planar  graphs  which  are  not  necessarily  connected. 

25.  Prove  that  the  Platonic  solids  are  the  only  regular  poly- 
hedra.  [Hint:  In  a  regular  polyhedron,  every  vertex  has 
the  same  degree  d  and  every  face  has  the  same  number 
a  of  edges  on  its  boundary.  Try  to  solve  V  —  E  +  F  =  2 
under  such  conditions.] 


14.2  Coloring  Graphs 

One  of  the  most  exciting  mathematical  developments  of  the  twentieth  century 
was  the  proof,  in  1976,  of  The  Four-Color  Theorem,  which  is  easy  to  state  and 
understand,  but  whose  proof  had  remained  unsolved  since  1 852  when  it  was  first 
posed  to  his  brother  by  Francis  Guthrie.  Guthrie  had  discovered  that  he  could 
color  a  map  of  the  counties  of  England  with  only  four  colors  in  such  a  way  that 
each  county  had  exactly  one  color  and  bordering  counties  had  different  colors. 
He  guessed  that  it  was  possible  to  color  the  countries  of  any  map  with  just  four 
colors  in  such  a  way  that  bordering  countries  have  different  colors.4  Both  brothers 
had  been  students  of  Augustus  De  Morgan  and  so,  unable  to  answer  the  question 
themselves,  they  asked  their  former  teacher,  one  of  the  greatest  mathematicians  of 
the  age.  De  Morgan  was  able  to  show  that  it  is  impossible  to  have  five  countries 
each  adjacent  to  all  of  the  others,  but  this  result,  while  lending  support  to  Francis 
Guthrie’s  guess,  did  not  settle  it. 

It  has  long  been  known  that  five  colors  are  enough,  but  for  well  over  100  years, 
whether  one  could  make  do  with  just  four  colors  was  not  known.  In  1 879,  the  pres¬ 
tigious  American  Journal  of  Mathematics  published  a  “proof’  of  the  conjecture 


4Our  use  of  the  term  "map’’  is  not  intended  to  limit  us  to  those  maps  found  between  the  covers  of  an 
atlas.  In  the  context  of  map  colorings,  a  map  just  means  a  plane  (possibly  pseudo-)  graph  in  which  the 
edges  represent  borders  and  vertices  are  points  where  borders  meet. 
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Figure  14.12 


by  one  Alfred  B.  Kempe5  (the  final  “e”  is  not  pronounced),  a  London  barrister, 
but  1 1  years  later  a  fatal  flaw  was  discovered.  In  fact,  as  pointed  out  by  Percy 
Heawood,6  Kempe' s  argument  was  a  valid  one  for  five  colors,  but  not  for  four.  It 
turns  out  that  four  colors  are  indeed  enough  but  the  proof,  by  Kenneth  Appel  and 
Wolfgang  Haken,7  occupies  almost  140  pages  of  the  Illinois  Journal  of  Mathemat¬ 
ics  and  will  not  be  presented  here.  The  proof  involved  showing  that  any  planar 
graph  must  contain  a  subgraph  of  a  certain  type.  If  this  subgraph  were  deleted,  one 
could  4-color  the  resulting  reduced  graph.  With  the  assistance  of  J.  Koch  and  1200 
hours  of  computer  time,  it  was  shown  that  any  4-coloring  of  the  boundary  of  any 
of  the  subgraphs  identified  by  Appel  and  Haken  could  be  extended  to  a  4-coloring 
of  the  subgraph;  thus,  any  planar  graph  could  be  4-colored.  The  final  settling  of 
what  had  been  the  Four-Color  Conjecture  was  considered  such  an  achievement 
that  for  a  period  of  time,  the  postage  meters  at  the  University  of  Illinois  bore  the 
inscription,  “Four  Colors  Suffice.”  Some  readers  might  well  be  interested  in  an 
article  by  Robin  Thomas,  which,  after  a  brief  history  of  the  Four-Color  Problem, 
describes  subsequent  efforts  to  verify  independently  the  proof,  this  being  no  small 
task.8 

Our  modest  goal  here  is  to  show  how  graph  theory  can  be  used  to  attack  a 
coloring  problem  and  also  to  prove  that  any  map  can  be  5-colored. 


A  map  and  an  associated  planar  graph. 

Our  starting  point  is  the  observation  that  with  any  map  we  may  associate  a 
planar  graph  whose  vertices  correspond  to  countries  and  where  an  edge  joins  two 
vertices  if  the  corresponding  countries  share  a  common  border.  There  are  nine 
countries  in  the  map  depicted  in  the  left  of  Fig  14.12,  so  the  associated  planar 
graph  has  nine  vertices.  Since  countries  5  and  6  share  a  border,  vertices  5  and  6 
are  joined  by  an  edge,  and  so  on.  The  point  is  that  coloring  the  countries  of  the 
map  so  that  countries  with  a  common  border  receive  different  colors  is  equivalent 
to  coloring  the  vertices  of  the  associated  graph  so  that  adjacent  vertices  have 
different  colors.  For  example,  the  following  assignment  of  four  colors  to  countries 


SA.  B.  Kempe,  “On  the  Geographical  Problem  of  the  Four  Colors,"  American  Journal  of  Mathematics 
2  (1879),  193-200. 

®P.  J.  Heawood.  “Map-Color  Theorem,”  Quarterly  Journal  of  Mathematics  24  (1890),  332-339. 

7K.  Appel,  W.  Haken,  J.  Koch,  "Every  Planar  Map  is  Four-Colorable,”  Illinois  Journal  of  Mathematics 
21  (1977),  429-567. 

sRobin  Thomas,  “An  update  on  the  Four-Color  Theorem.”  Notices  of  the  American  Mathematical  Society 
45  (1998).  no.  7.  848-859. 
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Figure  14.13 


14.2.1  DEFINITIONS 


(or  vertices)  in  Fig  14.12  colors  the  map  (and  the  graph)  in  the  proper  sense: 

1  -  red;  2  -  blue;  3  -  green;  4  -  red;  5  -  green;  6  -  blue; 

7  -  green;  8  -  red;  9  -  yellow. 

Figure  14.13  shows  a  map  of  North  America  in  which  most  provinces  and 
states  have  been  colored.  Starting  the  coloring  with  the  western  states,  it  is  soon 
apparent  that  three  colors  are  not  enough. 


A  partial  4-coloring  of  a  map  of  North  America. 


We  formalize  the  notions  we  have  been  discussing. 

A  coloring  of  a  graph  is  an  assignment  of  colors  to  the  vertices  so  that  adjacent 
vertices  have  different  colors.  An  n-coloring  is  a  coloring  with  n  colors.  The 
chromatic  number  of  a  graph  Q,  denoted  x(G)>  is  the  minimum  value  of  n  for 
which  an  n-coloring  of  Q  exists. 

Suppose  x(Q)  =  1  for  some  graph  Q.  What  do  you  know  about  Ql 

The  graph  on  the  left  of  Fig  14.14  has  been  colored  with  three  colors.  Thus,  its 
chromatic  number  is  at  most  3.  Since  it  contains  several  triangles  (the  vertices  of 
which  must  be  colored  differently),  at  least  three  colors  are  required:  Its  chromatic 
number  is  3.  The  graph  on  the  right  has  been  colored  with  four  colors,  so  for  this 
graph  Q,  x(Q)  S  4.  Since  Q  contains  JC 4  as  a  subgraph  (consider  the  subgraph 
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determined  by  the  vertices  vj, 14, 1)5,  Vf,)  and  since  /C 4  requires  four  colors,  Q 
cannot  be  colored  with  less  than  four  colors.  For  this  graph  then,  x(G)  —  4. 


Figure  14.14 
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The  graph  on  the  left  has  chromatic  number  3;  the  graph  on  the  right  has  chromatic  number  4. 


14.2.2  THE 
FOUR-COLOR 
THEOREM 


14.2.3  THEOREM 


What  is  the  chromatic  number  of  the  graph  in  Fig  14.12? 
xOCn)  =  n,  xOCm.n)  =  2.  Why? 

For  any  planar  graph  Q,  x(G)  5  4. 

Because  of  the  Four-Color  Theorem,  a  cartographer  with  at  most  four  colors 
at  her  disposal  is  certain  to  be  able  to  color  the  countries  of  any  map  with  which 
she  is  confronted  (so  that  countries  with  a  common  border  have  different  colors) 
since  an  n -coloring  of  the  associated  (planar)  graph  with  n  <  4  translates  into  a 
coloring  of  the  countries  of  the  map. 

How  can  x(G)  be  determined  in  a  specific  situation?  For  a  small  graph,  trial 
and  error  is  likely  the  best  method,  though  this  approach  would  surely  not  be  fea¬ 
sible  for  graphs  with  a  large  number  of  vertices  and  edges.  Actually  it  is  unknown 
whether  or  not  a  “good”  (polynomial  time)  algorithm  exists  for  determining  x(G)', 
like  the  Traveling  Salesman’s  Problem,  this  problem  is  NP-complete. 

One  general  result  that  applies  to  any  graph  links  the  chromatic  number  to 
the  degrees  of  the  vertices. 

Let  A  (G)  be  the  maximum  of  the  degrees  of  the  vertices  of  a  graph  Q.  Then 

x(G)  <  i  +  MG). 


Proof  The  proof  is  by  induction  on  V,  the  number  of  vertices  of  the  graph.  When  V  =  1 , 
A(G)  =  0  and  x(G)  =  L  so  the  result  clearly  holds.  Now  let  k  be  an  integer, 
k  >  1,  and  assume  that  the  result  holds  for  all  graphs  with  V  =  k  vertices. 
Suppose  Q  is  a  graph  with  k  +  1  vertices.  Let  v  be  any  vertex  of  Q  and  let 
Go  =  0  \  { u }  be  the  subgraph  with  u  (and  all  edges  incident  with  it)  deleted.  Note 
that  A  (Go)  <  A  (Q).  Now  Go  can  be  colored  with  x(Go)  colors.  Since  Go  has  k 
vertices,  we  can  use  the  induction  hypothesis  to  conclude  that  x(Go)  <  1  +  A(Qo). 
Thus,  x(Go)  <  1  +  A (G),  so  Go  can  be  colored  with  at  most  1  +  A (G)  colors. 
Since  there  are  at  most  A (G)  vertices  adjacent  to  v,  one  of  the  available  1  +  A (G) 
colors  remains  for  v.  Thus,  Q  can  be  colored  with  at  most  1  +  A (G)  colors.  I 

The  result  of  Theorem  14.2.3  is  best  possible  as  we  see  by  considering  the 
complete  graph  on  n  vertices.  Since  x(/C„)  =  n  (Pause  7)  and  A (JCn)  =  n  —  1, 
we  have  x(K-n)  =  1  +  A(/C„). 
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14.2.4  THEOREM 


Proof 


Figure  14.15 


If  Q  is  an  n-cycle,  then  A((?)  =  2  and  it  is  not  hard  to  see  that  x(G)  =  2 
or  3  according  as  n  is  even  or  odd.  Thus,  if  Q  is  a  cycle  with  an  odd  number  of 
vertices,  we  again  have  x(G)  —  1  +  A  (Q).  A  theorem  of  R.  L.  Brooks9  asserts 
that  the  only  connected  graphs  Q  for  which  x(G)  =  1  +  A (Q)  are  tCn  and  cycles 
with  an  odd  number  of  vertices.  So  for  all  other  connected  graphs,  we  have 
X(G)  <  A (Q). 

For  either  graph  Q  in  Fig  14.14,  A  (Q)  =  5,  and  so  we  can  conclude  from 
Theorem  14.2.3  that  x(G)  <  6  or,  from  the  result  of  Brooks,  that  x(G)  5  5.  Nei¬ 
ther  observation  tells  us  what  is  evident  from  the  figure,  that  the  actual  chromatic 
numbers  are  3  and  4.  Nevertheless,  Theorem  14.2.3  and  its  improvement  due  to 
Brooks  do  give  us  some  information  about  the  chromatic  number  of  a  graph  and 
are,  therefore,  of  interest  since  there  is,  in  general,  no  easy  way  to  find  a  chromatic 
number.  Given  a  graph  Q,  we  must  simply  apply  ad  hoc  methods  and  hope  to 
find  x(G)  by  trial  and  error. 

While  we  cannot  prove  the  Four-Color  Theorem,  we  can  prove  the  Five-Color 
Theorem  without  too  much  difficulty. 

(Kempe,  HedWOOd)  If  G  is  a  planar  graph,  then  x(G)  <  5. 

We  must  prove  that  any  planar  graph  with  V  vertices  has  a  5-coloring.  Again  we 
use  induction  on  V  and  note  that  if  V  =  1  the  result  is  clear.  Let  k  >  1  be  an 
integer  and  suppose  that  any  planar  graph  with  k  vertices  has  a  5-coloring.  Let 
G  be  a  planar  graph  with  k  +  1  vertices  and  assume  that  G  has  been  drawn  as  a 
plane  graph  with  straight  edges.  We  describe  how  to  obtain  a  5-coloring  of  G- 


First,  by  Corollary  14.1.6,  Q  contains  a  vertex  v  of  degree  at  most  5.  Let 
Go  =  G  \  (u)  be  the  subgraph  obtained  by  deleting  v  (and  all  edges  with  which  it 
is  incident).  By  the  induction  hypothesis,  Go  has  a  5-coloring.  For  convenience, 
label  the  five  colors  1,  2,  3,  4,  and  5.  If  one  of  these  colors  was  not  used  to  color 
the  vertices  adjacent  to  v,  then  it  can  be  used  for  v  and  Q  has  been  5-colored. 
Thus,  we  assume  that  v  has  degree  5  and  that  each  of  the  colors  1  through  5 
appears  on  the  vertices  adjacent  to  v.  In  clockwise  order,  label  these  vertices 
v\,V2,  ■  ■  ■  ,  vs  and  assume  that  v /  is  colored  with  color  i.  (See  Fig  14.15.)  We 
show  how  to  recolor  certain  vertices  of  Go  so  that  a  color  becomes  available  for 
v.  There  are  two  possibilities. 

Case  1:  There  is  no  path  in  Go  from  v\  to  U3  through  vertices  all  of  which 
are  colored  1  or  3. 


9R.  L.  Brooks.  "On  Coloring  ihe  Nodes  of  a  Network,”  Proceedings  of  the  Cambridge  Philosophical 
Society  37  (1941).  194-197. 
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In  this  situation,  let  H  be  the  subgraph  of  Q  consisting  of  the  vertices  and 
edges  of  all  paths  through  vertices  colored  1  or  3  which  start  at  in .  By  assumption, 
t>3  is  not  in  'H.  Also,  any  vertex  which  is  not  in  H  but  which  is  adjacent  to  a 
vertex  of  'H  is  colored  neither  1  nor  3.  Therefore,  interchanging  colors  1  and  3 
throughout  Ti  produces  another  5-coloring  of  Qo.  In  this  new  5-coloring  both  V] 
and  U3  acquire  color  3,  so  we  are  now  free  to  give  color  1  to  u,  thus  obtaining  a 
5-coloring  of  Q. 

Case  2:  There  is  a  path  V  in  Qo  from  tq  to  113  through  vertices  all  of  which 

are  colored  I  or  3. 

In  this  case,  the  path  V,  followed  by  v  and  tq ,  gives  a  circuit  in  Q  which  does 
not  enclose  both  vj  and  iq.  Thus,  any  path  from  vi  to  v\  must  cross  V  and,  since 
Q  is  a  plane  graph,  such  a  crossing  can  occur  only  at  a  vertex  of  V.  It  follows 
that  there  is  no  path  in  Qo  from  to  to  V4  which  uses  just  colors  2  and  4.  Now 
we  are  in  the  situation  described  in  Case  1,  where  we  have  already  shown  that  a 
5-coloring  for  Q  exists.  I 

Readers  may  notice  that  vertex  V5  appears  to  play  no  role  in  this  proof.  Can 
our  proof  be  adapted  to  prove  the  Four-Color  Theorem?  Recall  that  Alfred  Kempe 
thought  he  had  such  a  proof.  See  Exercise  1 6. 

We  conclude  this  section  with  an  application  of  the  idea  of  chromatic  number 
to  a  problem  familiar  to  most  university  registrars,  the  scheduling  of  exams.  At 
a  university  of  even  moderate  size,  it  is  not  unusual  for  500  or  600  exams  to 
be  scheduled  within  a  reasonably  short  period  of  time.  The  principal  problem  is 
always  to  minimize  "conflicts,”  that  is,  to  try  to  avoid  situations  where  a  stu¬ 
dent  finds  the  exams  in  two  of  his  or  her  courses  scheduled  for  the  same  time 
period. 


PROBLEM  6.  (Examination  Scheduling)  Suppose  that  in  one  particular 
semester,  there  are  students  taking  each  of  the  following  combinations  of  courses. 

•  Mathematics,  English,  Biology,  Chemistry 

•  Mathematics,  English,  Computer  Science,  Geography 

•  Biology,  Psychology,  Geography,  Spanish 

•  Biology,  Computer  Science,  History,  French 

•  English,  Psychology,  History,  Computer  Science 

•  Psychology,  Chemistry,  Computer  Science,  French 

•  Psychology,  Geography,  History,  Spanish 

What  is  the  minimum  number  of  examination  periods  required  for  exams  in  the 
ten  courses  specified  so  that  students  taking  any  of  the  given  combinations  of 
courses  have  no  conflicts?  Find  a  possible  schedule  which  uses  this  minimum 
number  of  periods. 

Solution.  In  order  to  picture  the  situation,  we  draw  a  graph  (shown  on  the  left  of 
Fig  14.16)  with  ten  vertices  labeled  M,  E,  B,  . . .  corresponding  to  Mathematics, 
English,  Biology,  and  so  on,  and  join  two  vertices  with  an  edge  if  exams  in  the 
corresponding  subjects  must  not  be  scheduled  together. 

The  minimal  number  of  examination  periods  is  evidently  the  chromatic  num¬ 
ber  of  this  graph.  What  is  this?  Since  the  graph  contains  /C5  (with  vertices 
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Figure  14.16 
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M,  E.  B.  G ,  CS),  at  least  five  different  colors  are  needed.  (The  exams  in  the 
subjects  which  these  vertices  represent  must  be  scheduled  at  different  times.)  Five 
colors  are  not  enough,  however,  since  P  and  H  are  adjacent  to  each  other  and  to 
each  of  E ,  B,  G,  and  CS.  The  chromatic  number  of  the  graph  is,  in  fact,  6.  In 
Fig  14.16,  we  show  a  6-coloring  and  the  corresponding  exam  schedule.  I 


Answers  to  Pauses 


5.  If  Q  has  an  edge,  its  end  vertices  must  be  colored  differently,  so  x(G)  >  2. 
Thus,  \(Q )  =  1  and  or|ly  if  G  has  no  edges. 

6.  A  way  to  4-color  the  associated  graph  was  given  in  the  text.  From  this,  we 
deduce  that  x(G)  <  4.  To  see  that  x(G)  =  4,  we  investigate  the  consequences 
of  using  fewer  than  four  colors.  Vertices  1 , 2,  3  form  a  triangle,  so  three 
different  colors  are  needed  for  these.  Suppose  we  use  red,  blue,  and  green, 
respectively,  as  before.  To  avoid  a  fourth  color,  vertex  4  has  to  be  colored  red 
and  vertex  5  green.  Thus,  vertex  6  has  to  be  blue.  Since  vertex  9  is  adjacent 
to  vertices  1,  5,  and  6  of  colors  red,  green,  and  blue,  respectively,  vertex  9 
requires  a  fourth  color. 

7.  It  takes  n  colors  to  color  K.n  because  any  two  vertices  of  K.n  are  adjacent: 
X(ICn)  =  n.  On  the  other  hand,  xOGm.n)  =  2;  coloring  the  vertices  of  each 
bipartition  set  the  same  color  produces  a  2-coloring  of 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 
Assume  that  all  graphs  in  these  Exercises  are  connected. 


1.  Name  and  describe  two  major  mathematical  accomplish¬ 
ments  of  the  twentieth  century. 

2.  (a)  [BB]  Draw  a  graph  corresponding  to  the  map  shown 

at  the  right  and  find  a  coloring  which  requires  the 
least  number  of  colors.  What  is  the  chromatic  num¬ 
ber  of  the  graph? 

(b)  [BB]  Answer  true  or  false  and  explain:  The  Four- 
Color  Theorem  says  that  the  chromatic  number  of  a 


planar  graph  is  4. 
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3.  (a)  [BB]  When  we  discussed  the  coloring  of  maps  at 

the  beginning  of  this  section,  we  assumed  implicitly 
that  "bordering”  countries  were  countries  which  had 
some  positive  length  of  border  in  common.  Suppose 
we  deem  countries  to  border  if  they  merely  have  a 
point  in  common.  Will  four  colors  still  suffice  to 
color  a  map? 

(b)  [BB]  We  also  assumed  that  a  country  should  consist 
of  a  single  region.  If  we  drop  this  restriction,  will 
four  colors  still  suffice  to  color  a  map? 

4.  Compute  x(G )  for  each  of  the  graphs  shown  in 
Fig  14.17.  In  each  case,  explain  your  answer  and  exhibit 
a  x(S)-coloring. 

5.  Answer  Exercise  4  for  each  of  the  graphs  in  Exercise  5, 
Section  14.1. 

6.  (a)  [BB]  Find  the  chromatic  number  of  each  of  the 

graphs  in  Exercise  4  of  Section  14.1. 

(b)  [BB]  State  the  converse  of  the  Four-Color  Theorem. 
Is  it  true? 

7.  (a)  [BB]  Is  a  tree  planar?  Explain. 

(b)  Use  (a)  to  prove  that  a  tree  with  n  vertices  has  n  —  1 
edges. 

(c)  Suppose  T  is  a  tree  with  n  vertices.  What  is  x(T) 
and  why? 

8.  Consider  the  graph  shown  on  the  left. 

(a)  Is  it  planar?  Either  draw  an  isomorphic  plane  graph 
or  explain  why  the  graph  is  not  planar. 


(b)  Find  its  chromatic  number  and  explain  why  this 
piece  of  information  is  consistent  with  the  Four- 
Color  Theorem. 


9.  [BB]  Consider  the  graph  shown  on  the  right. 

(a)  Find  a  subgraph  homeomorphic  to  K. 5. 

(b)  Find  a  subgraph  homeomorphic  to  JC 3,3. 

(c)  Is  the  graph  planar?  Explain. 

(d)  What’s  the  chromatic  number  of  the  graph? 

(e)  Use  this  graph  to  comment  on  the  converse  of  the 
Four-Color  Theorem. 

10.  (a)  [BB]  What  is  x(/Ci4)?  what  <s  *(£5.14)?  Why? 

(b)  Let  Q\  and  Qi  be  cycles  with  38  and  107  edges, 

respectively.  What  is  x(G  1)?  What  is  x(Gl) ? 
Explain. 

11.  Answer  true  or  false  and  explain: 

(a)  [BB  ]  If  x(G)  =  3,  then  Q  contains  a  triangle. 

(b)  If  x(!?)  =  4,  then  Q  contains  /C4. 

12.  Let  n  >  4  be  a  natural  number.  Let  Q  be  the  graph 
which  consists  of  the  union  of  K,n-s  and  a  5-cycle  C 
together  with  all  possible  edges  between  the  vertices  of 
these  graphs.  Show  that  x(G)  =  n ,  yet  Q  does  not  have 
K„  as  a  subgraph. 

13.  Answer  true  or  false  and  explain.  If  Q  is  homeomorphic 
to  H,  then  x(G)  =  X (70- 


Figure  14.17  Graphs  for  Exercise  4. 
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14.  Answer  true  or  false  and  explain: 

(a)  [BB]  A  Hamiltonian  graph  with  chromatic  number 
two  must  be  planar. 

(b)  A  planar  graph  with  chromatic  number  two  must  be 
Hamiltonian. 

(c)  A  graph  which  is  both  Hamiltonian  and  Eulerian 
and  has  chromatic  number  two  must  be  planar. 

(d)  [BB]  A  graph  which  is  both  Hamiltonian  and  Eule¬ 
rian  must  have  chromatic  number  less  than  six. 

15.  [BB]  The  graph  arises  in  a  certain  problem  of  exam 
scheduling  such  as  that  described  in  Problem  6.  Find 
the  minimum  number  of  exam  periods  required  so  that 
conflicts  are  avoided. 
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16.  Can  our  proof  of  the  Five  Color  Theorem  be  adapted  to 
prove  the  Four-Color  Theorem?  Go  through  the  proof 
presented  in  this  section  assuming  that  only  four  colors 
are  available  and  find  any  possible  flaw. 

17.  In  addition  to  the  combinations  of  courses  described  in 
Problem  6,  suppose  that  there  are  also  students  taking  all 
of  Geography.  Computer  Science,  Spanish,  and  French. 
Does  this  force  a  change  in  the  exam  schedule?  If  it 
does,  find  a  new  examination  schedule  which  avoids 
conflicts  and  uses  the  fewest  number  of  periods. 

18.  1  BB]  There  are  ten  students  who,  in  the  coming 
semester,  will  be  taking  the  courses  shown  in  the  fol¬ 
lowing  table.  How  many  time  periods  must  be  allowed 
in  order  for  these  students  to  take  the  courses  they  want 
without  conflicts? 


Arnold 

Physics,  Mathematics,  English 

Bill 

Physics.  Earth  Science,  Economics 

Carol 

Earth  Science,  Business 

Calvin 

Statistics,  Economics 

Eleanor 

Mathematics,  Business 

Frederick 

Physics,  Earth  Science 

George 

Business,  Statistics 

Huber 

Mathematics,  Earth  Science 

Ingrid 

Physics,  Water  Skiing,  Statistics 

Jacquie 

Physics,  Economics,  Water  Skiing 

19.  [BB]  The  following  semester,  all  the  students  in  the 
previous  problem,  except  Calvin,  plan  to  take  second 
courses  in  the  same  subjects.  Calvin  decides  not  to  take 


further  courses  in  statistics.  How  many  time  periods  will 
then  be  required? 

20.  Hubert  Noseworthy  loves  snakes  and  keeps  a  dozen 
different  varieties  in  his  apartment  (contrary  to  regu¬ 
lations).  Since  some  varieties  of  snake  attack  other  vari¬ 
eties,  as  shown  in  the  following  tables,  Hubert  needs 
several  boxes  in  which  to  keep  his  snakes  in  order  to 
separate  antagonists.  What  is  the  minimum  number  of 
boxes  he  needs? 


Variety 

Attacks  Variety 

Variety 

Attacks  Variety 

1 

3,4,  5,8,  10,  12 

2 

1,3, 6, 7,  10,  11 

3 

4,9,  12 

4 

5,8,9 

5 

6,7,  10 

6 

9,  12 

7 

10.  12 

8 

7 

9 

8,  1 1 

10 

11 

11 

12 

21.  The  local  day  care  center  has  a  problem  because  certain 
children  do  not  get  along  with  certain  others.  The  table 
shows  which  of  15  children  don’t  get  along  with  whom. 
Finally,  it  is  decided  that  children  who  do  not  get  along 
with  each  other  will  have  to  be  put  into  separate  rooms. 
Find  the  minimum  number  of  rooms  required. 


Child 

doesn’t  get 
along  with 

Child 

doesn’t  get 
along  with 

1 

2.6,9,  10,  11,  13 

2 

1.8.  10,  12.  13.  14.  15 

3 

4,  5,7,  8.  12 

4 

3. 5, 6, 8.  11,  14 

5 

3,  4,  7,  8.  12,  13 

6 

1,4,7,  12,  15 

7 

3,5,6.  8.  11.  14 

8 

2,  3,  4.  5,  7,  9,  14 

9 

1.8.  10.  12.  15 

10 

1,2,9,  11,  15 

1 1 

1,4.  7,  10,  12 

12 

2,  3.5,6,  9,  11,  15 

13 

15 

1,2,5,14 

2,6,9,10,12 

14 

... 

2,4,  7,  8,  13 

22.  Television  channels  are  to  be  assigned  to  stations  based 
in  nine  cities  A.B,...  ,1.  Broadcasting  regulations 
require  that  cities  within  150  km  of  each  other  be 
assigned  different  channels.  What  is  the  least  number 
of  channels  required  if  the  distances  between  the  cities 
are  as  given  in  the  table. 
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23.  Continuing  the  previous  problem,  city  J  is  to  acquire  a  connecting  vertices  if  corresponding  countries  shared  a  bor- 
television  station  too.  Can  it  be  assigned  one  of  the  exist-  der.  By  introducing  one  additional  vertex  corresponding  to 

ing  channels  or  must  it  have  a  new  one?  Its  distances  the  exterior  region  and  additional  edges  as  before,  we  obtain 

from  cities  A,  . . .  ,  /  are  as  follows.  a  graph  known  as  the  dual  graph  of  the  map. 


24.  (a)  [BB]  Let  Q  be  a  connected  graph  with  n  vertices 
and  n  edges.  Prove  that  x(*3)  <  3. 

(b)  Show  that  (a)  remains  true  if  Q  has  n  +  1  edges. 

(c)  Does  (a)  remain  true  if  Q  has  n  +2  edges?  Explain. 


14.2.5  THE  DUAL  OF  A 
PLANAR  GRAPH 


In  this  section,  we  showed  how  to  associate  a  planar  graph 
with  a  map,  by  introducing  a  vertex  for  each  country  and 
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In  this  section,  we  present  two  interesting  applications  of  the  material  in  Sections 
14.1  and  14.2.  The  first  shows  how  knowledge  of  graph  colorings  can  be  used  to 
assist  in  testing  printed  circuit  boards  for  the  existence  of  possible  short  circuits. 
In  the  second,  we  apply  what  we  have  learned  about  graph  colorings,  planarity, 
and  Hamiltonian  cycles  to  the  design  of  floor  plans  for  facilities  such  as  hospitals 
or  shopping  malls  in  order  to  meet,  so  far  as  possible,  various  requirements  of 
juxtaposition. 

Circuit  Testing 

A  printed  circuit  board  can  be  represented  by  a  finite  rectangular  grid  composed 
of  evenly  spaced  rows  of  evenly  spaced  grid  points  called  nodes,  which  are  con¬ 
nected  by  horizontal  and  vertical  line  segments  called  grid  segments.  On  this 
board,  certain  vertices  are  connected  via  disjoint  paths  (called  nets)  along  grid 
segments.  For  example.  Fig  14.18  shows  a  grid  with  49  nodes  and  seven  nets 
labeled  N\ ,  N2 . N7. 

In  general,  the  grid  segments  in  a  particular  net  pattern  correspond  to  conduc¬ 
tor  paths  in  the  circuit  board.  The  problem  of  interest  is  to  determine  whether  or 
not  any  extra  conductor  paths  (short  circuits)  between  nets  have  been  introduced 
into  the  board  during  the  manufacturing  process.  A  short  circuit  need  not  begin 
or  end  at  a  node. 

One  definite  way  to  determine  whether  or  not  there  are  short  circuits  is  to  take 
each  pair  of  distinct  nets,  in  turn,  and  to  apply  an  electrical  signal  to  one  member 
of  the  pair.  If  that  signal  appears  in  the  other  net,  then  there  must  be  a  short.  This 
procedure  can  be  extremely  time-consuming;  for  instance,  in  Fig  14.18,  (2)  =21 


25.  (a)  [BB]  Draw  the  dual  graph  of  the  cube  (considered 

as  a  map)  (Fig  14.5)  as  a  plane  graph  with  straight 
edges.  Identify  this  dual  graph.  [Hint:  It’s  another 
of  the  Platonic  solids.] 

(b)  Repeat  (a)  for  the  dodecahedron  (Fig  10.13). 

26.  [BB]  Is  it  possible  for  a  plane  graph,  considered  as  a 
map,  to  be  its  own  dual? 
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Figure  14.18 


Figure  14.19 


/v, 


N(, 


N7 


A  grid  of  49  nodes  and  7  nets,  comprising  altogether  24  grid  segments. 


comparisons  would  be  required.  Many  of  these  comparisons  may  be  unnecessary, 
however.  As  an  example,  the  left-hand  grid  in  Fig  14.19  shows  two  nets,  A  and 
C,  that  cannot  possibly  have  a  short  between  them  unless  there  is  also  a  short 
between  each  of  them  and  B.  Hence,  it  suffices  to  test  B  against  each  of  A  and  C. 


C 


B 


In  the  right-hand  grid  of  Fig  14.19,  it  is  theoretically  possible  to  have  a  short 
between  A  and  C  which  does  not  touch  B.  as  shown  by  the  dotted  line,  but 
this  seems  unlikely.  We  shall  assume  henceforth  that  any  short  circuit  between 
nets  must  be  a  single  straight  horizontal  or  vertical  line  (of  arbitrary  length)  not 
touching  any  other  net  (but  not  necessarily  through  nodes). 

An  efficient  way  of  testing  either  grid  in  Fig  14.19  would  be  to  combine  nets 
A  and  C  into  a  single  set  which  could  be  tested  against  B.  In  general,  a  good 
procedure  is  to  partition  the  set  of  nets  into  subsets  such  that  if  two  nets  A;, 
A j  are  in  the  same  subset,  then  A,-  and  Nj  cannot  have  a  short  between  them. 
Then  we  need  only  test  these  subsets  against  each  other  to  determine  whether  a 
short  circuit  exists.  For  example,  in  Fig  14.18,  the  subsets  could  be  {A|,  A3,  N(, }, 
{N2,  A5,  A7},  {A4},  and  three  tests  would  suffice  (instead  of  21).  Note  that  N\ 
and  A3  cannot  have  a  short  between  them,  nor  can  A3  and  Ag,  nor  A|  and  Nf,. 
The  other  subsets  are  obtained  by  similar  reasoning. 

Find  another  partition  of  the  net  pattern  in  Fig  14.18  into  three  appropriate  subsets. 

Pause  8  illustrates  an  important  point:  The  sorts  of  partitions  we  have  been 
discussing  are  not  necessarily  unique.  From  an  efficiency  point  of  view,  however, 
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we  don’t  care  about  this.  What  matters  is  that  we  find  a  partition  into  as  few 
subsets  as  possible.  Here  is  where  graph  theory  helps. 


N2  N5 


Given  a  grid  with  a  net  pattern,  we  construct  the  graph  which  has  a  vertex 
corresponding  to  each  net  and  where  two  vertices  are  joined  by  an  edge  if  and 
only  if  there  could  be  a  short  circuit  between  the  corresponding  nets.  Figure  14.20 
shows  the  graph  corresponding  to  Fig  14.18.  Note  that  N\  is  joined  to  AG  but  to 
no  other  vertex  because  the  only  possible  short  involving  N\  must  be  between  N\ 
and  N2-  Other  edges  are  obtained  by  similar  reasoning.  Any  graph  which  can  be 
obtained  by  the  procedure  described  will  be  called  a  line-of-sight  graph. 

How  would  the  line-of-sight  graph  change  if  net  N2  were  not  present  in  the  net 
shown  in  Fig  14.18?  Draw  the  revised  graph. 


Since  two  vertices  in  a  line-of-sight  graph  are  joined  by  an  edge  if  and  only  if 
the  corresponding  nets  could  have  a  short  between  them,  it  is  easy  to  see  that  any 
coloring  of  such  a  graph  yields  subsets  of  the  type  desired  for  the  corresponding 
nets.  For  example,  a  3-coloring  of  the  graph  in  Fig  14.20  could  be  obtained  by 
coloring  Ni,  N3,  and  A^  blue;  AG,  A/5,  and  A/7  red;  and  A/4  white.  Placing  like- 
colored  nets  in  the  same  subset  gives  the  partition  described  earlier. 

Another  possibility  would  be  to  color  N\  and  N(,  blue;  A3  and  A/4  red;  and 
A/2,  N5,  and  A7  white.  This  yields  another  suitable  partition.  Since  the  graph  in 
Fig  14.20  contains  a  triangle,  we  know  that  at  least  three  colors  are  required  and, 
hence,  we  cannot  reduce  further  the  number  of  partitions.  In  general,  the  minimum 
number  of  subsets  obtainable  in  a  suitable  partition  of  nets  is  the  chromatic  number 
of  the  associated  line-of-sight  graph. 

Find  the  chromatic  number  of  the  graph  you  obtained  in  Pause  9,  and  determine 
a  corresponding  partition  of  the  nets. 


Is  our  approach  to  finding  partitions  of  nets  really  useful?  After  all,  chromatic 
numbers  are  notoriously  hard  to  determine.  Fortunately,  only  certain  graphs  are 
obtainable  as  line-of-sight  graphs.  Any  planar  graph  is  a  line-of-sight  graph  but 
/C 9,  for  example,  is  not.  It  is  known  that  no  matter  how  large  the  grid  is  or  how 
many  nets  are  involved,  the  corresponding  graph  Q  has  x(G)  —  12.  It  follows  that 
the  nets  can  always  be  partitioned  into  at  most  12  subsets  and,  hence,  we  never 
need  to  carry  out  more  than  ('4)  =  66  tests.  Is  12  best  possible?  Perhaps  nets 
can  always  be  partitioned  into  at  most  11  subsets,  or  at  most  10.  The  answer  is 
not  known,  but  it  is  not  less  than  8  since  there  exists  a  line-of-sight  graph  Q  for 
which  x(Q)  —  8-  (See  Exercise  9.) 
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The  preceding  results,  as  well  as  many  others,  appear  in  a  fundamental  and 
very  interesting  paper  by  Garey,  Johnson,  and  So.10  There  are  still  a  number  of 
intriguing  open  problems  here — for  example,  is  it  possible  to  characterize  precisely 
those  graphs  which  are  line-of-sight  graphs?  You  are  encouraged  to  investigate 
this  area  further,  first  by  examining  the  aforementioned  paper. 

Facilities  Design 

As  mentioned  earlier,  we  are  interested  here  in  the  design  and  layout  of  physical 
facilities  such  as  hospital  floors.  In  such  problems,  there  are  a  number  of  design 
areas  (rooms  perhaps)  and  it  is  often  desirable  (or  critical)  that  certain  of  these 
areas  be  adjacent  to  each  other.  The  planner  of  such  a  project  will  be  presented 
with  a  list  of  relationships  from  which  it  must  be  determined  whether  or  not  it 
is  possible  to  construct  such  a  floor  plan  and  if  so  how.  If  it  is  not  possible,  the 
question  becomes  how  close  can  one  come  to  meeting  the  requirements. 

If  it  is  possible  to  produce  a  layout  satisfying  all  the  given  relationships  of 
one  area  to  another,  then  we  say  that  the  given  relationships  are  feasible.  If  a  set 
of  relationships  is  not  feasible,  we  would  like  to  determine  the  smallest  number 
of  relationships  whose  omission  produces  a  feasible  set. 

The  first  thing  to  do  is  to  draw  a  graph  (called  the  relationship  graph )  whose 
vertices  represent  design  areas  and  in  which  an  edge  between  vertices  indicates  that 
corresponding  areas  are  to  be  adjacent.  Thinking  of  a  given  layout  of  the  design 
areas  as  a  map  whose  associated  graph  (see  Fig  14.12)  is  the  relationship  graph, 
we  see  that  a  set  of  relationships  will  be  feasible  if  and  only  if  the  relationship 
graph  is  planar. 

Theoretically  then,  Kuratowski's  Theorem  (14.1.8)  could  be  used  to  decide 
whether  or  not  a  given  set  of  relationships  is  feasible.  There  are,  however,  two 
difficulties  with  this  approach.  First,  Kuratowski’s  Theorem  is  difficult  to  apply, 
and  most  architects  would  prefer  an  easier  algorithm.  Second,  and  much  more 
significant,  should  the  relationship  graph  not  be  planar,  Kuratowski’s  Theorem 
provides  no  indication  of  which  relationships  should  be  deleted  in  order  to  achieve 
planarity. 

What  we  are  looking  for,  therefore,  is  a  simple  test  for  planarity  which  will 
readily  identify  those  edges  which  are  causing  a  particular  graph  not  to  be  pla¬ 
nar.  While  this  is  too  much  to  expect,  progress  has  been  made  in  the  case  of 
Hamiltonian  graphs.11  The  procedure  we  outline  determines,  in  a  rather  nice  way, 
whether  or  not  a  Hamiltonian  graph  is  planar.  While  it  does  not  rigorously  identify 
those  edges  which  should  be  deleted  in  the  case  of  nonplanarity,  it  does  model 
the  problem  in  such  a  way  that  it  is  generally  not  difficult  to  locate  such  edges. 
We  demonstrate  this  procedure  by  working  through  a  specific  example. 

EXAMPLE  7  Consider  the  relationship  graph  Q  drawn  in  Fig  14.21.  This  graph  has  a  Hamil¬ 
tonian  cycle,  1243561.  We  start  by  drawing  a  graph  Q\  isomorphic  to  Q  with  the 


lflMichael  R.  Garey,  David  Stifler  Johnson,  and  Hing  C.  So,  “An  Application  of  Graph  Coloring  to  Printed 
Circuit  Testing,"  IEEE  Transactions  on  Circuits  and  Systems,  Vol.  CAS-23,  (1976),  no.  10,  591-599. 

1  lG.  Demourcron,  V.  Malgrance.  and  R.  Pertuiset,  “Graphes  Planai res:  Reconnaissance  et  Construction 
de  Representations  Planaires  Topologiques,”  Recherche  Operationelle  30  (1964),  33. 
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vertices  of  the  Hamiltonian  cycle  in  Q  appearing  in  order  around  the  circumfer¬ 
ence  of  a  circle.  We  then  label  arbitrarily  those  edges  of  Q\  which  are  not  on  the 
cycle.  Next,  we  draw  a  new  graph  H  whose  vertices  correspond  to  the  edges  of 
Qi  just  labeled.  Two  vertices  of  7i  are  joined  by  an  edge  if  the  corresponding 
edges  cross  inside  the  circle.  The  graph  we  obtain  in  this  way  is  shown  on  the 
left  in  Fig  14.22. 


Now  examine  this  new  graph  H.  We  claim  that  the  original  graph  Q  is  planar 
if  and  only  if  xCH)  <2,  and  here’s  why.  If  xW)  —  2,  then  there  is  a  coloring 
of  the  vertices  of  7i  requiring  at  most  two  colors.  Therefore,  the  interior  edges  of 
Q i  could  be  colored  with  at  most  two  colors  in  such  a  way  that  edges  crossing 
inside  the  circle  (if  any)  have  different  colors.  By  drawing  all  the  edges  of  one 
color  outside  the  circle  and  leaving  alone  all  edges  of  the  other  color,  we  would 
then  obtain  a  representation  of  Q\  with  no  crossovers  of  edges;  in  other  words, 
Q  would  be  drawn  as  a  plane  graph.  Conversely,  if  Q  is  planar  and  drawn  as  a 
plane  graph,  then  coloring  all  edges  inside  the  cycle  with  one  color  and  all  edges 
outside  with  another  produces  a  coloring  of  the  associated  graph  H  which  uses  at 
most  two  colors. 

We  return  to  our  example  and  note  from  Fig  14.22  that  xW)  7^  2  since  H 
contains  a  5-cycle,  adbfga\  hence,  the  original  graph  Q  is  not  planar.  So  we  must 
now  look  for  the  smallest  number  of  vertices  which,  when  removed  from  H,  leave 
a  graph  whose  chromatic  number  is  2.  In  our  example,  if  we  remove  any  of  a,  b, 
/,  or  g,  we  will  achieve  the  desired  result. 

Suppose  we  remove  g.  We  picture  the  resulting  graph  and  a  2-coloring  on 
the  right  in  Fig  14.22.  This,  in  turn,  provides  a  coloring  of  the  interior  edges  of 
Q\  \  {g}.  In  Fig  14.23(a),  we  draw  Q\  \  {g}  as  a  plane  graph  with  all  edges  of  one 
of  the  two  colors  drawn  outside  the  circle.  In  order  to  draw  a  layout  with  Q\  \  {g} 
as  its  relationship  graph,  it  is  helpful  first  to  draw  this  plane  graph  with  straight 
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edges,  as  in  Fig  14.23(b).  In  Fig  14.23(c),  we  show  the  corresponding  layout.  If 
this  were,  for  example,  the  floor  plan  of  a  hospital,  Fig  14.23(c)  shows  a  way 
that  the  rooms  can  be  laid  out  so  as  to  satisfy  the  aforementioned  relationships. 
Strange-looking  rooms,  perhaps,  but  feasible!  ▲ 


Q\  \  {g}  drawn  as  a 
plane  graph 


(a) 


6 


Q\  \  (g)  drawn  as  a 
plane  graph  with 
straight  edges 

(b) 


An  associated 
layout 


(c) 


Figure  14.23 


A  plane  graph,  then  drawn  with  straight  edges,  and  an  associated  layout  of  design  areas. 


14.3.1  CLOSING 
REMARKS 


Carry  out  the  steps  in  the  previous  paragraph,  assuming  /  is  removed  instead 
of  g. 


1.  We  proved  in  Section  14.1  that  connected  planar  graphs  with  at  least  three 
vertices  must  satisfy  E  <  3  V  —  6  (Theorem  14.1.4).  Thus,  3V  —6  is  an  upper 
bound  for  the  number  of  relationships  which  are  theoretically  possible  in  a 
facilities  design  having  V  rooms.  Despite  what  the  planner  might  want,  it  is 
impossible  to  do  better! 

2.  In  many  practical  problems,  some  relationships  are  more  important  than 
others.  Certain  ones,  in  fact,  might  be  crucial  (in  a  hospital,  for  example). 
Relative  importance  of  relationships  can  be  incorporated  into  the  model  by 
assigning  weights  to  the  edges  of  the  relationship  graph  and  then  attempting  to 
achieve  planarity  by  removing  the  least  important  edges.  Kruskal’s  algorithm, 
adapted  to  choose  a  maximal  weight  spanning  tree  (instead  of  a  minimal  one), 
has  been  applied  to  this  case  with  some  success. 

3.  The  authors  first  read  about  this  fascinating  area  in  Roberts’s  book  Applied 
Combinatorics ,12  Certainly  there  is  great  potential  here  for  further  investiga¬ 
tion.  We  refer  the  interested  reader  also  to  Chapter  8  of  Chachra,  Ghare,  and 
Moore13  for  additional  information. 


Answers  to  Pauses 


8.  {NUN6},  {N2,  N5,  N7],  [N3,N4}. 


12Fred  S.  Roberts,  Applied  Combinatorics,  Prentice  Hall.  New  Jersey,  1984. 

13  V.  Chachra,  P.  M.  Ghare.  and  J.  M.  Moore,  Applications  of  Graph  Theory  Algorithms,  Elsevier.  North- 
Holland,  New  York.  1979. 
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10.  The  chromatic  number  of  the  preceding  graph  is  4  since  it  contains  /C 4  as  a 
subgraph.  One  partition  would  be  {N\,  N3},  {A4},  { N(, },  {N5,  Nj}. 


11.  A  2-coloring  of  7i  \  {/}  is  shown  at  the 
right.  On  the  left  in  the  graphs  which  fol¬ 
low  is  Q 1  \{/},  drawn  as  a  plane  graph  with 
all  white  edges  drawn  outside  the  circle. 
In  the  middle,  it’s  redrawn  with  straight 
edges,  and  on  the  right  there  is  a  layout 
whose  relationship  graph  is  Q\  \  {/). 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  The  following  questions  refer  to  the  net  pattern 
drawn  below. 

V'l  #  V5 

V2  .  V4  , 

Vi 


H 

(a)  Find  the  number  of  nodes,  grid  segments,  and 
nets. 


(b)  Give  an  example  of  two  nets  which  cannot  have 
a  vertical  or  horizontal  short  circuit  between  them 
unless  there  is  a  similar  short  circuit  between  one 
of  them  and  another  net. 

(c)  Draw  the  associated  line-of-sight  graph  Q. 

(d)  Determine  x(0)  and  find  a  corresponding  partition 
of  the  nets. 

2.  Repeat  Exercise  1  if  the  nodes  of  the  far  right-hand  col¬ 
umn  are  all  connected  by  a  vertical  line  of  grid  segments, 
thus  forming  another  net,  Vf,. 

3.  [BB]  True  or  false?  A  line-of-sight  graph  is  always 
connected. 

4.  Let  A f  be  the  set  of  nets  in  a  given  net  pattern.  Define 
a  relation  ~  on  A  by  A  ~  B  if  there  could  be  a  short 
circuit  between  A  and  B.  Determine  whether  or  not  ~  is 
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(a)  [BB]  reflexive, 

(b)  symmetric, 

(c)  antisymmetric, 

(d)  transitive. 

5.  Refer  to  the  net  pattern  shown  below. 

(a)  [BB]  Draw  the  associated  line-of-sight  graph  Q. 

(b)  Determine  x(G )  and  find  a  corresponding  partition 
of  the  nets. 


Ni  Nb 


6.  Repeat  Exercise  5  if  net  AI3  is  removed. 

7.  [BB]  True  or  false?  A  line-of-sight  graph  is  always 
planar. 

8.  (a)  [BB]  Assume  that  the  only  short  circuits  in  a  printed 

circuit  board  are  horizontal  straight  lines;  that  is, 
vertical  shorts  are  not  possible.  Prove  that  the  asso¬ 
ciated  line-of-sight  graph  Q  is  then  planar  and  con¬ 
clude  that  x(G)  <  4. 

(b)  [BB]  Let  V  and  £  be  the  vertices  and  edges, 
respectively,  of  a  general  line-of-sight  graph  Q. 
Show  that  £  can  be  partitioned  into  subsets  £\,  £2 
such  that  the  graphs  Q(V,£\),  Q(V,  £2)  are  both 
planar. 

(c)  Using  (a)  and  (b),  show  that  <  16  for  any 

line-of-sight  graph  Q. 

(d)  Using  (b),  show  that  K.\\  is  not  a  line-of-sight 
graph. 

9.  [BB]  The  net  pattern  shown  has  a  line-of-sight  graph  Q 
with  xiG)  =  8.  Why?  (This  example  is  due  to  M.  R. 
Garey,  D.  S.  Johnson,  and  H.  C.  So,  cited  in  foot¬ 
note  10.) 


10.  Find  a  best  possible  feasible  relationship  graph  and  draw 
the  corresponding  floor  plan  for  JCj  [BB],  £3,3,  and  Kb, 
each  of  these  considered  as  a  relationship  graph. 

11.  For  each  of  the  following  relationship  graphs,  find  a 
best  possible  feasible  relationship  graph  and  draw  the 
corresponding  floor  plan. 

(a) 


(c) 


12.  A  contractor  is  building  a  single-story  house  for  a  newly 
married  couple.  The  house  is  to  consist  of  a  living  room, 
dining  room,  family  room,  kitchen,  two  bedrooms,  bath¬ 
room.  and  hall.  The  couple  insists  that  all  rooms  border 
on  the  hall.  The  kitchen  is  to  share  a  wall  with  the  dining 
room,  family  room,  and  bathroom.  Both  bedrooms  should 
share  a  wall  with  the  bathroom  and  the  family  room.  The 
living  room  is  to  border  the  family  room  and  the  dining 
room.  Is  it  possible  for  the  contractor  to  meet  the  couple’s 
demands?  If  not,  how  close  could  the  contractor  come  to 
meeting  all  requirements?  Justify  your  answer. 

13.  Jack  is  commissioned  to  build  a  nine-hole  golf  course 
which  meets  the  following  specifications. 

•  The  first  hole  must  return  to  the  clubhouse  and  hence 
have  a  common  border  with  the  ninth  hole. 

•  To  satisfy  thirsty  patrons,  the  fifth  hole  also  returns 
to  the  clubhouse;  it  must  have  a  common  border  with 
the  first  hole,  but  not  necessarily  with  the  ninth. 
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•  All  even-numbered  holes  share  a  common  water  haz¬ 
ard  and  have  some  common  border  with  each 
other. 

•  Each  hole  (after  the  first)  has  a  common  border  with 
the  one  preceding  it. 

(a)  [BB]  Show  that  it  is  impossible  for  Jack  to  build  a 
golf  course  meeting  all  these  requirements. 


(b)  State  three  different  ways  in  which  Jack  could  sat¬ 
isfy  all  but  one  of  the  requirements. 

14.  [BB]  Apply  Kuratowski’s  Theorem,  Theorem  14.1.8,  to 
the  graph  Q  in  Fig  14.21  to  show  that  Q  is  not 
planar. 

15.  [BB]  Apply  Brooks’s  Theorem  (p.  425)  to  find  the  chro¬ 
matic  number  of  the  graph  H  in  Fig  14.22. 


REVIEW  EXERCISES  FOR  CHAPTER  14 


1.  (a)  Show  that  the  graph  is  planar  by  drawing  an  iso¬ 
morphic  plane  graph  with  straight  edges. 

(b)  Label  the  regions  defined  by  your  plane  graph  and 
list  the  edges  which  form  the  boundary  of  each 
region. 


(c)  Verify  that  V-E+R  =  2,  N  <  2E  and  E  <  3V-6. 
(Recall  that  N  is  the  sum  of  the  numbers  of  edges 
on  the  boundaries  of  all  regions.) 

2.  Determine  whether  or  not  each  of  the  graphs  is  planar.  In 
each  case,  either  draw  a  plane  graph  with  straight  edges 
isomorphic  to  the  one  presented,  or  exhibit  a  subgraph 
homeomorphic  to  K. 3,3  or  K. 5. 


3.  If  Q  is  a  connected  plane  graph  with  V  >  3  vertices  and 
R  regions,  each  with  at  least  five  edges  on  its  boundary, 
prove  that  3 R  <  2V  —  4.  (You  may  use  the  result  of 
Exercise  6  of  Section  14.1,  but  first  be  sure  you  can 
prove  it!) 

4.  (a)  A  connected  planar  graph  Q  has  30  vertices.  Prove 

that  Q  has  at  most  84  edges. 

(b)  A  connected  planar  graph  Q  has  30  vertices,  15  of 
which  have  degree  1.  Prove  that  Q  has  at  most  54 
edges. 

5.  Show  that  K.2,7,  is  homeomorphic  to  a  subgraph  of  K, 4. 

6.  True  or  false?  If  Q  is  a  tree  and  H  is  homeomorphic  to 
Q,  then  H  is  a  tree.  Explain  your  answer. 

7.  Compute  *(<?),  where  Q  is  the  graph  in  Exercise  1. 
Explain  your  answer  and  exhibit  a  x(5)-coloring. 

8.  Compute  x(G)  for  the  graphs  of  Exercise  2.  Do  your 
results  say  anything  about  the  converse  of  the  Four- 
Color  Theorem? 

9.  Answer  true  or  false  and  explain  your  answers. 

(a)  A  planar  graph  with  chromatic  number  3  must  be 
Hamiltonian. 

(b)  If  Q  is  a  planar  graph  which  contains  /C4  as  a  sub¬ 
graph,  then  x(0)  =  4. 

10.  Let  Q  be  a  connected  graph  with  at  least  two  vertices. 
Prove  that  x(S)  =  2  if  and  only  if  Q  is  bipartite  with  at 
least  one  edge. 

11.  (a)  Give  an  example  showing  that  the  following  theo¬ 

rem,  which  resembles  Theorem  14.2.3,  is  false: 

Let  E((7)  be  the  minimum  of  the 
degrees  of  the  vertices  of  a  graph  Q. 

Then  X(Q)  >  1  +  £(5). 

(b)  Is  the  “theorem”  stated  in  11(a)  true  if  we  restrict 
our  attention  to  trees  with  at  least  two  vertices? 
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Figure  14.24  Net  patterns  for  Exercises  !3  and  14. 


12.  A  town  jail  contains  four  holding  cells.  On  a  particularly 
busy  night,  twelve  people  are  arrested.  Certain  prisoners 
do  not  get  along  with  certain  others  and  must  be  put 
into  separate  cells,  as  shown  in  the  following  tables.  If 
possible,  find  a  way  of  putting  the  prisoners  into  the  four 
cells  in  such  a  way  as  to  avoid  possible  conflicts  during 
the  night. 


Prisoner 

doesn’t  get  along  with 

1 

3,  5,  8,  9,  10,  11 

2 

3,4,  6.  7,9,  11 

3 

1,2, 6, 8,  11,  12 

4 

2,  5,6,  8,  10,  12 

5 

1,4, 7, 9,  10 

6 

2, 3, 4, 7, 9,  11,  12 

7 

2,  5,6,  8,  10 

8 

1,3,  4,  7,  12 

9 

1,2, 5, 6.  11 

10 

1,4, 5, 7,  12 

11 

1.2,  3,6,9 

12 

3,  4,  6,  8,  10 

13.  Draw  the  line-of-sight  graph  associated  with  the  net  pat¬ 
tern  shown  on  the  left  in  Fig  14.24.  Determine  x(G)  and 
find  a  corresponding  partition  of  the  nets. 

14.  Repeat  Exercise  13  for  the  net  pattern  shown  on  the 
right  of  Fig  14.24. 

15.  Give  an  example  of  a  net  pattern  whose  line-of-sight 
graph  is  Eulerian  but  not  Hamiltonian. 

16.  Show  that  the  complete  bipartite  graph  £7,7  is  not  a  line- 
of-sight  graph.  [Hint:  Use  Exercise  8(b)  of  Section  14.3 
and  Exercise  9(a)  of  Section  14.1.] 

17.  Find  a  best  possible  feasible  relationship  graph  and  draw 
the  corresponding  floor  plan  for  each  of  the  graphs  in 
Exercise  2. 


CHAPTER 


15 

Flow— Min  Cut 
Theorem 


15.1  Flows  and  Cuts 

In  this  chapter,  we  return  to  the  concept  of  a  directed  network,  introduced  in 
Section  11.5,  and  prove  the  very  important  Max  Flow — Min  Cut  Theorem.  Recall 
that  a  directed  network  is  a  digraph  in  which  each  arc  is  assigned  an  integer 
weight. 

Whereas  in  Section  1 1 .5,  the  weights  on  arcs  were  viewed  as  times  required 
to  complete  certain  activities,  the  point  of  view  we  adopt  now  is  rather  different. 
Here,  we  think  of  the  arcs  of  a  digraph  as  pipes  and  the  weight  on  an  arc  as  the 
capacity  of  the  pipe,  the  maximum  amount  of  some  commodity  that  can  flow 
through  it  in  a  given  unit  of  time — liters  of  oil  or  gas,  kilowatts  of  electricity, 
numbers  of  people,  messages,  trucks,  or  letters. 


a  12  c 


Figure  15.1  A  directed  network  with  a  source  5  and  a  sink  t. 

Figure  15.1  shows  a  network  typical  of  the  sort  we  consider  in  this  chapter. 
It  has  two  distinguished  vertices  s  and  t  called,  respectively,  the  source  and  the 
sink.  Intuitively,  we  think  of  the  source  as  the  start  of  the  flow  and  expect  that 
all  arcs  with  source  5  as  an  end  point  should  be  directed  away  from  s.  While 
this  is  generally  the  situation,  it  is  not  important  for  the  theory  which  follows  and 
so  we  do  not  impose  such  a  special  condition  on  s.  Similarly,  while  we  think  of 
the  sink  as  the  place  toward  which  all  the  flow  is  directed,  we  do  not  make  this 
assumption. 
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Remember  that  when  we  say  that  uv  is  an  arc  in  a  digraph,  we  mean  that  the 
arrow  on  the  edge  joining  u  and  v  is  in  the  direction  u  —>  v.  We  denote  by  cllv 
the  capacity  of  arc  mu  in  a  directed  network  and,  in  order  that  cuv  has  meaning 
for  any  pair  of  vertices  «,  u,  we  define  cuv  =  0  if  uv  is  not  an  arc.  We  require 
that  all  cuv  be  nonnegative  integers. 

In  contrast  to  the  capacity  of  an  arc,  the  actual  amount  of  flow  passing  through 
uv  is  denoted  fuv.  Again,  we  define  fuv  =  0  if  uv  is  not  an  arc  so  that  fuv  has  a 
meaning  for  every  u  and  v.  In  the  physical  situations  we  are  trying  to  model,  the 
flow  through  an  arc  cannot  exceed  its  capacity;  thus,  we  require  that  0  <  fuv  <  Cuv 
for  every  u  and  v.  We  also  assume  a  conservation  law  at  each  vertex  (except  the 
source  and  the  sink).  This  says  that  for  any  vertex  u  (other  than  s  or  t)  the  flow 
out  of  u  equals  the  flow  into  n;  symbolically,  fuv  =  ^f vu.  We  hope  you 
agree  that  such  a  law  makes  good  sense  in  any  physical  situation. 


15.1.1  DEFINITIONS 


Given  a  directed  network  with  vertex  set  V,  the  capacity  of  arc  uv  denoted  cuv,  and 
given  two  distinguished  vertices  x  and  t,  called  the  source  and  sink,  respectively, 
an  (x,  t)-flow  is  a  set  J-  of  numbers  {f,,,,}  satisfying 


1.  0  <  f„„  <  Cuv  for  all  u,  v  €  V,  and 

2.  XXeV  v  =  XlreV  for  all  u  e  V  \  (x,  1 1  (Conser\>ation  of  Flow). 


Figure  15.2  shows  an  (i,/)-flow  for  the  directed  network  of  Fig  15.1.  The 
label  on  each  arc  shows  its  capacity  and  the  flow  through  it,  in  that  order.  Thus, 
the  label  12,  10  on  arc  ac  means  that  this  arc  has  capacity  12  and  currently  a  flow 
of  10.  Note  that  the  flow  fH„  in  each  arc  never  exceeds  its  capacity  cuv.  Also 
observe  that  the  law  of  conservation  of  flow  is  satisfied  at  each  vertex.  Consider 
vertex  c,  for  example.  The  flow  away  from  c  is  fcv  =  fcr  =  1 1  while  the  flow 
into  c  is  Y.v  fw  =  +  be  —  10+1  =  11. 


a  12,  10  c 


Figure  15.2 


An  (,v,  O-flow. 


Verify  the  law  of  conservation  at  vertices  a,  b,  and  d. 

The  flow  out  of  the  source  x  equals  10  +  2=12  and  the  flow  into  the  sink 
t  is  11  +  1,  which  equals  12  as  well.  Physically  this  equality  makes  sense:  Since 
conservation  of  flow  means  that  no  flow  is  lost  at  any  vertex,  whatever  flow  leaves 
the  source  must  eventually  reach  the  sink.  A  general  proof  follows. 

We  start  with  the  observation  that 


E  f' 

u.veV 


E<~ 

K.ueV 
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and  hence, 

E  -  e  f„«  =  o. 

u.veV  u.veV 

Summing  first  over  v  and  then  over  u,  we  obtain 

ueV  reV  veV 

Conservation  of  flow  says  that  the  term  in  the  parentheses  here  is  0,  except  when 
w  —  s  and  u  =  t.  Therefore, 

X) -  E f«*  +  E  -  E  =  °- 

V  V  V  V 

Hence, 


CD  Ef--Ef-  =  Ef"-Ef'- 

V  V  V  V 


15.1.2  DEFINITION 


The  value  of  a  flow  T  =  {fut)]  in  a  directed  network  with  source  s,  sink  t,  and 
vertex  set  V  is  the  integer 

val(JF)  =  E  -  Efw 

veV  veV 


or,  equivalently,  by  equation  (1), 

valfiF)  =  E  fl"  _  E  f,u 

v€V  dcV 


The  value  of  a  flow  is  the  net  amount  of  flow  per  unit  time  leaving  the 
source,  or,  equivalently,  the  net  amount  of  flow  per  unit  time  entering  the  sink.  In 
the  most  likely  situation  where  there  is  no  flow  into  the  source  and  no  flow 
out  of  the  sink,  then  fUJ  =  0  =  f,„  for  all  vertices  v,  so  the  value  of  the 
flow  is 

val(JF)  =  X]  =  E  fw- 

V  V 


This  was  the  case  with  the  network  in  Fig  15.2;  the  flow  there  has  value  12. 

Our  goal  in  this  chapter  is  to  find  the  (s,  f)-flow  with  the  largest  possi¬ 
ble  value  through  a  given  directed  network.  Such  a  flow  is  called  a  maximum 
flow. 


With  reference  to  the  directed  network  of  Fig  15.2,  find  a  flow  whose  value 
exceeds  12. 


In  addition  to  (s,  fi-flow,  the  other  important  concept  in  directed  networks  is 
that  of  an  is.  t)-cut. 
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15.1.3  DEFINITIONS 


Let  V  be  the  vertex  set  of  a  directed  network  with  source  s  and  sink  t.  An  (s,  t)- 
cut  is  a  partition  (<S,  T)  of  V  such  that  s  e  S  and  t  €  T.  The  capacity  of  an 
(s,  r)-cut  {5,  T)  is  the  sum  cap(iS,  T)  of  the  capacities  of  all  arcs  from  S  to  T: 


cap  (S,T)=  Y  cuv 

ueS.veT 


What  does  it  mean  to  say  that  {5,  T)  is  a  partition  of  V? 

Every  path  from  s  to  t  contains  an  arc  joining  a  vertex  in  S  to  a  vertex  in  T. 
Thus,  if  every  such  arc  is  cut,  there  is  no  path  left  from  s  to  t.  The  flow  of  goods 
or  information  from  5  to  t  has  been  “cut.” 

With  reference  to  the  network  in  Fig  15.1,  the  sets  S  =  {5}  and  T  = 
{a,  b,  c,  d,  t]  form  an  (<S,  T)-cut.  The  arcs  from  S  to  T  are  sa  and  sb:  Sev¬ 
ering  both  of  these  would  eliminate  any  flow  from  s  to  /.  Another  example  is 
provided  by  the  sets  S  =  {5,  a,  c}  and  T  =  {b,  d,  t }.  This  time,  the  arcs  from  S 
to  T  are  sb,  ad,  and  ct.  Arc  be  is  not  included:  It  goes  from  T  to  S  and  so  can 
never  affect  the  flow  from  v  to  1. 

The  capacity  of  the  first  cut,  { {a,  b,  c,  d,  t} },  is  csa+csb  =  10+1 1  =  21. 
The  capacity  of  the  cut  { {5,  a,  c),  {b,  r/, /} }  is  csb  +  cad  +  Cct  =  11+4+11  =  26. 
The  value  of  the  flow  (12)  in  Fig  15.2  is  less  than  either  of  these  capacities.  This 
is  no  accident. 


15.1.4  THEOREM 


In  any  directed  network,  the  value  of  an  (5,  t)-flow  never  exceeds  the  capacity  of 
any  (s,  r)-cut. 


Proof  Let  T  =  {f uv)  be  any  (s,  f)-flow  and  {5,  T)  any  (s,  t)-cut.  Conservation  of  flow 
tells  us  that  v  1UV  —  fvu  =  0  for  any  u  e  S,  u  ^  5.  (The  possibility  u  =  t 
is  excluded  because  t  S.)  Hence, 

val(JF)  =  Y  f sv  -  Y  f”'s 
ueV  reV 


-E(EfMu  Yjvu) 

ueS  hsV  usV 


(since  the  term  in  parentheses 
is  0  except  for  u  =  s) 


-  E 

ue5,ueV 


E  f. 

ueS.ueV 


Since  (<S,  T }  is  a  partition,  this  last  sum  can  be  written 


'Y,  ^ uv 

U  V€iS 


E 

ueS.veT 


E  f 

u  £<5, 


E  f 

«e5,neT 


=  Y 

ueS,veS 


Y  f-+  E  (f«» 

li€<S,VE<S 


f  vu)’ 


The  first  two  terms  in  the  last  line  are  the  same,  so  we  obtain 

(2)  val(JC)  =  Y  (f«w  -  fun)- 

us5,hsT 
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15.1.5  COROLLARY 


15.1.6  COROLLARY 


Proof 


But  fuv  <  cuv  and  fvu  >  0.  so  fM„  —  fvu  <  cuv  for  all  u  and  v.  Therefore, 
valfiF)  <  ^  Cuv  =  cap (5,  T ) 

ueS.veT 

as  desired.  I 

Equation  (2)  relates  the  value  of  an  (s,  r)-flow  to  any  (5,  r)-cut.  We  highlight 
this  rather  surprising  fact. 

If  T  is  any  (s,  r)-flow  and  {5,  T)  is  any  (s,  r)-cut,  then 

val(JF)  =  (f«v  -  fvu)- 

u 


With  reference  to  the  network  in  Fig  15.2  and  the  cut  S  =  {5,  a,  c},  T  = 
{by  dy  t},  the  sum  specified  in  the  corollary  is 

(f«D  fvu)  —  fsb  +  f(id  ~  f be  +  fc/  =  2  +  0—1  +  11  =  12, 

ueS.veT 

which  is  the  value  of  the  flow  in  this  network.  We  have  also  the  following  fun¬ 
damental  result  about  directed  networks. 

Suppose  there  exists  some  (5,  ?)-flow  JF  and  some  (s,  f)-cut  {5,  T }  such  that  the 
value  of  F  equals  the  capacity  of  {5,  T).  Then  val(jF)  is  the  maximum  value  of 
any  flow  and  cap(5,  T)  is  the  minimum  capacity  of  any  cut. 

Let  T\  be  any  flow.  To  see  that  val(.Fi)  5  val(jF),  note  that  the  theorem  says 
that  val(JF[)  <  cap(5,  T)  and,  by  hypothesis,  cap(<S,  T)  =  val(JF).  So  val(JF)  is 
maximum,  as  asserted.  The  proof  that  cap(<S,  T)  is  minimum  is  similar  and  left 
to  the  Exercises.  I 

It  turns  out  that  the  hypotheses  of  Corollary  15.1.6  can  always  be  satisfied; 
that  is,  in  any  directed  network,  there  is  always  a  flow  and  a  cut  such  that  the 
value  of  the  flow  is  the  capacity  of  the  cut.  By  the  corollary,  such  a  flow  has 
maximum  value. 


The  Construction  of  Flows 


We  conclude  this  section  with  an  indication  as  to  how  we  can  construct  flows  in 
a  directed  network  and  continue  to  center  our  discussion  around  the  network  of 
Fig  15.1.  First  locate  a  path  V  from  s  to  t  (which  follows  the  direction  specified 
by  the  arrows  on  arcs)  and  define  a  flow  by  setting 


f  1  if  uv  e  V 
I  0  if  uv  g  V. 


Such  a  flow  is  called  a  unit  flow  because  it  has  value  1.  In  Fig  15.1,  sact  is  a 
path  V  from  j  to  f  for  which  the  unit  flow  just  defined  is  shown  in  Fig  15.3. 

It  is  not  difficult  to  see  that  conservation  of  flow  is  guaranteed:  If  v  is  a  vertex 
not  on  Vy  then  all  arcs  entering  and  leaving  v  have  flow  0;  if  v  is  on  V,  there  is 


446  Chapter  1 5  The  Max  Flow— Min  Cut  Theorem 


a  12,  1  c 

10,  1  /  \ 

/  4,0 

ll,o\  ; 8.0 

b  2,0  d 

Figure  15.3  A  unit  flow  in  a  network. 

precisely  one  arc  entering  v  which  has  nonzero  flow  and  precisely  one  arc  leaving 
with  nonzero  flow,  and  each  of  these  arcs  has  a  flow  of  1. 

Having  obtained  one  flow,  we  can  continue  to  increment,  by  1  at  a  time,  the 
flow  on  the  arcs  of  V  until  we  reach  the  smallest  capacity  of  an  arc  on  the  path; 
that  is,  until  the  path  has  a  saturated  arc,  meaning  one  carrying  a  flow  equal  to 
its  capacity. 

In  our  example,  we  can  achieve  a  flow  of  10  on  each  arc  of  V,  but  no  larger 
number,  since  at  10,  arc  sa  is  saturated.  This  flow  is  not  by  any  means  maximum, 
however:  We  showed  a  flow  of  value  12  in  Fig  15.2.  By  what  method  can  we 
obtain  such  a  flow,  or  any  flow  of  value  greater  than  10?  We  search  for  another 
path,  V',  from  s  to  t,  and  a  flow  in  V  such  that  all  fuv  are  equal  and  some  arc 
is  saturated. 

In  Fig  15.4,  we  show,  in  addition  to  a  flow  through  V  with  a  saturated  arc, 
a  flow  through  the  path  V :  sbdt  with  the  capacity  of  2  in  each  arc  and  arc  bd 
saturated.  We  continue  to  check  every  path  from  i  to  ;.  For  any  such  path,  we 
increase  the  flow  in  each  arc  one  unit  at  a  time  until  the  path  contains  a  saturated 
arc.  We  observe  that  the  flow  in  each  arc  of  the  path  V" :  sbct  from  s  to  t  can  be 
incremented  by  1,  as  indicated  in  Fig  15.5. 

a  12,  10  c 

10, 

s 

11,2 

Figure  15.4  b  2,2  d 

a  12,  10  c 

10, 
s 

11,3 

Figure  15.5  b  2,2  d 

At  this  stage,  every  path  between  s  and  t  contains  a  saturated  arc,  and  we 
have  a  flow  of  value  13.  Unfortunately,  this  is  still  not  a  maximum.  In  fact,  there 
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exists  a  flow  of  value  17  (see  Fig  15.7)  which,  as  we  shall  soon  see,  is  maximum 
for  this  network.  By  what  process  might  this  maximum  flow  be  obtained?  We 
require,  and  describe  in  the  next  section,  a  refinement  of  the  current  technique. 


Answers  to  Pauses 


1.  The  law  of  conservation  holds  at  a  because 

Y  Ua  =  f sa  =  10  and  Y  fav  =  f ac  +  Ud  =  10  +  0  =  10. 

V  V 

It  holds  at  b  because  ^  f  =  f sb  =  2  and  f fv  =  f f,c  +  f m  =  1  +  1=2. 

V  V 

It  holds  at  d  because  Y_,  fV(/  =  ffl^  +  fbd  =  0  +  1  =  1  and  ^  fjv  =  f dt  =  1 . 

V  V 

2.  A  flow  with  value  13  appears  in  Fig  15.5  and  one  with  value  17  is  shown  in 
Fig  15.7. 

3.  To  say  that  sets  S  and  T  comprise  a  partition  of  V  is  to  say  that  S  and  T 
are  disjoint  subsets  of  V  whose  union  is  V. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  (a)  [BB]  Verify  the  law  of  conservation  of  flow  at  a,  e. 
and  d. 

(b)  [BB]  Find  the  value  of  the  indicated  flow. 

(c)  [BB]  Find  the  capacity  of  the  (s,  t)-cut  defined  by 
S  =  {.?,  a ,  b }  and  T  =  [c,  d.  e,  ?}. 


a  3, 2  c 


(d)  [BB]  Can  the  flow  be  increased  along  the  path 
sbedtl  If  so,  by  how  much? 

(e)  [BB]  Is  the  given  flow  maximum?  Explain. 

(f)  [BB]  Illustrate  Corollary  1 5. 1 .5  for  S  and  T  as  in  (c). 

2.  (a)  Verify  the  law  of  conservation  of  flow  at  a,  e,  and  d. 

(b)  [BB]  Find  the  value  of  the  indicated  flow. 

(c)  Find  the  capacity  of  the  (.v,  r)-cut  defined  by  S  = 
[s,  a ,  b,  c,  d,  e}  and  T  =  [r). 

(d)  Name  all  saturated  arcs. 

(e)  Is  the  given  flow  maximum?  Explain. 


(f)  Illustrate  Corollary  15.1.5  for  S  and  T  as  in  (c). 


a  4.3  c 


3.  (a)  Verify  the  law  of  conservation  of  flow  at  a ,  c,  and  d. 

(b)  Find  the  value  of  the  indicated  flow. 

(c)  [BB]  Find  the  capacity  of  the  (s,  r)-cut  defined  by 
S  =  {.?,  a,  b.  d }  and  T  =  [c,  e,  /,  f). 


a  3,3  c 


(d)  Is  the  given  flow  maximum?  Explain. 

(e)  Illustrate  Corollary  15.1.5  for  S  and  T  as  in  (c). 
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4.  (a)  Verify  the  law  of  conservation  of  flow  at  a,  b,  and  e. 

(b)  Find  the  value  of  the  indicated  flow. 

(c)  Find  the  capacity  of  the  (s,  t)-c ut  defined  by  S  = 
{s,  a,  c,  d\  and  T  =  {b,  e,  t}. 


a  2,  1  b 


(d)  Can  the  flow  be  increased  along  the  path  sadtl  If 
so,  by  how  much?  Can  it  be  increased  along  scdtl 
If  so,  by  how  much? 


(e)  Is  the  given  flow  maximum?  Explain. 

(f)  Illustrate  Corollary  15.1.5  for  5  and  T  as  in  (c). 

5.  Answer  the  following  questions  for  each  of  the  networks 
shown  in  Fig  15.6. 

i.  Exhibit  a  unit  flow. 

ii.  Exhibit  a  flow  with  a  saturated  arc. 

iii.  Find  a  “good”  and,  if  possible,  a  maximum  flow  in 
the  network.  State  the  value  of  your  flow. 

6.  [BB]  Give  an  example  of  a  flow  { f w L, }  in  a  directed 
network  which  has  the  following  properties: 

•  there  is  no  flow  into  the  source  or  out  of  the  sink; 

.  not  all  fuv  =  0; 

•  the  flow  has  value  0. 

7.  [BB]  Complete  the  proof  of  Corollary  15.1.6  by  show¬ 
ing  that  if  a  network  has  an  (,v,  f)-flow  T  and  an  ( s ,  t)- 
cut  [5,  T)  such  that  val(.F)  =  cap(S,  T),  then  cap(S,  T) 
is  the  minimum  capacity  of  any  cut. 


15.2  Constructing  Maximal  Flows 

Our  goal  now  is  to  modify  the  procedure  for  constructing  flows  given  at  the 
end  of  the  previous  section  in  order  to  construct  a  maximum  flow  for  a  directed 
network. 

In  the  theory  of  directed  networks,  chain  is  a  more  common  term  than  trail 
to  describe  a  walk  with  distinct  arcs  which  can  be  followed  in  either  direction. 
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An  arc  of  a  chain  is  a  forward  arc  if  it  is  followed  in  the  direction  of  the  arrow 
and  a  backward  arc  if  it  is  followed  in  a  direction  opposite  that  of  the  arrow.  In 
Fig  15.5,  the  chain  sbcadt  has  forward  arcs  sb,  be,  ad,  and  dr,  and  backward 
arc  ca. 


15.2.1  DEFINITION 


Given  an  (5,  r)-flow  { fuv}  in  a  network,  a  chain  is  called  flow-augmenting  if  and 
only  if  fuv  <  cuv  for  each  forward  arc  uv  of  the  chain  and  fuv  >  0  for  each 
backward  arc. 


The  chain  C :  sbcadt  from  s  to  t  in  Fig  15.5  is  flow-augmenting  because, 
checking  forward  arcs,  3  =  fv/,  <  csb  =  1 1;  1  =  f be  <  C-bc  =  8;  0  =  fad  <  cad  = 
4;  2  =  f dt  <  c-di  =  14;  and,  checking  the  backward  arc,  10  =  fac  >  0. 

The  term  “flow-augmenting”  chain  derives  from  the  fact  that  such  a  chain  can 
be  used  to  increase  flow:  increase  the  flow  on  each  forward  arc  by  some  fixed 
amount  and  decrease  the  flow  on  each  backward  arc  by  the  same  amount.  The 
largest  “amount”  that  can  be  used  here  is  a  number  called  the  slack  of  the  chain. 
The  slack  is  determined  as  follows. 

Calculate  the  unused  capacity  cuv  —  fuv  of  each  forward  arc  and  the 
flow  fuv  in  each  backward  arc.  The  slack  of  the  chain  is  the  minimum 
of  all  these  numbers. 

In  our  example,  the  slack  is  4,  the  value  of  c ad  —  fad-  Increasing  by  4  the 
flow  in  the  forward  arcs  of  C  and  decreasing  by  4  the  flow  in  the  backward  arcs, 
we  obtain  the  network  and  flow  shown  in  Fig  15.7. 


a  12.6  c 


Figure  15.7  A  maximum  flow. 

Why  does  the  procedure  just  described  of  adding  an  amount  q  to  the  forward 
arcs  of  a  chain  and  subtracting  the  same  amount  from  the  backward  arcs  preserve 
conservation  of  flow  at  each  vertex? 

By  checking  all  possibilities,  it  can  be  seen  that  there  are  no  further  flow- 
augmenting  chains  in  this  network,  and.  marvelously,  this  means  that  a  maximum 
flow  has  been  achieved!  We  shall  see  why  shortly,  but  for  now,  we  content 
ourselves  with  showing  that  the  flow  in  Fig  15.7  is  maximum.  For  this,  we  consider 
the  (5,  f)-cut  given  by  S  =  {s,  a,  b,  c)  and  T  =  {d,  t },  which  has  capacity  c ad  + 
Cbd+Cct  =  4+2+1 1  =  17.  Since  the  displayed  flow  has  value  17,  Corollary  15.1.6 
guarantees  that  it  is  maximum. 

It  is  important  to  observe  that  the  vertices  of  S  are  exactly  those  which  can  be 
reached  from  s  by  flow-augmenting  chains.  In  addition  to  s,  S  includes  b  (since 
fsb  =  7  <  1 1  =  csb)  and,  therefore,  c  (since  fbc  =  5  <  8  =  c^.).  From  c ,  we  are 
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able  to  continue  on  a  backward  arc  to  a  (since  fac  =  6>  0).  From  a,  a  backward 
arc  leads  to  s,  which  is  already  in  S,  and  the  forward  arc  is  saturated.  The 
vertices  which  can  be  reached  from  ,y  by  flow-augmenting  chains  are  {5,  a,  b,  c}, 
precisely  the  vertices  of  S.  This  observation  is  key  to  the  proof  of  the  famous 
Max  Flow — Min  Cut  Theorem  of  Lester  R.  Ford  Jr.  and  D.  R.  Fulkerson.1 


15.2.2  MAX 
FLOW— MIN  CUT 


THEOREM 


Where  have  you  heard  Ford’s  name  before? 


In  a  directed  network,  the  maximum  value  of  an  (s,  t)-flow  equals  the  minimum 
capacity  of  an  (5,  t)-cut. 


Proof  Using  the  procedure  just  described,  we  construct  an  (,?,  r)-flow  T  that  does  not 
contain  any  flow-augmenting  chains  from  s  to  t.  Corollary  15.1.6  tells  us  that 
the  proof  will  be  complete  if  we  can  construct  an  (s,  f)-cut  {<S,  T }  such  that 
valiJC)  =  cap(<S,  T). 

With  the  hindsight  of  our  earlier  example,  we  let  S  be  the  set  of  all  ver¬ 
tices  which  can  be  reached  by  flow-augmenting  chains  starting  at  s  (including 
s  itself)  and  we  let  T  be  the  set  of  all  remaining  vertices.  Since  there  are  no 
flow-augmenting  chains  from  s  to  f,  it  cannot  be  that  t  e  S,  hence  t  e  T.  Thus, 
{<S,  T)  is  an  ( s ,  t)-cut. 

Let  u  be  any  vertex  of  <S  and  let  v  be  any  vertex  of  T.  Because  u  e  S,  there 
is  a  flow-augmenting  chain  from  s  to  u.  Since  v  £  S,  we  conclude  that  iuv  =  cuv 
and  fvu  =  0,  for  otherwise  the  flow-augmenting  chain  from  s  to  h  (which  exists 
by  definition  of  S)  could  be  extended  to  a  flow-augmenting  chain  from  .v  to  v. 
Now  Corollary  15.1.5  tells  us  that 

val(.F)  =  ^uv-fvu) 

ueS.veT 

=  ^2  (c»v~0)=  Cuv  =  cap(S.T). 

u€S,v€.'2~  ueS,vcT 

Hence,  we  have  found  a  cut  whose  capacity  equals  val(JF),  proving  that  T  is 
maximum.  I 

The  proof  of  this  theorem  shows  that  if  an  (5,  ;)-flow  has  the  property  that 
there  are  no  flow-augmenting  chains  from  .v  to  t,  then  the  flow  is  maximum  (a 
fact  to  which  we  earlier  alluded). 

Also  observe  that  we  have,  in  this  section,  provided  both  a  formal  algorithm 
for  constructing  a  maximum  flow  and  a  method  for  checking  that  it  is  maximum 
(via  the  explicit  construction  of  a  cut  with  capacity  the  value  of  the  flow).  We 
illustrate  with  a  final  example. 


PROBLEM  1.  Find  a  maximum  flow  in  the  directed  network  shown  in  Fig  15.8 
and  prove  that  it  is  a  maximum. 


'l.  R.  Ford,  Jr.  and  D.  R.  Fulkerson.  “Maximal  flow  through  a  network,"  Canadian  Journal  ofMathe- 
matics  8  (1956),  399—404, 


15.2  Constructing  Maximal  Flows  451 


Figure  15.8  A  directed  network. 

Solution.  We  start  by  sending  a  flow  of  2  units  through  the  path  sadt,  a  flow 
of  3  units  through  sbet,  and  a  flow  of  3  units  through  scft,  obtaining  the  flow 
shown  on  the  left  in  Fig  15.9.  Continue  by  sending  flows  of  2  units  through  sbdt 
and  2  units  through  sbft,  obtaining  the  flow  shown  on  the  right. 

a  6, 2  d  a  6, 2  d 


Figure  15.9  Two  flows. 

At  this  point,  there  are  no  further  flow-augmenting  chains  from  s  to  t  involving 
only  forward  arcs.  However,  we  can  use  the  backward  arc  da  to  obtain  a  flow- 
augmenting  chain  scbdaet.  Since  the  slack  of  this  chain  is  2,  we  add  a  flow  of  2 
to  sc,  cb,  bd,  ae,  and  et,  and  subtract  2  from  ad.  The  result  is  shown  in  Fig  15.10. 


Figure  15.10 


a  6,0  d 


A  search  for  further  flow-augmenting  chains  takes  us  from  s  to  c  or  b  and  on  to 
d,  where  we  are  stuck.  This  tells  us  that  the  current  flow  (of  value  14)  is  maximum. 
It  also  presents  us  with  a  cut  verifying  maximality,  namely,  <S  =  [s,  b,  c,  d)  (those 
vertices  reachable  from  j  by  flow-augmenting  chains)  and  T  =  {a,  e,  f,  t)  (the 
complement  of  S).  The  capacity  of  this  cut  is 

c  sa  +  Cbe  +  c  bf  +  c  cf  +  c*  =  2  +  3  +  2  +  3-F4  =  14. 
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Since  this  is  the  same  as  the  value  of  the  flow,  we  have  verified  that  our  flow  is 
maximum.  1 


Rational  Weights 


Throughout  this  chapter  there  has  been  the  underlying  assumption  that  the  weights 
attached  to  arcs  are  nonnegative  integers.  In  fact,  there  is  no  additional  complica¬ 
tion  if  capacities  are  arbitrary  rational  numbers  since  any  problem  related  to  such 
a  network  is  equivalent  to  a  problem  in  the  related  network  (where  capacities  are 
integral)  obtained  by  multiplying  all  capacities  by  the  least  common  multiple  of 
the  denominators. 

Even  if  some  capacities  are  not  rational,  the  Max  Flow — Min  Cut  Theo¬ 
rem  is  still  true,  but  the  proof  we  have  given  breaks  down.  In  fact,  Ford  and 
Fulkerson2  exhibit  a  small  network  with  irrational  capacities  and  the  following 
properties: 


Answers  to  Pauses 


•  there  is  a  flow,  not  maximum,  for  which  the  process  we  have  described 
of  obtaining  flows  of  higher  value  via  flow-augmenting  chains  never  termi¬ 
nates; 

•  the  infinite  sequence  of  flows  (of  higher  and  higher  value)  does,  however, 
converge,  to  a  flow  of  some  value  Q\  but 

•  the  maximum  value  of  a  flow  is  4 Q. 

4.  The  flow  on  the  arcs  incident  with  a  vertex  not  on  the  chain  are  not  changed,  so 
conservation  of  flow  continues  to  hold  at  such  a  vertex.  What  is  the  situation 
at  a  vertex  on  the  chain?  Remember  that  a  chain  in  a  directed  network  is  just 
a  trail  whose  edges  can  be  followed  in  either  direction;  thus,  each  vertex  on 
a  chain  is  incident  with  exactly  two  arcs.  Suppose  a  chain  contains  the  arcs 
uv,  vw  (in  that  order)  and  that  the  flows  on  these  arcs  before  changes  are  fuv 
and  fUU).  There  are  essentially  two  cases  to  consider. 

Case  1:  Suppose  the  situation  at  vertex  v  in  the  network  is  u  — >  v  — >  w. 
In  this  case,  both  uv  and  vw  are  forward  arcs,  so  each  has  the  flow 
increased  by  q.  The  total  flow  into  v  increases  by  q ,  but  so  does  the  total 
flow  out  of  v,  so  there  is  still  conservation  of  flow  at  v.  (The  analysis  is 
similar  if  the  situation  at  v  is  u  <—  v  w.) 

Case  2:  The  situation  at  v  is  u  — >  v  w. 

Here  the  flow  on  the  forward  arc  uv  is  increased  by  q  and  the  flow  on 

the  backward  arc  wv  is  decreased  by  q.  There  is  no  change  in  the  flow 

out  of  v.  Neither  is  there  any  change  in  the  flow  into  v  since  the  only 
terms  in  the  sum  frv  which  change  occur  with  r  =  u  and  r  =  w,  and 
these  become,  respectively,  ful)  +q  and  f„,„  — q .  (The  analysis  is  similar 
if  the  situation  at  v  is  u  <—  v  — >  w.) 


5.  Ford  was  one  of  the  co-discovers  of  the  “Bellman-Ford”  shortest  path  algo¬ 
rithm.  See  Section  1 1 .2. 


-L.  R.  Ford.  Jr.  and  D.  R.  Fulkerson.  Flows  in  Networks ,  Princeton  (1962).  p.  21. 
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EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  Answer  the  following  two  questions  for  each  of  the 
directed  networks  shown. 

i.  Show  that  the  given  flow  is  not  maximum  by  finding 
a  flow-augmenting  chain  from  s  to  t.  What  is  the 
slack  in  your  chain? 

ii.  Find  a  maximum  flow,  give  its  value,  and  prove  that 
it  is  maximum  by  appealing  to  Theorem  15.2.2. 


a  3,2  d 


e  6,6  /  7,6  g 


2.  [BB;  (a)]  Find  a  maximum  flow  for  each  of  the  networks 
in  Exercise  5  of  Section  15.1.  In  each  case,  verify  that 
your  flow  is  maximum  by  finding  a  cut  whose  capacity 
equals  the  value  of  the  flow. 

3.  Find  a  maximum  flow  for  each  of  the  networks  shown 
in  Fig  15.1 1.  In  each  case,  verify  your  answer  by  find¬ 
ing  a  cut  whose  capacity  equals  the  value  of  the 
flow. 

4.  Shown  are  two  networks  whose  arc  capacities  are  ratio¬ 
nal  but  not  always  integral.  Use  the  idea  suggested  at 
the  end  of  this  section  to  find  a  maximum  flow  in  each 
of  these  networks.  In  each  case,  state  the  maximum  flow 
value. 
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Figure  15.11  Graphs  for  Exercise  3. 


15.3  Applications 

Obvious  examples  of  directed  networks  include  the  intake  and  waste  systems 
of  plumbing  in  a  house,  oil  and  natural  gas  pipelines,  telephone  and  electrical 
networks,  mail  courier  services,  and  the  flow  of  people  through  some  system  of 
tunnels.  There  are,  however,  less  obvious  applications,  some  of  which  we  shall 
discuss  here.  One,  a  problem  of  supply  and  demand,  will  be  discussed  in  detail; 
the  others  are  merely  sketched. 

Multiple  Sources  and/or  Sinks 

Some  directed  networks  have  several  sources  and/or  several  sinks  (an  electronic 
mail  network,  for  instance).  Such  networks  can  be  handled  easily  by  adding  two 
new  vertices,  s'  and  /,  drawing  an  arc  from  .v  to  each  source  and  an  arc  from 
each  sink  to  t ,  and  putting  a  sufficiently  large  capacity  on  each  of  these  new 
arcs  so  that  their  presence  does  not  change  the  various  possibilities  for  flow 
through  the  original  network.  A  maximum  (s,  f)-flow  in  this  new  network  will 
give  a  maximum  flow  in  the  other  network,  simply  by  forgetting  about  the  added 
arcs.  We  shall  see  a  worked  example  of  this  sort  in  conjunction  with  our  next 
application. 
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Figure  15.12 


Figure  15.13 


Supply  and  Demand  Problems 

A  small  Alberta  department  store  chain  maintains  three  warehouses  A,  B,  and  C 
in  the  south  of  the  province  and  three  stores  D,  E,  and  F  in  rural  communities 
in  the  far  north.  The  warehouses  have,  respectively,  500,  500,  and  900  snow 
blowers  in  stock  on  October  1 .  The  first  blizzard  of  winter  is  forecast  to  start  near 
midnight,  October  2,  and  there  is  an  immediate  demand  from  the  stores  for  700, 
600,  and  600  snow  blowers,  respectively.  Various  routes  are  available  for  shipping 
merchandise  to  the  stores.  These  are  shown  in  Fig  15.12,  where  the  capacity  of 
an  arc  uv  is  the  largest  number  of  snow  blowers  that  can  be  shipped  from  u  to  v 
in  the  course  of  a  single  day. 


A  supply  and  demand  network  (capacities  are  in  hundreds). 

The  vertices  in  the  middle  should  be  thought  of  as  “middlemen,-’  baggage 
handlers,  for  example,  who  can  only  cope  with  so  many  snow  blowers  at  once. 
We  will  assume  that  freight  can  cover  as  many  arcs  as  necessary  in  a  single  day. 
Can  all  the  required  snow  blowers  reach  the  stores  before  the  blizzard  arrives?  If 
not,  how  close  can  the  company  come  to  satisfying  demand? 

The  first  step  in  solving  this  problem  is  to  view  it  as  one  with  three  sources 
and  three  sinks.  Following  the  plan  proposed  earlier,  we  add  a  new  source  i  joined 
to  each  of  A,  B,  C  and  a  new  sink  t  to  which  D ,  E,  and  F  are  joined.  We  assign 
to  each  of  the  arcs  incident  with  s  a  capacity  equal  to  the  number  of  snow  blowers 
present  at  the  given  warehouse  and  to  each  arc  incident  with  t  a  capacity  equal 
to  the  number  of  snow  blowers  demanded  at  the  given  store.  The  new  network  is 
shown  on  the  left  in  Fig  15.13. 


5 

t  s 

9 


A  supply  and  demand  network  and  a  maximum  flow  (capacities  and  flows  in  hundreds). 

The  network  we  have  constructed  has  a  nice  physical  interpretation.  Imagine 
a  new  giant  warehouse  s  containing  all  1900  snow  blowers.  The  labels  on  the 
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arcs  from  s  indicate  the  number  of  snow  blowers  that  can  be  transferred  to  each 
warehouse  if  need  be.  Similarly,  we  have  created  a  single  sink  which  could  take 
up  to  1900  snow  blowers.  The  routing  of  these  machines  will  indicate  which 
demands  are  actually  met. 

Applying  the  Max  Flow — Min  Cut  procedure  of  the  previous  section,  we 
obtain  the  maximum  flow  for  our  network  shown  on  the  right  in  Fig  15.13.  We 
know  that  the  flow  is  maximum  because  the  cut 

S  =  {s,  B,C,  1,  •/},  T  =  {A,  G,  H,  D,  E,  F,  t] 

has  capacity  cap(<S,  T)  =  500  +  100  +  500  +  100  +  300  =  1500,  which  is  the 
value  of  the  flow. 

We  conclude  that  the  snow  blowers  cannot  all  be  delivered  before  the  blizzard 
arrives;  in  fact,  the  best  one  can  do  is  to  deliver  1500.  On  the  right  of  Fig  15.13, 
we  see  one  routing  by  which  this  occurs,  one  in  which  stores  D  and  E  receive 
600  snow  blowers  each  while  store  F  receives  300.  This  solution  is  not  unique, 
however.  For  example,  some  of  the  snow  blowers  destined  for  E  could  be  diverted 
to  F  via  arc  H  F.  Such  a  modification  in  shipping  arrangements  could  be  justified 
on  the  grounds  that  F  is  currently  receiving  only  half  of  its  requirements  while 
the  demands  of  E  have  been  completely  met. 


Undirected  Edges 

Suppose  a  network  contains  some  edges  which  are  undirected.  (Some  of  the  pipes 
allow  flow  in  two  directions.)  This  turns  out  to  be  an  easy  situation  to  han¬ 
dle.  Replace  each  undirected  edge  xy  by  two  directed  arcs,  xy  and  yx,  each 
having  the  same  capacity  as  the  original  undirected  edge.  Two  examples  are 
presented  in  the  exercises  (Exercise  5).  For  these,  it  is  easiest  just  to  “imag¬ 
ine”  two  arcs,  one  going  each  way,  rather  than  to  draw  them  physically  on  the 
graph. 


Edge  Disjoint  Paths 


In  order  to  deliver  an  important  message,  messengers  are  to  be  sent  from  s  to  t  via 
a  network  of  roads.  Because  terrorists  are  active  in  the  area,  it  is  prudent  for  the 
messengers  to  use  different  roads.  What  is  the  largest  number  of  messengers  that 
can  be  sent,  subject  to  this  constraint?  Equivalently,  how  many  edge  disjoint  paths 
are  there  from  s  to  t?  A  beautiful  theorem  of  Karl  Menger,  which  is  essentially  a 
corollary  of  the  Max  Flow — Min  Cut  result  (Theorem  15.2.2)  (though  it  was  first 
established  much  earlier),  answers  this  question. 


15.3.1  THEOREM 


(Menger3)  The  maximum  number  of  edge  disjoint  paths  between  two  vertices  s 
and  t  in  a  graph  is  the  minimum  number  of  edges  whose  removal  leaves  no  path 
between  5  and  t. 


3K.  Menger,  “Zur  allgemeinen  Kurventheorie,”  Fundamenta  Mathematicae  10  (1927),  95-1 15. 
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EXAMPLE  2  Consider  the  graph  shown  to  the  right. 

The  removal  of  edges  ab  and  cf  severs  all 
paths  from  i  to  t  and,  by  inspection,  we  see  that 
the  removal  of  no  single  edge  achieves  this. 

Thus,  the  minimum  number  of  edges  required 
to  cut  all  paths  from  s  to  t  is  two.  Note  that 
sabt  and  scfbgt  are  two  edge  disjoint  paths 
from  s  to  t.  According  to  Menger’s  Theorem, 
there  do  not  exist  more  than  two  edge  disjoint 
paths,  a  fact  easily  checked.  A 

To  obtain  this  result  from  the  Max  Flow — Min  Cut  Theorem,  we  convert  the 
given  graph  to  a  network  by  replacing  each  edge  between  vertices  v  and  w  by 
arcs  vw  and  wv,  each  of  capacity  1,  and  then  argue  that  a  maximum  flow  in  this 
network  is  precisely  the  number  of  edge  disjoint  paths  from  s  to  t.  We  refer  you 
to  Dantzig  and  Fulkerson4  for  details. 

In  Section  15.2,  we  developed  a  procedure  for  finding  a  maximum  flow  in 
a  network.  Therefore,  we  have  an  algorithm  for  finding  all  the  edge  disjoint 
paths  in  a  graph.  Be  forewarned  that  the  application  of  this  algorithm  with¬ 
out  computer  assistance  may  not  be  easy.  While  it  is  simple  for  a  computer 
to  take  account  of  arcs  in  each  direction  between  two  vertices  (via  the  adjacency 
matrix,  for  instance),  as  we  mentioned  in  connection  with  undirected  edges,  it 
is  better  when  proceeding  by  hand  just  to  imagine  the  arcs  rather  than  trying  to 
draw  them. 

Job  Assignments 

Suppose  there  are  m  people  P\,  P2 . Pm ,  and  n  jobs  J\,  J2,  ■ .  ■  ,  and  that 

only  certain  people  are  qualified  for  certain  jobs.  Is  it  possible  to  assign  to  each 
person  a  job  for  which  he  or  she  is  qualified?  If  not,  how  close  can  we  come?  We 
represent  this  problem  as  a  network  in  Fig  15.14. 


c  d 


Figure  15.14 


A  source  5  is  joined  to  each  person  by  an  arc  having  capacity  1  and  each 
job  is  joined  to  t  by  an  arc  having  capacity  1  also.  If  person  P,  is  qualified  for 
job  .4,  then  these  vertices  are  joined  and  given  any  nonzero  integer  capacity.  (A 
capacity  of  1  is  fine,  but  in  the  next  section,  we  will  see  that  other  capacities  are 


4G.  B.  Dantzig  and  D.  R.  Fuikerson,  “On  the  Max  Flow — Min  Cut  Theorem  of  Networks,”  Linear 
Inequalities  and  Related  Systems,  215-221.  Annals  of  Mathematics  Studies,  no.  38,  Princeton  University 
Press,  Princeton.  N.  J..  1956. 
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sometimes  useful  as  well.)  A  maximum  flow  through  this  network  will  provide 
the  best  possible  set  of  job  assignments. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  In  the  figure,  warehouses  a ,  b ,  and  c  have  sup¬ 
plies  of  30,  20,  and  10  Klein  bottles,  respectively;  retail 
outlets  d  and  e  require  30  and  25  Klein  bottles,  respec¬ 
tively.  The  capacities  on  arcs  represent  the  maximum 
number  of  Klein  bottles  which  can  be  shipped  along  that 
path.  Can  the  demand  be  met?  If  not,  find  the  maximum 
number  of  Klein  bottles  which  can  be  sent. 
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2.  Repeat  the  previous  question  after  reversing  the  arrow 
on  edge  AB. 

3.  [BB]  Four  warehouses  A ,  B ,  C,  and  D  with  monthly 
shipping  capacities  of  4,  10,  3,  and  2  units,  respectively, 
are  linked  via  the  network  shown  to  three  retail  outlets 
E.  F,  G  whose  monthly  requirements  are  4,  5,  and  6 
units,  respectively.  The  monthly  capacity  of  each  link 
in  the  network  is  as  indicated. 


Can  these  warehouses  meet  the  needs  of  the  retail  out¬ 
lets?  If  not,  find  the  maximum  number  of  units  which 
the  three  outlets  (combined)  can  receive  in  a  month. 
Explain  your  answer. 

4.  Answer  the  previous  question  again,  this  time  assuming 
that  the  arc  is  ab.  not  ba. 


5.  Find  a  maximum  flow  for  each  of  the  following  net¬ 
works.  (Note  that  some  edges  are  undirected.) 

(a)  [BB]  , 


6.  Verify  Menger's  Theorem,  Theorem  15.3.1,  for  the  pla¬ 
nar  graph  of  the  cube  with  .?  and  t  as  shown. 

s 


7.  [BB]  The  diagram  represents  a  system  of  roads  from  A 
to  B.  The  mayor  of  town  A  wishes  to  send  a  message 
to  the  mayor  of  town  6,  and  since  all  other  means  of 
communication  are  tapped,  he  sends  out  a  certain  num¬ 
ber  of  trusted  individuals  on  foot.  If  all  paths  taken  by 
the  individuals  are  to  be  disjoint,  how  many  messengers 
can  he  employ? 


8.  Repeat  Exercise  7  for  the  following  network  of  roads. 
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9.  [BB]  Verify  Menger’s  Theorem  for  the  complete  graph 

ICn- 

10.  Verify  Menger’s  Theorem  for  the  complete  bipartite 
graph  £„,m  in  each  of  the  following  cases. 

(a)  vertices  .v  and  t  lie  in  the  bipartition  set  of  order  n 

(b)  vertices  s  and  t  lie  in  different  bipartition  sets 


15.4  Matchings 


In  Section  15.3,  we  saw  how  the  procedure  we  have  described  for  finding  a 
maximum  flow  could  be  applied  to  the  problem  of  assigning  jobs  to  job  appli¬ 
cants.  Actually,  this  is  just  one  example  of  a  whole  host  of  problems  concerning 
matchings  in  graphs.  We  take  a  brief  look  at  this  area  in  this  section. 


15.4.1  DEFINITIONS 


A  matching  in  a  graph  is  a  set  of  edges  with  the  property  that  no  vertex  is  incident 
with  more  than  one  edge  in  the  set.  A  vertex  which  is  incident  with  an  edge  in 
the  set  is  said  to  be  saturated.  A  matching  is  perfect  if  and  only  if  every  vertex  is 
saturated;  that  is,  if  and  only  if  every  vertex  is  incident  with  precisely  one  edge 
of  the  matching. 


EXAMPLES  3  In  the  graph  shown  on  the  left  in  Fig  15.15, 

•  the  single  edge  be  is  a  matching  which  saturates  b  and  c,  but  neither  a  nor  d ; 

•  the  set  { be ,  bd\  is  not  a  matching  because  vertex  b  belongs  to  two  edges; 

•  the  set  { ab ,  cd }  is  a  perfect  matching.  A 

«1  «2  «3 

.M 

VI  v2  v3  v4 


Figure  15.15  Edge  set  ( ab ,  cd]  is  a  perfect  matching  in  the  graph  on  the  left.  In  the  graph  on  the  right,  edge 
set  lujv2,  u2v4,  u3vj)  is  a  matching  which  is  not  perfect. 

It  is  worth  noticing  that  if  a  matching  is  perfect,  the  vertices  of  the  graph 
can  be  partitioned  into  two  sets  of  equal  size  and  the  edges  of  the  matching 
provide  a  one-to-one  correspondence  between  these  sets.  In  the  graph  on  the  left 
in  Fig  15.15,  for  instance,  the  edges  of  the  perfect  matching  { ab ,  cd }  establish  a 
one-to-one  correspondence  between  {a,  c}  and  {b,  d}\  a  h*  b ,  c  i-*  d. 

EXAMPLES  4  In  the  graph  on  the  right  of  Fig  15.15, 

•  the  set  of  edges  [u\v2.  u2v4.  u3v\]  is  a  matching  which  is  not  perfect  but 
which  saturates  Vi  =  {u\,  u2 ,  u 3}; 

•  no  matching  can  saturate  V2  =  {iq,  v2,  v3, 14}  since  such  a  matching  would 

require  four  edges,  but  then  at  least  one  «,  would  be  incident  with  more  than 
one  edge.  A 
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Any  assignment  of  workers  to  jobs  in  the  job  assignment  application  of 
Section  15.3  is  a  matching  since  no  worker  is  given  two  jobs  and  each  job  is 
filled  by  at  most  one  worker.  In  this  case,  we  are  thinking  of  the  graph  whose 
vertices  correspond  to  workers  and  jobs  and  where  edges  join  workers  to  jobs 
for  which  they  are  qualified.  This  graph  is  bipartite,  and  our  discussion  here  will 
initially  be  restricted  just  to  this  type  of  graph.  Later  we  will  comment  briefly  on 
matchings  for  arbitrary  graphs. 

Let  Q  be  a  bipartite  graph  with  bipartition  sets  Vi  and  V2  and  assume  that 
|Vi|  <  |V2|.  Thinking  of  V\  as  “workers”  and  V2  as  “jobs,”  we  can  follow  the 
procedure  outlined  at  the  end  of  Section  15.3  to  obtain  a  maximal  matching  for 
Q,  that  is,  a  matching  which  saturates  as  many  vertices  as  possible.  One  question 
remains  unanswered,  however — is  every  vertex  in  Vi  saturated  by  this  maximal 
matching?  In  the  language  of  job  assignments,  was  every  worker  successful  in 
finding  a  job? 

Clearly,  if  there  are  more  workers  than  jobs — |Vi  |  >  |  V2I — it  would  be  impos¬ 
sible  to  employ  everybody.  Thus,  the  condition  |Vi  |  <  |  V2I  is  necessary  for  Vj  to 
be  saturated.  More  generally,  if  for  any  subset  of  k  workers,  the  number  of  jobs 
for  which  they  are  collectively  qualified  is  less  than  k,  then  again  it  would  be 
impossible  to  employ  everybody.  In  other  words,  if  X  is  any  subset  of  Vi  and  we 
define  A(X)  to  be  the  set  of  all  vertices  in  V2  which  are  adjacent  to  a  vertex  in  X, 

A(X)  =  {v  €  V2  |  vx  is  an  edge  for  some  a:  6  X}, 
then  we  require  |X|  <  |A(X)|. 

EXAMPLE  5  In  the  figure  to  the  right,  if  X  =  {u\  ,U2,U/\],  u  1  f<2  W3  M4  V] 

then  A(X)  =  {n3,  u4}.  Since  |X|  £  |A(X)|,  the 
workers  in  X  cannot  all  find  jobs  for  which 
they  are  qualified.  There  is  no  matching  in  this 
graph  which  saturates  Vj.  A 

V]  v2  1)3  r4  y2 


It  is  surprising  that  the  so  obviously  necessary  condition  |X|  <  |A(X)|  for 
all  X  c  Vi  is  also  sufficient  for  the  existence  of  a  matching  which  saturates  Vi . 
This  was  discovered  in  1935  by  the  English  mathematician  Philip  Hall. 


15.4.2  THEOREM 


(Hall5)  If  Q  is  a  bipartite  graph  with  bipartition  sets  Vj  and  V2,  then  there  exists 
a  matching  which  saturates  V]  if  and  only  if,  for  every  subset  X  of  Vi,  |X|  < 
|A(X)|. 


Proof  It  remains  to  prove  that  the  given  condition  is  sufficient,  so  we  assume  that 
| X |  <  |  A(X) |  for  all  subsets  X  of  V\.  In  particular,  this  means  that  every  vertex 
in  Vi  is  joined  to  at  least  one  vertex  in  V2  and  also  that  |Vi  |  <  |V2|.  Assume  that 
there  is  no  matching  which  saturates  all  vertices  of  Vi  -  We  derive  a  contradiction. 

We  turn  Q  into  a  directed  network  in  exactly  the  same  manner  as  with  the  job 
assignment  application  of  Section  15.3.  Specifically,  we  adjoin  two  vertices  s  and 
Mo  Q  and  draw  directed  arcs  from  .v  to  each  vertex  in  V]  and  from  each  vertex 
in  V2  to  t.  Assign  a  weight  of  1  to  each  of  these  new  arcs.  Orient  each  edge  of 


5  P.  Hall,  “On  representations  of  subsets,”  Journal  of  the  London  Mathematical  Society  10  ( 1 935 ),  26-30. 
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Q  from  its  vertex  in  V]  to  its  vertex  in  V2,  and  assign  a  large  integer  I  >  |Vi| 
to  each  of  these  edges.  As  noted  before,  there  is  a  one-to-one  correspondence 
between  matchings  of  Q  and  (s,  t)-flows  in  this  network,  and  the  value  of  the 
flow  equals  the  number  of  edges  in  the  matching. 

Since  we  are  assuming  that  there  is  no  matching  which  saturates  Vi,  it  follows 
that  every  flow  has  value  less  than  |Vj|  and,  hence,  by  the  Max  Flow-Min  Cut 
Theorem,  there  exists  an  (5,  r)-cut  {5,  T)  (s  £  S,  t  e  T)  whose  capacity  is  less 
than  ]Vi|. 

Now  every  original  edge  of  Q  has  been  given  a  weight  larger  than  |  V;  |.  Since 
the  capacity  of  our  cut  is  less  than  |  Vi  | ,  no  edge  of  Q  can  join  a  vertex  of  S  to 
a  vertex  of  T.  Letting  X  =  V]  H  S,  we  have  A(X)  C  S.  Since  each  vertex  in 
A(X)  is  joined  to  /  e  T,  each  such  vertex  contributes  1  to  the  capacity  of  the  cut. 
Similarly,  since  .v  is  joined  to  each  vertex  in  Vi  \  X,  each  such  vertex  contributes 
1.  Since  |X|  <  |A(X)|,  we  have  a  contradiction  to  the  fact  that  the  capacity  is 
less  than  |  V|  |.  I 

Theorem  15.4.2  is  often  called  Hall’s  Marriage  Theorem  and  stated  in  the 
following  way: 

Given  a  set  of  boys  and  a  set  of  girls  some  of  whom  know  some  of 
the  boys,  then  the  girls  can  get  married,  each  to  a  boy  she  knows,  if 
and  only  if  any  k  of  the  girls  collectively  know  at  least  k  boys. 


PROBLEM  6.  Let  Q  be  a  bipartite  graph  with  bipartition  sets  Vi ,  V2  in  which 
every  vertex  has  the  same  degree  k.  Show  that  Q  has  a  matching  which  satu¬ 
rates  Vi. 

Solution.  Let  X  be  any  subset  of  Vi  and  let  A(X)  be  as  defined  earlier.  We 
count  the  number  of  edges  joining  vertices  of  X  to  vertices  of  A(X).  On  the  one 
hand  (thinking  of  X),  this  number  is  k\X\.  On  the  other  hand  (thinking  of  A(X)), 
this  number  is  at  most  k|A(X)|  since  k\A(X)\  is  the  total  degree  of  all  vertices 
in  A(X).  Hence,  k\X\  <  £|A(X)|,  so  |X|  <  |A(X)|  and  the  result  follows  from 
Hall’s  Theorem.  I 

Can  you  conclude  from  this  problem  that  Q  also  has  a  matching  which  saturates 
V2?  More  generally,  does  Q  have  a  matching  which  saturates  both  Vi  and  V2  at 
the  same  time  (a  perfect  matching)? 

Imagine  a  police  captain  standing  in  front  of  all  the  police  constables  in  her 
precinct.  She  wishes  to  divide  her  troops  into  pairs  for  patrol  duty,  but  there 
are  restrictions  on  how  this  can  be  carried  out  because  certain  constables  do  not 
work  well  with  certain  other  constables.  Is  it  possible  for  the  captain  to  pair  her 
constables  in  such  a  way  that  all  restrictions  are  met  and,  if  not,  how  close  can 
she  come? 

It  is  easy  to  see  how  graph  theory  can  be  used  here.  We  make  a  graph  in  which 
vertices  represent  police  constables  and  an  edge  indicates  that  the  corresponding 
constables  can  be  paired.  A  perfect  matching  of  this  graph  would  give  a  pairing  of 
the  type  desired.  Failing  this,  we  seek  a  matching  containing  as  many  pairings  as 
possible.  Notice  that  in  this  situation,  there  is  no  reason  for  the  graph  to  be  bipartite. 

We  are  most  interested  in  determining  whether  or  not  an  arbitrary  graph 
Q  has  a  perfect  matching,  that  is,  one  which  saturates  every  vertex.  A  theorem 
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giving  necessary  and  sufficient  conditions,  similar  in  spirit  to  Theorem  15.4.2,  was 
established  by  Tutte  in  1947, 6  but  we  content  ourselves  here  with  one  example  of 
a  sufficient  condition.  For  this,  we  require  the  following  proposition,  the  proof  of 
which  is  left  to  the  exercises. 


15.4.3  PROPOSITION 


Let  Q  be  a  graph  with  vertex  set  V. 

1.  If  Q  has  a  perfect  matching,  then  |V|  is  even. 

2.  If  Q  has  a  Hamiltonian  path  or  cycle,  then  Q  has  a  perfect  matching  if  and 
only  if  |V|  is  even. 


It  is  now  easy  to  prove  our  sample  result. 


15.4.4  THEOREM 


If  Q  is  a  graph  with  vertex  set  V,  |  V|  is  even,  and  each  vertex  has  degree  d  >  5  V|, 
then  Q  has  a  perfect  matching. 


Proof 


Answers  to  Pauses 


If  |V|  =  2,  then  Q  is  o — o,  which  clearly  has  a  perfect  matching.  If  | V |  > 
4,  then  Dirac’s  Theorem  (10.2.4)  tells  us  that  Q  is  Hamiltonian,  and  part  2  of 
Proposition  15.4.3  gives  us  the  answer.  I 

6.  Yes,  the  same  argument  works.  But  more  easily,  note  that  since  Q  is  bipartite, 
the  sum  of  the  degrees  of  vertices  in  V\  must  equal  the  sum  of  the  degrees 
of  vertices  in  V2.  Since  all  vertices  have  the  same  degree,  we  conclude  that 
|  Vi  1  =  |  V2 1 ,  so  a  matching  which  saturates  Vi  must  automatically  saturate  V2 
as  well,  and  vice  versa. 


EXERCISES 


The  symbol  [BB]  means  that  an  answer  can  be  found  in  the  Back  of  the  Book. 


1.  [BB]  Five  ladies  have  men  friends  as  shown  in  the  fol¬ 
lowing  table. 


Lady 

Men  Friends 

Karen 

David,  Stuart,  Paul,  Roger 

Mary 

Stuart,  Philip 

Aurie 

David,  Paul 

Pamela 

Paul,  Roger 

Lynne 

Stuart,  Paul 

Draw  a  bipartite  graph  depicting  this  situation.  Find  a 
way  in  which  each  lady  can  marry  a  man  she  knows,  or 
use  Hall’s  Marriage  Theorem  to  explain  why  no  such 
matching  exists. 


2.  Repeat  the  previous  question  with  reference  to  the  fol¬ 
lowing  table. 


Lady 

Men  Friends 

Karen 

Stuart,  Paul 

Mary 

Philip,  Stuart 

Aurie 

David,  Paul,  Roger,  Stuart 

Pamela 

Philip,  Paul 

Lynne 

Stuart,  Philip,  Paul 

3.  [BB]  Determine  whether  or  not  the  graph  has 
a  perfect  matching. 


6See  J.  A.  Bondy  and  U.  S.  R.  Murty,  Graph  Theory  with  Applications ,  North-Holland,  New  York 
(1976),  76ff. 
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4.  Angela,  Brenda,  Christine,  Helen,  Margaret,  Phyllis,  and 
Renee  are  seeking  to  fill  eight  vacant  managerial  posi¬ 
tions  A,  B,  C,  D,  E ,  F,  G,  and  H .  Angela  can  fill  A, 
B,  and  F;  Brenda  can  fill  B,  C,  and  F;  Christine  can 
fill  C,  E,  and  A;  Helen  can  fill  D,  G,  and  //;  Margaret 
can  fill  E,  A,  and  F;  Phyllis  can  fill  F,  A,  and  C;  Renee 
can  fill  C,  E,  and  F. 

(a)  [BB]  Show  that  it  is  not  possible  for  all  seven 
women  to  fill  managerial  positions  by  showing 
that  the  condition  in  Theorem  15.4.2  is  not 
satisfied. 

(b)  Find  a  matching  which  allows  as  many  women  as 
possible  to  fill  positions. 

5.  (Based  upon  a  problem  of  Tucker7)  New  Horizons  Dat¬ 
ing  Service  has  13  male  and  13  female  clients.  Each  of 
the  male  clients  selects  five  females  that  he  would  like 
to  date,  and  each  female  selects  five  males.  By  coinci¬ 
dence,  it  is  discovered  that  every  choice  is  mutual;  that 
is,  a  male  selected  a  certain  female  if  and  only  if  that 
female  selected  that  male. 

(a)  Show  that  it  is  possible  to  match  the  couples  into  13 
different  pairs,  following  the  aforementioned  selec¬ 
tions. 

(b)  Show  that  it  is  possible  to  repeat  the  matching  pro¬ 
cedures  given  in  (a)  on  five  consecutive  nights  such 
that  no  individual  dates  the  same  person  twice. 

6.  Bruce,  Edgar,  Eric,  Herb,  Maurice,  Michael,  Richard, 
and  Roland  decide  to  go  on  a  week-long  canoe  trip  in 
Labrador.  They  must  divide  themselves  into  pairs,  one 
pair  for  each  of  four  canoes.  The  following  table  indi¬ 
cates  those  people  who  are  willing  to  paddle  with  each 
other. 

(a)  [BB]  Find  a  way  of  dividing  the  campers  into  four 
canoes. 

(b)  Explain  why  it  is  impossible  to  find  a  pairing  in 
which  Roland  and  Bruce  share  a  canoe. 

(c)  Halfway  through  the  trip,  Roland  decides  that  he 
cannot  share  a  canoe  with  Eric  and  Edgar  refuses  to 
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share  a  canoe  with  Herb.  Is  it  now  possible  to  find 
a  suitable  matching? 


|  Bruce  Edgar  Eric  Herb  Maurice  Michael  Richard  Roland 

Bruce 

/ 

/ 

/ 

Edgar 

/ 

/ 

/ 

Eric 

/ 

/ 

/ 

Herb 

/ 

/ 

/ 

Maurice 

■/ 

/ 

/ 

Michael 

/ 

/ 

/ 

Richard 

/ 

Roland 

/ 

/ 

/ 

/ 

7.  (a)  [BB]  Given  a  set  S  and  n  subsets  Ai,  A2, . . .  ,  A„ 

of  S,  it  is  possible  to  select  distinct  elements  s\ , 
S2,  ■■■  , s„  of  S  such  that  si  e  A\,  sz  e  A2,  . . .  , 
s„  e  An  if  and  only  if,  for  each  subset  X  of  [1, 
2, . . .  ,  n)  the  number  of  elements  in  [  Jr,y  A,  is  at 
least  |X|.  Why? 

(b)  In  a  certain  game  of  solitaire,  the  52  cards  of  a 
standard  deck  are  dealt  face  up  into  13  columns 
of  4  cards  each.  Show  that  it  is  always  possible  to 
select  13  cards,  one  of  each  of  the  13  denominations 
2,  3, . . .  ,  10,  Jack,  Queen,  King,  Ace,  including  at 
least  one  card  from  each  column. 

[This  problem  comes  from  an  article  entitled  “Mar¬ 
riage,  Magic  and  Solitaire”  by  David  B.  Leep  and  Gerry 
Myerson,  which  appeared  in  the  American  Mathematical 
Monthly,  Vol.  106  (1999),  no.  5,  419-429.] 

8.  Suppose  Vj,  V2  are  the  bipartition  sets  of  a  bipartite 
graph  Q.  Let  m  be  the  smallest  of  the  degrees  of  the 
vertices  in  V\  and  M  the  largest  of  the  degrees  of  the 
vertices  in  V2.  Prove  that  if  m  >  M,  then  Q  has  a  match¬ 
ing  which  saturates  Vj. 

9.  (a)  [BB]  Suppose  Vi  and  V2  are  the  bipartition  sets  in 

a  bipartite  graph  Q.  If  |Vj  |  >  |  V2I,  then  it  is  clearly 
impossible  to  find  a  matching  which  saturates  Vi. 
State  a  result  which  is  applicable  to  this  case  and 
give  a  necessary  and  sufficient  condition  for  your 
result  to  hold. 

(b)  How  should  your  result  in  (a)  be  interpreted  when 
applied  to  the  job  assignment  problem? 

10.  (a)  [BB]  Determine  necessary  and  sufficient  conditions 
for  the  complete  bipartite  graph  to  have  a  per¬ 
fect  matching. 

(b)  In  those  cases  where  KLm „  does  not  have  a  perfect 
matching,  find  a  matching  which  saturates  as  many 
vertices  as  possible. 


7  See  Chapter  10  of  Alan  Tucker,  Applied  Combinatorics,  Wiley  (New  York),  1980. 
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11.  Let  Q  be  a  bipartite  graph  with  bipartition  sets  Vi ,  V2  and 
assume  that  Q  has  a  perfect  matching.  Add  two  vertices 
x  and  v  to  Q  such  that  x  is  adjacent  to  all  vertices  in  V] , 
y  is  adjacent  to  all  vertices  in  lA,  and  x  is  not  adjacent 
to  y. 

(a)  Show  that  Q  U  [x,  y]  is  bipartite. 

(b)  Show  that  Q  U  { x ,  y }  has  a  perfect  matching. 

12.  (a)  [BB]  Show  that  the  complete  graph  /C„  has  a  perfect 

matching  if  and  only  if  n  is  even. 


(b)  Let  e  be  a  fixed  edge  in  /C„,  where  n  >  2.  Does  the 
result  in  (a)  still  hold  for  fCn  \  [e]?  Explain.  What  if 
n  =  2? 

13.  Prove  Proposition  15.4.3. 

14.  [BB]  Is  the  converse  of  Proposition  15.4.3(1)  true? 
Explain. 

15.  [BB]  Is  the  converse  of  Theorem  15.4.4  true?  Explain. 

16.  Prove  that  if  a  tree  has  a  perfect  matching,  then  that 
matching  is  unique. 


REVIEW  EXERCISES  FOR  CHAPTER  15 


1.  (a)  Verify  the  law  of  conservation  of  flow  at  a  and  at  e. 

(b)  Find  the  value  of  the  indicated  flow. 

(c)  Find  the  capacity  of  the  (s,  t)-cut  defined  by 
5  =  {5-,  a,  b},  T  =  (c,  d,  e,  /,  f).  Illustrate  Corol¬ 
lary  15.1.5  for  this  cut. 

(d)  Can  the  flow  be  increased  along  the  path  sbftl 

(e)  Is  the  given  flow  maximum? 


a  4,2  d 


2.  In  the  directed  network  of  Exercise  1,  assume  that  the 
flow  along  each  of  the  paths  sbdt  and  sc  ft  has  been 
increased  by  one  unit. 

(a)  Show  that  every  path  from  .v  to  t  now  contains  a 
saturated  arc. 

(b)  Despite  (a),  show  that  sbeadt  is  a  flow-augmenting 
chain  from  ,v  to  t.  What  is  the  slack  in  this  chain? 

(c)  Find  a  maximum  flow  for  this  directed  network,  give 
its  value,  and  prove  that  it  is  maximum  by  appealing 
to  Theorem  15.2.2. 

3.  Find  a  maximal  ( s ,  /)-flow.  Verify  your  answer  by  find¬ 
ing  an  (x,  ?)-cut  whose  capacity  equals  the  value  of  the 
flow. 


5 


4.  One  example  of  an  (s,  t)-cut  in  a  directed  network  with 
vertex  set  V  is  the  case  5  =  ($},  T  =  V  \  {5}.  Assume 
you  have  a  flow  in  a  directed  network  and  you  dis¬ 
cover  that  the  capacity  of  the  cut  just  mentioned  equals 
the  value  of  the  flow.  Prove  that  every  arc  leaving  s 
must  be  saturated.  (You  may  assume  no  flow  is  going 
into  s.) 

5.  Four  warehouses  A,  B,  C,  D  with  shipping  capacities  6, 
8,  7,  9,  respectively,  are  linked  via  the  network  shown 
to  four  retail  outlets  E,  F,  G,  H  with  requirements  5, 
4,  9,  7,  respectively.  Can  these  requirements  be  met?  If 
not,  find  the  maximum  number  of  units  which  the  four 
outlets  (combined)  can  receive. 


9 
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6.  Find  a  maximum  flow. 


7.  (a)  Which  graphs  have  the  property  that  for  any  pair 

of  vertices  .v  and  t,  the  number  highlighted  by 
Menger’s  Theorem  is  1? 

(b)  Which  Hamiltonian  graphs  have  the  property  that 
for  any  pair  of  vertices  s  and  f,  the  number  high¬ 
lighted  by  Menger’s  Theorem  is  2? 

8.  The  six  teams  entering  the  final  round  of  the  World 
Hockey  Championships  are  Canada,  Finland,  Russia, 
Slovakia,  Romania,  and  the  United  States.  Albert,  Bruce, 
Craig,  Camelia,  Oana  and  Yuri  wish  to  bet  with  each 
other  on  who  the  winner  will  be  and,  ideally,  they  would 
like  to  all  select  different  teams.  The  teams  that  each 
bettor  is  willing  to  support  are  shown  in  the  follow¬ 
ing  table. 


Bettor 

Teams 

Albert 

Canada,  Finland,  Slovakia,  USA 

Bruce 

Slovakia,  Romania 

Craig 

Russia,  Slovakia 

Camelia 

Russia,  Romania 

Oana 

Canada,  Finland,  Russia,  Romania,  USA 

Yuri 

Russia,  Slovakia,  Romania 

Can  these  people  indeed  bet  on  different  teams?  Give 
such  a  selection  or  use  Hall’s  Marriage  Theorem  to 
explain  why  such  a  selection  is  impossible. 

9.  Do  Exercise  8  again  with  assumptions  as  before  but  for 
three  exceptions: 

.  Albert  is  now  happy  not  to  bet  on  Finland, 

•  Oana  will  no  longer  bet  on  Russia  or  Romania,  and 

•  Bruce  is  now  willing  to  bet  on  Canada. 

10.  Let  Q  be  a  connected  graph,  every  vertex  of  which  has 
degree  2.  Prove  that  Q  has  a  perfect  matching  if  and 
only  if  Q  has  an  even  number  of  vertices. 

11.  Let  Q  be  a  connected  graph  every  vertex  of  which  has 
degree  3. 

(a)  Show  that  Q  must  have  an  even  number  of  vertices. 

(b)  Must  Q  have  a  perfect  matching? 


Solutions  to  Selected  Exercises 


Exercises  1.1 

1.  (a)  True,  because  both  4  =  2  +  2  and  7  <  \/50  are  true  statements. 

(c)  True,  because  both  hypothesis  and  conclusion  are  true. 

(e)  True,  because  this  is  an  implication  with  false  hypothesis. 

(g)  False,  because  the  hypothesis  is  true  but  the  conclusion  is  false. 

2.  (a)  This  is  true:  The  hypothesis  is  true  only  when  a  >  b  and  b  >  a,  that  is,  when  a  =  b,  and  then  the  conclusion  is  also 

true. 

3.  (a)  a2  <  0  and  a  is  a  real  number  (more  simply,  a  —  0). 

(c)  x  /  1  and  x  /  —  1. 

(e)  There  exists  a  real  number  x  such  that  n  <  x  for  every  integer  n. 

(g)  Every  planar  graph  can  be  colored  with  at  most  four  colors. 

(i)  There  exist  integers  a  and  b  such  that  for  all  integers  q  and  r,  b  ^  qa  +  r. 

4.  (a)  Converse:  If  f  is  an  integer,  then  |  and  *  are  also  integers. 

Contrapositive:  If  f  is  not  an  integer,  then  |  is  not  an  integer  or  *  is  not  an  integer. 

(c)  Converse:  A  connected  graph  is  Eulerian. 

Contrapositive:  If  a  graph  is  not  connected,  then  it  is  not  Eulerian. 

(e)  Converse:  A  four-sided  figure  is  a  square. 

Contrapositive:  If  a  figure  does  not  have  four  sides,  then  it  is  not  a  square. 

5.  (a)  There  exists  a  continuous  function  which  is  not  differentiable. 

(c)  For  every  real  number  x,  there  exists  a  real  number  y  such  that  y  >  x. 

(e)  For  every  positive  integer  n,  there  exist  primes  p i,  P2 . Pi  such  that  n  =  p\P2  ■  •  •  Pt- 

Exercises  1.2 

1.  (a)  Hypothesis:  a  and  b  are  positive  numbers;  Conclusion:  a  +  b  is  positive. 

2.  This  statement  is  true.  Suppose  the  hypothesis,  x  is  an  even  integer,  is  true.  Then  x  —  2k  for  some  other  integer  k. 
Then  x  +  2  =  2k  +  2  =  2(k  +  1)  is  also  twice  an  integer.  So  x  +  2  is  even.  The  conclusion  is  also  true. 

6.  The  converse  is  the  statement,  “A  continuous  function  is  differentiable.”  This  is  false. 

8.  A  is  true.  It  expresses  the  fact  that  every  real  number  lies  between  two  consecutive  integers.  Statement  23  is  most 

definitely  false.  It  asserts  that  there  is  a  remarkable  integer  n  with  the  property  that  every  real  number  lies  in  the  unit 
interval  between  n  and  n  +  1 . 

11.  (a)  Case  I:  a  is  even.  In  this  case,  we  have  one  of  the  desired  conclusions. 

Case  2:  a  is  odd.  In  this  case,  a  =  2m  +  I  for  some  integer  m,  so  a  +  1  =  2m  +  2  =  2 (m  +  1)  is  even,  another 

desired  result. 

(b)  nr  +  n  =  n(n  +  1)  is  the  product  of  consecutive  integers  one  of  which  must  be  even;  so  «-  +  n  is  even. 

13.  2.v2  —  4x  +  3  =  2(.<r  —  2x)  +  3  =  2[(.t  —  l)2  —  1]  +  3  =  2(x  —  l)2  +  I  is  the  sum  of  I  and  a  nonnegative  number. 

So  it  is  at  least  1  and  hence  positive. 
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S-2  Solutions  to  Selected  Exercises 

15.  (— »•)  To  prove  this  direction,  we  establish  the  contrapositive,  that  is,  we  prove  that  n  odd  implies  n 2  odd.  For  this,  if 

n  is  odd,  then  n  =2m  +  ]  for  some  integer  m.  Thus  n 2  =  4 m2  +  4m  +  1  =  2(2 m2  +  2m)  +  1  is  odd. 

(«— )  Here  we  assume  that  n  is  even.  Therefore,  n  —  2m  for  some  integer  m.  So  tr  =  (2m)2  =  4 m2  =  2(2 nr)  which 
is  even,  as  required. 

17.  Since  n  is  odd,  n  =  2k  +  1  for  some  integer  k. 

Case  1 :  k  is  even. 

In  this  case  k  =  2m  for  some  integer  m,  so  n  =  2(2 m)  +  1  =  4m  +  1. 

Case  2:  k  is  odd. 

In  this  case,  k  =  2m  +  1  for  some  integer  m,  so  n  =  2(2 m  +  1)  +  1  =  4m  +  3. 

Since  each  case  leads  to  one  of  the  desired  conclusions,  the  result  follows. 

20.  Since  0  is  an  eigenvalue  of  A,  there  is  a  nonzero  vector  x  such  that  Ax  =  0.  Now  suppose  that  A  is  invertible.  Then 
A-1  (Ax)  =  A~'0  =  0,  so  x  =  0,  a  contradiction. 

22.  Observe  that  (1  4-  a)(l  +  b)  =  1  +  a  +  b  +  ab  =  1.  Thus  1  +  a  and  1  +  b  arc  integers  whose  product  is  1.  There  are 
two  possibilities:  \  +  a  —  \  +  b  =  \,  m  which  case  a  =  b  —  0,  or  1  +  a  =  1  +  b  =  —  1,  in  which  case  a  =  b  =  —2. 

We  begin  by  assuming  the  negation  of  the  desired  conclusion;  in  other  words,  we  assume  that  there  exist  real  numbers 
x,y,  z  which  simultaneously  satisfy  each  of  these  three  equations.  Subtracting  the  second  equation  from  the  first  we 
see  that  x  +  5y  —  4z  =  —2.  Since  the  third  equation  we  were  given  says  x  +  5y  —  4z  =  0,  we  have  x  +  5y  —  4 z  equal 
to  both  0  and  to  —2.  Thus,  the  original  assumption  has  led  us  to  a  contradiction. 

False;  x  =  y  =  0  is  a  counterexample. 

False:  x  =  0  is  a  counterexample. 

False:  a  =  b  =  */2  is  a  counterexample. 

Since  n 2  +  1  is  even,  n2  is  odd,  so  n  must  also  be  odd.  Writing  n  =2k+\,  then  n2  +  I  =2  m  says  4k2  +4k  +  2  =  2m, 
so  m  =  2 k2  +  2k  +  1  =  (k  +  l)2  +  k2  is  the  sum  of  two  squares  as  required. 

We  are  given  that  n2  +  I  =  2m  for  n  =  4373  and  m  =  9561565.  Since  n  =  2(2186)  4-  1,  our  solution  to  (a)  shows 
that  m  =  k2  +  (k+  l)2  where  k  =  2186.  Thus,  9561565  =  21862  +  21872. 

Exercises  1.3 


24. 


25.  (a) 
(c) 
(e) 
27.  (a) 

(b) 
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p 

Q 

r 

s 

-r 

qA(^n 

P  ^  [q  a  (-r)l 

-s 

(-s)  v  q 

T 

T 

T 

T 

F 

F 

F 

F 

T 

r  **  K-s)  v  q] 

IP  ->■  (q  a  (-0)1  v  [r  -h.  ((—s)  v  q)] 

T 

T 

P 

q 

r 

P^q 

q  -<■  r 

(p  -  q)  a  (q  — ►  0 

P-*  r 

[(p  q)  a  (q  -►  01  -*■  IP  -*•  r] 

T 

T 

T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

F 

F 

T 

T 

F 

T 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

F 

F 

T 

F 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

T 

F 

F 

T 

T 

F 

F 

T 

T 

T 

T 

T 

T 

F 

F 

F 

T 

T 

T 

T 

T 

Since  \(p  —>■  q  )  A  (q  r)\  —r  [p  —>■  r]  is  true  for  all  values  of  p,  q  and  r,  this  statement  is  a  tautology. 

(b)  If  p  implies  q  which,  in  turn,  implies  r,  then  certainly  p  implies  r. 

8.  (a)  We  are  given  that  A  is  false  for  any  values  of  its  variables.  An  implication  p  ->  q  is  false  only  if  p  is  true  and  q  is 
false.  Since  A  is  always  false,  A  -*  $  is  always  true.  So  it  is  a  tautology. 


10.  (a) 


(c) 


P 

q 

Py.q 

T 

T 

F 

T 

F 

T 

F 

T 

T 

F 

F 

F 

P 

q 

pvq 

pvq 

(pvq)-*  (pvq) 

T 

T 

F 

T 

T 

T 

F 

T 

T 

T 

F 

T 

T 

T 

T 

F 

F 

F 

F 

T 

The  truth  table  shows  that  (p  v  q)  —>■  (p  v  q)  is  true  for  all 
values  of  p  and  q,  so  it  is  a  tautology. 


Exercises  1.4 


1.  1 .  (Idempotence)  The  truth  tables  at  the  right  show  that  p  v  p  p  and  p 

p  a  p  <=>  p. 

p  vp 

T\ 

Pap 

T 

T 

T 

F 

F 

F 

F 

3.  (Associativity)  The  equality  of  the  fifth  and  seventh  columns  in  the  truth  table  shows  that  ((p  v  q)  v  r)  a=> 
(p  V  (q  V  r )). 
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p 

q 

r 

Pvt) 

(pvt))vr 

p  vr 

pv(t)vf) 

7 

7 

7 

7 

7 

T 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

F 

7 

7 

7 

7 

7 

7 

F 

7 

7 

7 

7 

7 

7 

The  equality  of  the  fifth  and  seventh  columns  in  the  truth  table  shows  that  ({p  a  q)  a  r)  <=>  (p  A  {q  A  r)). 


P 

P 

r 

PA£) 

(P  a  p)  a  r 

pxr 

P  a  (p  A  0 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

5. 


7. 


9. 


(Double  negation)  The  equality  of  the  first  and  third  columns  in  the  truth  table  shows 
that  p  (—’/?). 


P 

-P 

-H») 

7 

7 

7 

7 

7 

7 

The  two  truth  tables  show,  respectively,  that  pvl  <=>  1  and 
p  A  1  -<=>  p. 


P 

1 

p  v  1 

P 

1 

P  A  1 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

The  two  truth  tables  show,  respectively,  that  (p  v  (-■/»))  <=>  1  and  (p  a  (->p))  <==>■  0. 


P 

-P 

P  v  (-p) 

1 

P 

-P 

P  A  (-P) 

0 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

7 

1 1 .  The  third  and  sixth  columns  of  the  truth  table  show  that 

(P^q)  ->  (-p». 


13.  The  third  and  fifth  columns  of  the  truth  table  show  that 
(p  -»•  9)  ((-■p)  v?). 
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2.  (a)  Using  one  of  the  laws  of  De  Morgan  and  one  distributive  property,  we  obtain 

[(p  A  q)  V  (— '((—■/?)  V  <?))]  [(p  /\q)  v  (p  A  (-'<?))]  <S=^  [pA(«V(^))]  <=>  (p  A  1)  p. 


3.  (a) 


p 

Q 

PAP 

P  V  (P  A  £/) 

T 

T 

T 

T 

T 

F 

F 

T 

F 

T 

F 

F 

F 

F 

F 

F 

4.  (a)  Distributivity  gives  [(p  v  q)  A  (-ip)] 


(p  A  (-p))  v  (p  A  (-p)) 


[Ov((-p)A<?)] 


((-p)Ap). 


(d)  — >[(P  +■>  p)  V  (p  A  (^P))]  <!=>  [-(p  <7)  A 

6.  1 .  The  truth  table  shows  that  idempotence  fails. 


-’9))] 

<=>  t( 

P 

p  vp 

T 

F 

F 

F 

3.  Associativity  holds. 


P 

Q 

r 

pvq 

(pvq)wr 

q  v  r 

pv(qvn 

T 

T 

T 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

F 

T 

F 

T 

T 

F 

T 

F 

F 

T 

T 

T 

F 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

T 

F 

T 

T 

T 

T 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

F 

F 

F 

F 

7.  The  truth  table  shows  that  p  v  1  is  no  longer  1: 


P 

1 

p  v  1 

T 

T 

F 

9.  The  truth  table  shows  that  (p  V  (->p)) 


1: 


P 

-P 

P  v  C-p) 

1 

T 

F 

T 

T 

F 

T 

T 

T 

13.  This  is  no  longer  true: 


P 

q 

p^q 

-P 

(-P)  Y  q 

F 

T 

T 

T 

F 

7.  (b)  (p  A  q)  V  ((— ■  p)  a  (-■p))  is  in  disjunctive  normal  form. 

(c)  p  v  ((-■p)  A  q)  is  not  in  disjunctive  normal  form:  not  all  variables  are  included  in  the  first  term. 

8.  (a)  This  is  already  in  disjunctive  normal  form!  (b)  (p  A  q)  v  (p  a  (-><?)) 


Exercises  1.5 

1.  (a)  Since  [p  ->  (q  -►  r)]  [p  ->•  ((-^)  v  r)]  «=>■  [(— >p)  v  (-- q )  v  r],  [(—>/?)  v  r]  v  (-'?) 

the  given  argument  can  be  rewritten  as  shown  and  this  is  valid  by  q 

disjunctive  syllogism.  ( — > p)  v  r 
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(b)  We  analyze  with  a  truth  table.  In  row  one,  the  premises 
are  true  but  the  conclusion  is  not.  The  argument  is  not 
valid. 


p 

q 

r 

p^q 

qvr 

-q 

r  -+  (-q) 

T 

T 

T 

T 

T 

F 

F 

T 

T 

F 

T 

T 

F 

T 

T 

F 

T 

F 

T 

T 

T 

F 

T 

T 

T 

T 

F 

F 

T 

F 

F 

F 

F 

T 

T 

F 

T 

F 

T 

T 

F 

T 

F 

F 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

T 

T 

3.  (a)  We  analyze  with  a  truth  table.  There  are  five  rows  when  the  premises  are  all  true  and  in  each  case  the  conclusion  is 
also  true.  The  argument  is  valid. 


P 

q 

r 

pvq 

P  r 

q  -* r 

T 

T 

T 

T 

T 

T 

T 

T 

T 

F 

T 

F 

F 

F 

T 

F 

T 

T 

T 

T 

T 

F 

T 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

T 

F 

F 

T 

F 

T 

T 

F 

F 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

F 

T 

T 

T 

4.  (a)  Since  [(-v)  v  (—'<7)]  <=>  [q  (t)],  the  first  two  premises  give  p  (->r)  by  the  chain  rule.  Now  ->p  follows 
by  modus  tollens. 

(b)  This  argument  is  not  valid.  If  q  and  r  are  false,  p  and  s  are  true,  and  I  takes  on  any  truth  value,  then  all  premises 
are  true,  yet  the  conclusion  is  false. 


5.  (a)  Let  p  and  q  be  the  statements 


P‘ 

4- 


I  stay  up  late  at  night 
I  am  tired  in  the  morning. 


P  <7 

The  given  argument  is  p  .  This  is  valid  by  modus  ponens. 

<7 


(b)  Let  p  and  q  be  the  statements 


P- 


I  stay  up  late  at  night 
I  am  tired  in  the  morning. 


P  q 

The  given  argument  is  q 

P 


This  is  not  valid,  as  the  truth  table  shows.  In  row  three,  the  two  premises  are  true 
but  the  conclusion  is  false. 


P^ 

T 

T 

F 

F 


T 

F 

T 

F 


T 

F 

T 

T 


q 


★ 


★ 


(e)  Let  p  and  q  be  the  statements 


I  wear  a  red  tie 
I  wear  blue  socks. 
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pvq 

The  given  argument  is  -•q  This  is  valid  by  disjunctive  syllogism. 

P 


P 

(g)  Let  p,  q,  and  r  be  the  statements  q 

r : 

p^  q 

The  given  argument  is  q  —*■  r  . 

r  ->  P 


I  work  hard 
I  earn  lots  of  money 
I  pay  high  taxes. 


This  is  not  valid,  as  the  truth  table  shows.  In  row  five,  the  two 
premises  are  true  but  the  conclusion  is  false. 


(1) 


P 

Let  p ,  q  ,  and  r  be  the  statements  ^ 

a-  : 


I  like  mathematics 
I  study 

I  pass  mathematics 
I  graduate. 


The  given  argument  is 


P  ->  q 
i-'q)  v  r 
(~ '-v)  ->  (-'i') 
s  q 


This  is  the  same  as 


which  is  certainly  not  valid,  as  the  following  partial  truth  table  shows. 


p  q 

q  —*  r 
r  -*  s 
s  — >  q 


6.  r  V  q  is  logically  equivalent  to  [ — ■( — >r)  v  q ]  <=>  [( — )  — ►  q ]  so,  with  p  -*  ->r,  we  get  p  — *■  q  by  the  chain  rule. 

7.  We  will  prove  by  contradiction  that  no  such  conclusion  is  possible.  Say  to  the  contrary  that  there  is  such  a  conclusion 
6.  Since  C  is  not  a  tautology,  some  set  of  truth  values  for  p  and  q  must  make  C  false.  But  if  r  is  true,  then  both 
the  premises  (~'p)  —>  r  and  r  V  q  are  true  regardless  of  the  values  of  p  and  q.  This  contradicts  C  being  a  valid 
conclusion  for  this  argument. 

8.  (a)  p  /\q  is  true  precisely  when  p  and  q  are  both  true. 

10.  In  Latin,  modus  ponens  means  “method  of  affirming"  and  modus  tollens  means  “method  of  denying”.  This  is  a 
reflection  of  the  fact  that  modus  tollens  has  a  negative  -^p  as  its  conclusion,  while  modus  ponens  affirms  the  truth  of 
a  statement  q. 


Exercises  2. 1 

1.  (a)  {-y/S,  V5);  (c)  {0,  — §} 

2.  (a)  For  example,  1  +  t,  1  +  2/,  1  +  3 i,  —8  —  5/  and  17  —  43 i. 

3.  (a)  {1,2},  {1,2,3},  {1,2,4},  {1,2,  3,  4} 
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4. 

Only 

(c)  is  true.  The  set  A 

contains  one  element,  (a,  ft} 

5. 

(a) 

True; 

(d)  False. 

6. 

(a) 

(01 

7. 

(a) 

True; 

(e)  False;  (h) 

False. 

8. 

Yes  it 

is;  for  example,  let  a 

:  =  {!}  and  A  =  (1,(1}}. 

9. 

(a) 

ii.  {a, 

ft,  c),  ( a ,  ft,  J},  (a,  e, 

rf},  (ft,  c,  } 

11. 

(a) 

4; 

(b)  8; 

(c)  There  are  2"  subsets  of  a  set  of  n  elements.  (See  Exercise  16  in  Section  5.1  for  a  proof.) 

12.  (a)  False;  (d)  True. 

14.  (a)  True.  (-»■)  If  C  e  V(A),  then  by  definition  of  power  set,  C  is  a  subset  of  A;  that  is,  CCA. 

(«— )  If  C  c  A,  then  C  is  a  subset  of  A  and  so,  again  by  definition  of  power  set,  C  e  V(A). 

Exercises  2.2 

1.  (a)  A  =  (1,2,  3,4,  5,6},  B  =  (-1,0,  1,2,3, 4,5},  C  =  (0,  2, -2}. 

2.  (a)  S  (T  T  =  {V2,  25},  S  U  T  =  (2,  5,  s/2, 25,  tt,  f ,  4,  6,  §}, 

T  x  (S  n  T)  =  {(4,  s/2),  (4,  25),  (25,  72),  (25,  25),  (s/2,  s/2),  (s/2,  25),  (6.  s/2),  (6,  25),  (§,  s/2),  (§,25)}. 

(b)  ZUS  =  (s/2,jt,  §,0,  1 ,  —1 , 2,  —2, . . . );  Z n  5  =  (2,  5,  25}; 
zur  =  (s/2,  0,  1,  -1, 2,  -2, . . . );  Zn  T  =  (4,  25,  6}. 

3.  (a)  (1,9,0,6,7} 

5.  (a)  (c,  (a,  £>}};  (e)  0 

6.  (a)  Ac  =  (-2,  1] 

7.  (a)  2"~2 

8.  (a,  b)c  =  (—oo,  a]  U  [ft,  oo),  [a,  ft)c  =  (— oo,  a)  U  [ft,  oo),  (a,  oo)f  =  (— oo,  a]  and  (— oo,  b]c  =  (ft,  oo). 

9.  (a)  T  CCS;  (b)  M  n  P  =  0. 

11.  (a)  EflP^0 

12.  (a)  A3  U  A_ 3  =  A3 

13.  Region  2  represents  (A  n  C)  \  B.  Region  3  represents  A  IT  B  D  C;  region  4  represents  (A  IT  5)  \  C. 

14.  (a)  A  C  B,  by  Problem  7. 

15.  Think  of  listing  the  elements  of  the  given  set.  There  are  n  pairs  of  the  form  (1,  ft),  n  —  1  pairs  of  the  form  (2,  ft), 
n  —  2  pairs  of  the  form  (3,  ft),  and  so  on  until  finally  we  list  the  only  pair  of  the  form  ( n,b ).  The  answer  is 
1  +  2  +  3  +  ■  ■  ■  +  n  —  \n(ii  -P  1). 

17.  (b)  Yes.  Given  An  B  =  ACC  and  Ac  fl  B  =  Ac  HC,  certainly  we  have  A  n  B  c  C  and  Ac  n  B  C  C  so,  from  (a),  we 

have  that  B  C  C.  Reversing  the  roles  of  B  and  C  in  (a),  we  can  also  conclude  that  C  C  B;  hence,  B  =  C. 

20.  Using  the  fact  that  X  \  Y  =  X  (T  Yc,  we  have 

(A  \  B)  \  c  =  (A  n  bc)  n  cc  =  a  n  ( bc  n  cc)  =  a  n  (B  u  C)c  =  a\(B  u  C). 

23.  (a)  (A  U  B  U  C)c  =  [AU(fiU  C)|'  =Afn(fiU  Cf  =  Ac  n  ( Bc  n  Cc)  =  Af  n  Bc  n  cc. 

(A  n  b  n  cy  =  [An  (Bn  c)f  =  ac  u  (B  n  cy  =  ac  u  (Bc  u  cc )  =  Ae  u  bc  u  cc. 
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24.  (a)  Looking  at  the  Venn  diagram  at  the  right,  A  ffi  B  consists  of  the  points  in  regions  1  and  3. 

To  have  A  ®  B  =  A,  we  must  have  both  regions  2  and  3  empty;  that  is,  B  =  0.  On  the  other 
hand,  since  A  ©  0  =  A,  this  condition  is  necessary  and  sufficient. 


25.  (a)  This  does  not  imply  B  =  C.  For  example,  let  A  =  (1, 2),  B  =  {1),  C  =  {2}.  Then  A  U  B  =  A  U  C,  but  B  ^  C. 

26.  (d)  True.  Let  x  e  A.  Then  x  e  A  U  B,  so  x  e  A  n  B  and,  in  particular,  x  e  B.  Thus,  Acs.  Similarly,  we  have  B  c  A, 

so  A  —  B. 

28.  (c)  True.  Let  (x,  y)  e  (A  ©  B)  x  C.  This  means  that  x  e  A  ©  B  and  y  e  C;  that  is,  x  €  A  U  B,  x  £  A  n  B,  y  e  C.  If 
x  e  A,  then  x  <£  B ,  so  (x,  y)  e  (A  x  C)  \  (B  x  C).  If  x  e  B,  then  x  ^  A,  so  (x,  y)  €  (B  x  C)  \  (A  x  C).  In  either 
case,  (x,  y)  e  (A  x  C)  ffi  (B  x  C).  So  (A  ©  B)  x  C  c  (A  x  C)  ©  (B  x  C). 

Now,  let  (x,  y)  €  (A  x  C)  ffi  (B  x  C).  This  means  that  (x,  y)  e  (fxC)U(fix  C).  but  (x,  y)  ^  (AxC)fl(Jx  C).  If 
(x,  y)  e  A  x  C,  then  (x,  y)  ^  B  x  C,  so  x  e  A,  y  e  C  and,  therefore,  x  ^  B.  If  (x,  y)  e  B  x  C,  then  (x,  y)  £  Ay.  C, 
so  x  e  B,  y  e  C  and,  therefore,  x  £  A.  In  either  case,  x  €  A  ffi  B  and  y  €  C,  so  (x.  y)  e  (A  ffi  B)  x  C.  Therefore, 
(A  x  C)  ffi  (B  x  C)  C  (A  ffi  B)  x  C  and  we  have  equality,  as  claimed. 


Exercises  2.3 

1.  S  x  B  is  the  set  of  ordered  pairs  (s,  b ),  where  s  is  a  student  and  h  is  a  book;  thus,  S  x  B  represents  all  possible  pairs 
of  students  and  books.  One  sensible  example  of  a  binary  relation  is  {(s,  b)  |  .5  has  used  book  b}. 

3.  (a)  not  reflexive,  not  symmetric,  not  transitive. 

(c)  not  reflexive,  not  symmetric,  but  it  is  transitive. 


abed 

a 

X  X 

b 

X  X 

c 

X 

d 

X 

5.  (a)  {(1,1),  (1,2),  (2,  3));  (g)  {(1,2),  (2,  1),  (I,  I),  (2,  2)}. 

7.  The  argument  assumes  that  for  a  €  72  there  exists  a  b  such  that  ( a,b )  e  1Z.  This  need  not  be  the  case:  See 
Exercise  5(g). 

8.  (a)  Reflexive  since  every  word  has  at  least  one  letter  in  common  with  itself. 

Symmetric  since  if  a  and  b  have  at  least  one  letter  in  common,  then  so  do  b  and  a. 

Not  antisymmetric,  (cat,  dot)  and  (dot,  cat)  are  both  in  the  relation  but  dot  ^  cat!! 

Not  transitive,  (cat,  dot)  and  (dot,  mouse)  are  both  in  the  relation  but  (cat,  mouse)  is  not. 

9.  (b)  Not  reflexive;  (2,  2)  £  TZ. 

Not  symmetric:  (3,  4)  e  7 Z  but  (4,  3)  ^  72. 

Not  antisymmetric:  (1,2)  and  (2,  1)  are  both  in  72  but  1  /  2. 

Not  transitive:  (2,  1)  and  (1,2)  are  in  72  but  (2,  2)  is  not. 

(c)  Reflexive:  For  any  a  e  Z,  it  is  true  that  a 2  >  0.  Thus,  (a.  a)  e  72. 

Symmetric:  If  (a,  b )  e  72,  then  ab  >  0,  so  ba  >  0  and  hence,  (b.a)  e  72. 

Not  antisymmetric:  (5,  2)  e  72  because  5(2)  =  10  >  0  and  similarly  (2,  5)  e  72,  but  5^2. 

Not  transitive:  (5,0)  e  72  because  5(0)  =  0  >  0  and  similarly,  (0,-6)  e  72;  however,  (5,-6)  ^  72  because 
5(— 6)  t  0. 
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(i)  Reflexive:  If  ( x ,  y)  e  R2,  then  x  +  y  <  x  +  y,  so  ((x,  y),  (x,  y))  e  1 Z. 

Not  symmetric:  ((1 , 2),  (3,  4))  e  TZ  since  1  +  2  <  3  +  4,  but  ((3,  4),  (1,  2))  ^  1Z  since  3  +  4  £  1+2. 

Not  antisymmetric:  ((1, 2),  (0,  3))  e  TZ  since  1+2  <  0  +  3  and  ((0,  3),  (1,2))  e  1Z  since  0  +  3  <  1  +  2,  but 
(1,2)94(0,  3). 

Transitive:  If  ((a,  b),(c,  d))  and  ((c,  d),  (e,  /))  are  both  in  TZ,  then  a  +  b  <  c  +  d  and  c  +  d  <  e  +  f,  so  a  +  b  <  e  +  / 
(by  transitivity  of  <)  which  says  ((a,  b),  ( e ,  /))  e  TZ. 

10.  (a)  Reflexive:  For  any  set  X,  we  have  X  c  X. 

Not  symmetric:  Let  a,  b  e  S.  Then  {a}  C  { a ,  b }  but  {a,  b }  £  { a }. 

Antisymmetric:  If  X  c  Y  and  Y  c  X,  then  X  =  F. 

Transitive:  If  X  c  y  and  T  c  Z,  then  X  c  Z. 

11.  (a)  Reflexive:  Any  book  has  price  >  its  own  price  and  length  >  its  own  length,  so  (a,  a)  e  TZ  for  any  book  a. 

Not  symmetric:  ( Y ,  Z)  e  TZ  because  the  price  of  Y  is  greater  than  the  price  of  Z  and  the  length  of  Y  is  greater  than 
the  length  of  Z,  but  for  these  same  reasons,  (Z,  Y )  ^  TZ. 

Antisymmetric:  If  ( a ,  b )  and  ( b ,  a)  are  both  in  TZ,  then  a  and  b  must  have  the  same  price  and  length.  This  is  not  the 
case  here  unless  a  =  b. 

Transitive:  If  (a,  b)  and  ( b ,  c)  are  in  TZ,  then  the  price  of  a  is  >  the  price  of  b  and  the  price  of  b  is  >  the  price  of  c, 
so  the  price  of  a  is  >  the  price  of  c.  Also  the  length  of  a  is  >  the  length  of  b  and  the  length  of  b  is  >  the  length  of 
c,  so  the  length  of  a  is  >  the  length  of  c.  Hence,  (a,  c)  e  TZ. 

Exercises  2.4 

2.  (a)  This  is  not  reflexive:  (2,  2)  ^  TZ. 

3.  Equality!  The  equivalence  classes  specify  that  x  ~  y  if  and  only  if  x  =  y. 

4.  (a)  Reflexive:  If  a  e  R  \  {0},  then  a  ~  a  because  |  =  1  e  Q. 

Symmetric:  If  a  ~  b,  then  |  €  Q  and  this  fraction  is  not  zero  (because  0  f  A).  So  it  can  be  inverted  and  we  see  that 
|  =  1/|  e  Q  too.  Therefore,  b  ~  a. 

Transitive:  If  a  ~  b  and  b  ~  c,  then  |  6  Q  and  *  eQ.  Since  the  product  of  rational  numbers  is  rational,  |  =  is 
in  Q,  so  a  ~  c. 

(b)  I  =  [a  |  a  ~  1)  =  [a  \  f  €  Q}  =  {a  |  a  e  Q}  =  Q  \  (0). 

(c)  ^  =  2  e  Q,  so  V3  ~  712  and  hence  V3  =  712. 

6.  (a)  Reflexive:  For  any  a  S  R,  a  ~  n  because  a  —  a  =  0  e  Z. 

Symmetric:  If  a  ~  h,  then  a  —  b  e  Z,  so  h  —  a  e  Z  (because  b  —  a  —  —(a  —  b))  and,  hence,  b  ~  a. 

Transitive:  If  a  ~  b  and  b  ~  c,  then  both  a  —  b  and  b  —  c  are  integers;  hence,  so  is  their  sum,  (a  —  b)  +  (b  —  c)  =  a  —  c. 
Thus,  a  ~  c. 

(b)  5  =  {x  e  R  |  x  ~  5}  =  (x  |  x  -  5  €  Z}  =  Z; 

5^  =  {x  e  R|x~52)  =  {x  |  x  =  n  +  j,  for  some  n  €  Z) 

(c)  For  each  a  e  R,  0  <  a  <  1,  there  is  one  equivalence  class  a  =  {x  e  R  |  x  =  a  +  n  for  some  integer  /;). 

The  quotient  set  is  {a  |  0  <  a  <  1). 

7.  Reflexive:  For  any  a  €  Z,  a  ~  a  because  2a  +  3a  =  5a. 

Symmetric:  If  a  ~  b,  then  2 a  +  3b  =  5n  for  some  integer  n.  So  2b  +  3a  =  (5a  +  5b)  —  (2a  +  3b)  =  5(a  +  b)  —  5«  = 
5 (a  +  b  —  «).  Since  a  +  b  —  n  is  an  integer,  b  ~  a. 

Transitive:  If  a  ~  b  and  b  ~  c,  then  2a  +  3b  =  5 n  and  2b  +  3c  =  5 m  for  integers  n  and  m.  Therefore,  (2a  +  3b)  + 
(2b  +  3c)  =  5 (n  +  m )  and  2 a  +  3c  =  5(n  +  m)  —  5b  =  5(n  +  m  —  b).  Since  n  +  m  —  b  is  an  integer,  a  ~  c. 
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10.  (a)  Reflexive:  If  a  e  Z  \  {0},  then  aa  =  a2  >  0,  so  a  ~  a. 

Symmetric:  If  a  ~  b,  then  ab  >  0.  So  ba  >  0  and  F  ~  a. 

Transitive:  If  a  ~  b  and  b  ~  c,  then  ab  >  0  and  be  >  0.  Also  b2  >  0  since  b  ^  0.  Hence,  czc  =  -  cb\b -  =  (ofc^fec)  >  0 
since  tzfc  >  0  and  fee  >  0.  Hence,  a  ~  c. 

(b)  5  =  jreZ\  (0)  |  x  ~  5}  =  {x  |  5x  >  0J  =  {x  |  x  >  0) 

—5  =  (x  €  Z  \  {0}  |  x  ~  —5}  -  {x  |  — 5x  >  0)  =  {x  |  x  <  0} 

(c)  This  equivalence  relation  partitions  Z  \  {0}  into  the  positive  and  the  negative  integers. 

11.  (a)  Reflexive:  For  any  a  €  Z,  a2  —  a1  =  0  is  divisible  by  3,  so  a  ~  a. 

Symmetric:  If  a  ~  ft,  then  a2  —  fz2  is  divisible  by  3,  so  b2  —  a 2  is  divisible  by  3,  so  b  ~  a. 

Transitive:  If  a  ~  £>  and  b  ~  c,  then  a2  —  F2  is  divisible  by  3  and  F2  —  c2  is  divisible  by  3,  so  a2  —  c2  = 

(a2  —  b2)  +  (b2  —  c2)  is  divisible  by  3. 

12.  (a)  Yes,  this  is  an  equivalence  relation. 

Reflexive:  Note  that  if  a  is  any  triangle,  a  ~  a  because  a  is  congruent  to  itself. 

Symmetric:  Assume  a  ~  b.  Then  a  and  b  are  congruent.  Therefore,  b  and  a  are  congruent,  so  b  ~  a. 

Transitive:  If  a  ~  b  and  b  ~  c,  then  a  and  b  are  congruent  and  b  and  c  are  congruent,  so  a  and  c  are  congruent. 
Thus,  a  ~  c. 

13.  (a)  {(1, 1),  (1, 2),  (2,  1),  (2,  2),  (3,  3),  (3, 4),  (3,  5),  (4,  4),  (4,  5),  (5,  5),  (4,  3),  (5,  3),  (5,  4)} 

14.  (c)  As  suggested  in  the  text,  a  good  way  to  list  the  equivalence  relations  on  {a,  b,  c)  is  to  list  the  partitions  of  this  set. 

Here  they  are:  {  (a),  {b},  (c)  };  {  (a,  b,  c}  };  (  { a ,  b ),  {c}  };  {  [a,  c },  {b}  };  {  [b,  c),  {a}  }.  There  are  five  in  all. 

15.  (a)  The  given  statement  is  an  implication  which  concludes  “x  —  y  =  x  —  y,”  whereas  what  is  required  is  a  logical 

argument  which  concludes  “so  ~  is  reflexive.” 

A  correct  argument  is  this:  For  any  (x,  y)  e  R2,  x  —  y  =  x  —  y;  thus,  (x,  y)  ~  (x,  y).  Therefore,  ~  is  reflexive. 

16.  Reflexive:  If  (x,  y)  €  R2,  then  x2  —  y2  =  x2  —  y2,  so  (x,  y)  ~  (x,  y). 

Symmetric:  If  (x,  y)  ~  ( u ,  v ),  then  x2  —  y2  —  u2  —  v2,  so  u 2  —  v2  =  x2  —  y2  and  ( u ,  v)  ~  (x,  y). 

Transitive:  If  (x,  v)  ~  («,  u)  and  (zz,  u)  ~  (w,  z),  then  x2  —  y2  =  u2  —  v2  and  u2  —  v2  —  w2  —  z2,  so  x2  —  y2  = 

u2  —  v2  =  w2  —  z2',  x2  —  y2  =  w2  —  z 2  and  (x,  y)  ~  ( w ,  z). 

M  =  {(Jt,  y)  I  (Jt.  v)  ~  (0,  0)1  =  {(x,  y)  |  x2  -  y2  =  02  -  02  =  0}  =  {(x,  y)  |  y  =  ±x} 

Thus,  the  equivalence  class  of  (0,  0)  is  the  pair  of  lines  with  equations  y  =  x,  y  =  —x. 

CFO)  =  {(x,  y)  |  (x,  y)  ~  (1, 0)}  =  {(x,  y)  |  x2  -  y2  =  l2  -  02  =  1} 

Thus,  the  equivalence  class  of  (1,0)  is  the  hyperbola  whose  equation  is  x2  —  y2  =  1. 

19.  (a)  The  ordered  pairs  defined  by  ~  are  (1 ,  1),  (1,4),  (1,9),  (2,  2),  (2,  8),  (3,  3),  (4,  1),  (4,  4),  (4,  9),  (5,5),  (6,  6),  (7,7), 

(8,2),  (8,8),  (9,  1),  (9,4),  (9,9). 

(b)  T  =  {1,4,9}  =  4  =  9;  2  =  {2,  8)  =  8;  3  =  {3};  5  =  {5):  6  =  {6};  7  =  {7). 

(c)  Since  the  sets  {1,4,  9},  {2,  8},  {3},  {5},  {6}  and  {7}  partition  A,  they  determine  an  equivalence  relation,  namely,  that 

equivalence  relation  in  which  a  ~  b  if  and  only  if  a  and  b  belong  to  the  same  one  of  these  sets.  This  is  the  given 
relation. 

20.  Reflexive:  If  a  e  A,  then  a2  is  a  perfect  square,  so  a  ~  a. 

Symmetric:  If  a  ~  b,  then  ab  is  a  perfect  square.  Since  ba  =  ab,  ba  is  also  a  perfect  square,  so  b  ~  a. 
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Transitive:  If  a  ~  b  and  b  ~  c,  then  ab  and  be  are  each  perfect  squares.  Thus  ab  =  x2  and  be  =  y2  for  integers  jc 

?  ->7  x2y 2  txy\2  xy 

and  v.  Now  ab  c  =  x  y  ,  so  ac  =  — =  I  — )  .  Because  ac  is  an  integer,  so  also  —  is  an  integer.  Therefore, 

b-  V  b  /  b 

a  ~  c. 

Exercises  2.5 

1.  (a)  This  defines  a  partial  order. 

Reflexive:  For  any  a  e  R,  a  >  a. 

Antisymmetric:  If  a,  b  e  R,  a  >  b  and  b  >  a,  then  a  =  b. 

Transitive:  If  a,  b,  c  e  R,  a  >  b  and  b  >  c,  then  a  >  c. 

This  partial  order  is  a  total  order  because  for  any  a,  b  e  R,  either  a  >  b  or  b  >  a. 

(b)  This  is  not  a  partial  order  because  the  relation  is  not  reflexive;  for  example,  1  <  1  is  not  true. 

2.  (a)  1,  10,  100,  1000,  1001,  101,  1010,  11,  110,  111 

3.  (a)  (a,  b),  (a,  c),  («,  d),  (b,  c),  (h,  d),  (c,  d)  (b)  ( a,b),(c,d ) 

4.  (a)  a  is  minimal  and  minimum;  d  is  maximal  and  maximum. 

(b)  a  and  c  are  minimal;  b  and  d  are  maximal;  there  are  no  minimum  nor  maximum  elements. 

7.  A  C  B  and  the  set  B  contains  exactly  one  more  element  than  A. 

9.  (a)  Let  (A,  ;<)  be  a  finite  poset  and  let  a  €  A.  If  a  is  not  maximal,  there  is  an  element  a\  such  that  a\  >-  a.  If  a\  is  not 

maximal,  there  is  an  element  ai  such  that  ai  >  a\.  Continue.  Since  A  is  finite,  eventually  this  process  must  stop,  and 

it  stops  at  a  maximal  element.  A  similar  argument  shows  that  (A,  <)  must  also  contain  minimal  elements. 

11.  (a)  Suppose  that  a  and  b  are  two  maximum  elements  in  a  poset  (A,  <).  Then  a  <  b  because  b  is  maximum  and  b  <  a 

because  a  is  maximum,  so  a  —  b  by  antisymmetry. 

12.  (a)  Assuming  it  exists,  the  greatest  lower  bound  G  of  A  and  B  has  two  properties: 

(1)  CCA,  CCD; 

(2)  if  C  c  A  and  C  C  B,  then  C  C  G. 

We  must  prove  that  A  n  B  has  these  properties.  Note  first  that  A  n  B  c  A  and  A  H  B  c.  B,  soAflB  satisfies  (1). 
Also,  if  C  c  A  and  C  c  B,  then  CCAnfi.soAflg  satisfies  (2)  and  A  IT  B  =  A  A  B. 

13.  (a)  a  V  b  =  b  and  here  is  why.  We  are  given  a  <  b  and  have  b  <  b  by  reflexivity.  Thus  b  is  an  upper  bound  for  a  and 

b.  It  is  least  because  if  c  is  any  other  upper  bound,  then  a  <c,b  <c\  in  particular,  b  <c. 

14.  (a)  Suppose  x  and  y  are  each  gibs  of  two  elements  a  and  b.  Then  x  <  a,  x  <  b  implies  x  <  y  because  y  is  a  greatest 

lower  bound,  and  y  <  a,  y  <  b  implies  y  <  x  because  x  is  greatest.  So,  by  antisymmetry,  x  =  y. 

16.  (a)  ( V(S ),  C)  is  not  totally  ordered  provided  |S|  >  2  (since  {a}  and  ( b }  are  not  comparable  if  a  /  b ).  But  0  is  a  minimum 
because  0  is  a  subset  of  any  set  and  the  set  S  itself  is  a  maximum  because  any  of  its  subsets  is  contained  in  it. 

18.  (a)  We  have  to  prove  that  if  b  <  a,  then  b  =  a.  So  suppose  b  <  a.  Since  a  is  minimum,  we  have  also  a  <  b.  By 
antisymmetry,  b  =  a. 

Exercises  3.1 

1.  (a)  Not  a  function;  /  contains  two  different  pairs  of  the  form  (3,  — ).  (c)  This  is  a  function. 

2.  (a)  This  is  not  a  function  unless  each  student  at  the  University  of  Calgary  has  just  one  professor,  for  if  student  a  is  taking 

courses  from  professors  b \  and  f>2,  the  given  set  contains  ( a ,  b\)  and  (a,  l>2)- 

3.  A  x  B  is  a  function  A  -»  B  if  and  only  if  B  contains  exactly  one  element. 

To  see  why,  first  note  that  if  B  =  {b\,  then  A  x  B  =  {(a,  b)  \  a  e  A)  is  certainly  a  function. 
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4.  (a) 

5.  (a) 

6. 

8.  (a) 

9.  (a) 
(b) 

10.  (a) 

13.  (a) 

14.  (a) 

15.  (a) 

(b) 

17.  (a) 
(b) 

18.  (a) 

19.  (b) 

22.  (a) 


Conversely,  if  A  x  B  is  a  function  but  B  contains  two  elements  b\ ,  ft2,  then  for  any  a  e  A,  (a,  b\ )  and  (a,  hi)  are 
both  in  A  x  B,  so  A  x  B  is  not  a  function. 

the  function  defined  by  fin)  =  2 n,  for  example. 

If  x  e  X,  then  x  is  a  country  in  the  British  Commonwealth  with  a  uniquely  determined  Prime  Minister  y  who  lives 
in  that  country;  that  is,  jef. 

If  y  6  Y,  then  y  is  a  person  living  in  one  of  the  countries  in  the  British  Commonwealth.  Thus,  the  domicile  of  y  is 
a  uniquely  determined  element  x  e  X. 

Parking  rates,  bus  fares,  admission  prices  are  several  common  examples. 

add  is  not  one-to-one  since,  for  example,  add(l,  l)=add(0,  2)  while  (1,1)^  (0,  2).  It  is  onto,  however,  because,  for 
any  y  e  R,  the  equation  y  =  add(x)  has  the  solution  x  =  ( y ,  0). 

g  is  not  one-to-one  since,  for  example,  g(l)  =  g(—  1)  =  2.  Neither  is  g  onto;  For  any  x  e  Z,  |x|  >  0,  so  |jc |  +  1  >  1. 
Thus,  for  example,  0  £  rngg. 

g  is  not  one-to-one  as  in  (a).  This  time  it  is  onto,  however,  because  for  n  e  N,  the  equation  n  =  g (x )  has  the  solution 
x  =  n  —  1 .  (Note  that  n  >  0  for  n  e  N,  so  \n  —  1 1  =  n  —  1 .) 

If  f(x  i)  =  fix  2),  then  3xi  +5  =  3x2  +5,  so  x\  =  x2  which  proves  that  /  is  one-to-one.  Also  /  is  onto,  since  given 

y  e  Q,  y  =  fi\iy  -  5))  with  j(>-  -  5)  e  Q. 

Note  that  f(x)  =  (x  +  l)2  -  100.  If  f(x j)  —  f(x 2)  it  follows  that  (xj  +  l)2  —  100  =  (xt  +  7)2  —  100,  so 
(xi  +  7)2  =  (X2  +  7)2  and,  taking  square  roots,  |xi  +  7|  =  |x2  +  7|.  Since  x\,X2  e  N,  we  know  x;  +  7  >  0  and 

X2  -f  7  >  0.  Thus,  x]  +  7  =  x2  +  7  and  x\  =  x2.  Thus,  /  is  one-to-one,  but  it  is  not  onto:  For  example,  1  e  B ,  but 

there  is  no  x  €  N  with  f(x)  =  1  since  (x  +  7)2  —  100  =  1  implies  (x  +  7)2  =  101  and  this  equation  has  no  solution 
in  the  natural  numbers. 

The  domain  of  /  is  R.  Its  range  is  also  R  because  every  y  6  R  can  be  written  y  =  fix)  for  some  x;  namely,  x  =  y'^3. 
The  function  is,  therefore,  onto.  It  is  also  one-to-one:  If  fix  1)  =  /(x2),  then  x3  =  x|  and  this  implies  x\  =  x2. 

y 

The  graph  of  /  shown  at  the  right  makes  it  clear  that  /  is  one-to-one  and  onto. 


Since  / :  R  — >  R  is  one-to-one  by  part  (a),  it  is  also  one-to-one  as  a  function  with 
domain  Z.  Here,  however,  /  is  not  onto  for  we  note  that  /( 0)  =  0,  /( 1 )  =4  and  / 
is  increasing,  so  1  is  not  in  the  range  of  /. 

This  is  not  one-to-one  since,  for  example,  /( 1,  3)  =  /(4,  1)  =  1 1  while  (1,3)  ^4  (4,  1).  The  function  is  not  onto 
since,  for  example,  the  equation  fix)  =  1  has  no  solution  x  =  in,  m)  (because  2  n  +3m  >5  for  every  n .  m  e  N). 

This  is  not  one-to-one  since,  for  example,  /( 1,3)  =  /( 4,  1)  =  11  while  (1,3)  7^  (4,  1).  The  function  is  onto, 
however,  since  for  k  e  Z,  the  equation  k  =  fix)  has  the  solution  x  =  i—k,  k). 

A  =  {x  6  R  |  x  #  3};  mg  /  =  {y  |  y  ±  0). 

Note  that  fix)  =  —  (2x  —  3)2.  This  function  is  not  one-to-one  since,  for  example,  /(l)  =  /(2)  (=  —1).  Restrict  the 
domain  to  {x  |  x  >  3/2}  (or  to  {x  |  x  <  |}). 

Here  are  the  functions  X  — >■  Y : 


{(a,  l),(b,  1)}  lia,l).ib,2)}  {(a,  1),  (*,  3)} 

«a,2),(M)}  [ia,2),ib,2)}  {(a,2),  (*,  3)} 
{(a,  3),  ib,  1)}  {(a.  3),  (ft,  2)}  {(a,  3),  (ft,  3)}. 
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Here  are  the  functions  Y  — >  X: 


{(!,«).  (2,  a),  (3,  a)} 
{(l,a).(2,  b),  (3,  a)} 
{(1,6),  (2,  a),  (3,  a)} 
{(1,6),  (2,  6),  (3,  a)} 


{(1.  a),  (2,  a),  (3,  6)} 
{(1,  a),  (2,  6),  (3,  b)} 
{(1,6),  (2,  a),  (3, 6)} 
{(1,6),  (2,  6),  (3,  b)}. 


(b)  There  are  no  one-to-one  functions  Y  — >  X.  The  one-to-one  functions  from  X  ->  F  are 

{(a,l),(6,2)}  {(a,  1),(6,3)(  {(a,  2),  (6,  1)} 

{(a,  2),  (6,  3)}  {(a,  3),  (b,  1)}  {(a,  3),  (6,  2)}. 


(c)  There  are  no  onto  functions  X  -»  Y.  The  onto  functions  Y  —*■  X  are 


{(l,a),(2,a),(3,6)) 
{(1,  a),  (2,  6),  (3,  6)} 
{(1,6),  (2,  a),  (3,6)) 


{(1,  a),  (2,  6),  (3,  a)} 
{(1,6),  (2,  a),  (3,  a)) 
{(1,6),  (2,  6),  (3,  a)}. 


24. 


« 


1 

2 

3 

4 

1 

1 

2 

3 

4 

2 

1 

4 

9 

16 

3 

1 

8 

27 

64 

4 

1 

16 

81 

256 

We  would  guess  that  for  |X|  =  m  and  |T|  =  n,  the  number  of  functions 
X  -»  Y  is  nm. 


26.  (a)  (— »■)  Suppose  A  and  B  each  contain  n  elements.  Assume  that  f:A—*B  is  one-to-one  and  let  C  =  {/(a)  |  a  e  A}. 

Since  f(a\)  ^  f(ai)  if  a \  ^  a2,  C  is  a  subset  of  B  containing  n  elements;  so  C  =  B.  Therefore,  /  is  onto. 

27.  (a)  (b)  The  domain  of  /  is  R.  The  range  is  the  set  of  all  real  numbers  of  the 

form  2 n  +  a,  where  n  e  Z  and  0  <  a  <  1 . 


Exercises  3.2 


1.  (a)  /“*  =  {(1,5),  (2,  1),  (3,  2),  (4,  3),  (5,  4)} 


(b)  /->  =  {(1,4),  (2,  1),  (3,  3),  (4,  2),  (5,  5)} 


2.  (a)  /  1 :  R  — s-  R  is  defined  by  /  *(x)  =  j(x  —  5). 


3.  (a)  If  f(x i)  =  f(x2),  then  1  -I - l—  =  1  4 - — -,  — X— 

x\  —  4  X2  —  4  xi  —  4 

one-to-one. 

rng/  =  B  =  {y  €  R  |  y  #  1}  and  /-1(x)  =4+  — [— . 


-  and  xj  —  4  =  X2  —  4.  Thus  x\  =  x2  and  /  is 

X2  —  4 
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5.  (a) 

6.  (a) 


7.  (a) 

8. 


10. 


12.  (a) 


15.  (a) 

16.  (a) 


(b) 

18.  (a) 


19.  (a) 


20.  (b) 


21. 


24.  (b) 


maternal  grandmother 


fog  =  {(1,1),  (3.  8),  (2,1).  (4,9),  (5,1)} 

g  o  /  is  not  defined  because  rng  /  =  { 1 , 2,  3,  8,  9}  g  dom  g  =  { 1 , 2,  3, 4,  5). 
/  o  /  is  not  defined  because  rng  /  =  {1, 2,  3,  8,  9}  g  dom  /  =  {1, 2,  3,  4,  5). 
gog  =  {(1.2),  (2,2),  (3,  2),  (4,1),  (5,  2)). 
g-1  o  /  o  g  =  {( 1 , 3),  (2, 1),  (3,  2),  (4,  4)| 


g  o  f(x)  = 


f  °  gU)  = 


(x  +  2)2  +  1  x 2  +  4.r  +  5 

h  °  g  o  /U)  =  3;  g  o  h  o  f(x)  =  g(3)  =  i. 

Since  /_l  o  fix)  =  jr,  we  have  g  o  /~*  o  /( x)  =  g(x)  = 

Since  /_1  (x)  =  jc  —  2,  we  have 


x 2  + 


+  2  = 


x2  +  1 


lx2  +  3 

x2  +  1 


/-'  ogof(x)  =  /-'( 


I 


(*  +  2)2  +  1 , 


1  _  —  2(.y  +  2)2  —  I 

(a.-  +  2)2+l  (at  +  2)2  +  1 


ig  0  f)(x)  =  gif(x))  =  fix)  —  c.  Thus  the  graph  of  g  o  /  is  the  graph  of  /  translated  vertically  c  units  down  if 
c  >  0  and  —  c  units  up  if  c  <  0.  The  graphs  are  identical  if  c  =  0. 


Since  —  |x|  = 


— x 


if*-°  U  f  ,  v  ft  I  n  I  /(-•*)  if.r>0 

we  have  /og(x)  =  /(- jc  )  =  \ 

la;  if  x  <  0,  I  f(x)  if  x  <  0. 

So  the  graph  of  /  o  g  is  the  same  as  the  graph  of  /  to  the  left  of  the  y-axis  (where  a:  <  0)  while  to  the  right  of  the 
y-axis,  the  graph  of  /  o  g  is  the  reflection  (mirror  image)  of  the  left  half  of  the  graph  of  /  in  the  y-axis.  We  call 
fog  an  even  function  since  it  is  symmetric  with  respect  to  the  y-axis;  /  o  g(—  x)  —  f  o  g(ar). 


/  o  g  =  {(1,4),  (2,  3),  (3,  2),  (4,  1),  (5,  5)};  g  o  /  =  {(I,  5),  (2,  3),  (3,  2),  (4.  4),  (5,  1))  Clearly,  /  o  g  /  g  o  /. 

it 

Forx  £  B,  (f  o  g)(x)  =  fijzj)  =  ^  =  ar. 

x- 1  “  2 
2(-3j) 

For  x  e  A,  ig  o  f)(x)  =  g(fz*>)  —  — — — -  =  x  and  so,  by  Proposition  3.2.7,  /  and  g  are  inverses. 

7^2  ~  1 

Suppose  g(/?i)  =  g(f>2)  for  b\,bz  €  B.  Since  /  is  onto,  b\  =  f{a\)  and  bi  =  f{a2)  for  some  a\,ci2  e  A. 
Thus,  g(/(«i))  =  g(/(az));  that  is,  g  o  /(«i)  =  g  o  /(a2)-  Since  g  o  /  is  one-to-one,  ai  =  ai-  Therefore, 
b\  =  /(«i)  =  /(+>)  =  bi  proving  that  g  is  one-to-one. 


Suppose  f:  A  -+  B  and  g:  B  —>  C  are  one-to-one.  We  prove  that  g  o  / :  4  -*•  C  is  one-to-one.  For  this,  suppose 
ig  °  /)(oi)  ~  (go  f)(af)  for  some  a\,a2€  A.  Then  g(f(a\))  =  g(/(a2)l  [an  equation  of  the  form  g((?i)  =  g  (i>2 )  1  - 
Since  g  is  one-to-one,  we  conclude  that  /(fl|)  =  /(+?),  and  then,  since  /  is  one-to-one,  that  «i  =  a2- 


With  A  =  {1,2},  B  =  {-1,  1,2},  C  =  {5,  10},  /  =  {(1,  1),  (2,2)}  and  g  =  {(-1, 5),  (1. 5),  (2,  10)}  we  have 
g  o  /  =  {(1,5),  (2,  10).  Then  g  o  /;  A  C  is  onto  but  /  is  not. 


Since  a  bijective  function  is,  by  definition,  a  one-to-one  onto  function,  we  conclude,  by  the  results  of  part  (a)  of  the 
previous  two  exercises,  that  indeed  the  composition  of  bijective  functions  is  bijective. 

Writing  a  =  [xj  +  a,  0  <  a  <  1,  we  have  tix)  =  [x\  —  a.  Now  it  is  straightforward  to  see  that  t  is  one-to-one: 
Suppose  t(xi)  =  r(x 2),  where  xi  =  |+ij  +  fli,  x2  =  |x2J  +  «2  and  0  <  ai,a2  <  Then  [xjJ  —  ai  =  |+2J  —  a2,  so 
[jtiJ  —  L*2J  =  a  i  —  a2.  The  left  side  is  an  integer;  hence,  so  is  the  right.  Because  of  the  restrictions  on  a  \  and  a2, 
the  only  possibility  is  a\  =  a2.  Hence,  also  |xij  =  1+2 J >  so  .ri  =  x2. 
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Exercises  3.3 


1. 


2. 

3.  (c) 

6. 

9.  (a) 

(b) 

11.  (a) 

12.  (a) 

13.  (a) 

15. 

16. 

17.  (d) 

18.  (a) 

(e) 

20.  (a) 

(c) 


21. 


22. 


24. 


Ask  everyone  to  find  a  seat. 

The  two  lists  l2,  22,  32,  42, . . .  and  1 , 2,  3,  4, . . .  obviously  have  the  same  length;  a1  i->  a  is  a  one-to-one  correspon¬ 
dence  between  the  set  of  perfect  squares  and  N. 

The  function  /:NxN->C  defined  by  f(m,n)=m+  ni  for  all  m,n  e  N. 

This  is  false.  For  example,  |N|  =  |N  U  {0} | ,  as  shown  in  the  text. 

False.  Let  X  =  {1},  Y  =  {2},  Z  =  {3}.  Then  ((1,2),  3)  <E  (X  x  Y)  x  Z  but  ((1,2),  3)  <£  X  x  (Y  x  Z)  ( a  set  whose 
second  coordinates  are  ordered  pairs). 

Define  f :  (X  x  Y)  x  Z  X  x  (Y  x  Z)  by  f((x,  y),  z)  =  (x,  (y,  z)). 

f(x)  =  x  +  2  is  a  one-to-one  correspondence  between  (1 ,  oo)  and  (3,  oo).  We  conclude  that  these  sets  have  the  same 
cardinality. 

The  function  defined  by  f(x)  =  x  +  1  is  a  one-to-one  correspondence  between  (0,  1)  and  (1,2). 

The  function  defined  by  g(x)  =  — F  9  is  a  one-to-one  correspondence  between  (0,  1)  and  (10,  oo).  This  is  just 

x 

f(x)  +  10,  where  /  is  the  function  given  in  Problem  26. 

/  is  certainly  a  function  from  R  to  R+  since  3X  >  0  for  all  jt  e  R.  If  3X  =  3V,  then  xlog3  =  ylog3  (any  base),  so 
x  —  y.  Thus,  /  is  one-to-one.  If  r  e  R+,  then  3log3r  =  r,  so  /  is  onto.  We  conclude  that  R  and  R+  have  the  same 
cardinality. 

Since  (a,  b )  has  the  same  cardinality  as  R+  by  Exercise  14  and  since  R+  and  R  have  the  same  cardinality  by 
Exercise  15,  the  result  follows  by  transitivity  of  the  notion  of  “same  cardinality” — see  Exercise  10. 

Follow  the  procedure  given  in  the  text  for  all  rational  numbers  and  omit  those  with  even  denominators.  The  listing 
starts  1,2,  j,  3,  4,  |,  j,  5,  6,  |,  |,  |,  7, ...  . 

This  set  is  uncountable.  The  function  defined  by  f{x)  —  x  —  1  gives  a  one-to-one  correspondence  between  it  and 
(0,  1),  which  we  showed  in  the  text  to  be  uncountable. 

This  set  is  countably  infinite.  In  Exercise  3(c)  we  showed  it  is  in  one-to-one  correspondence  with  N  x  N. 

Impossible.  To  the  contrary,  suppose  that  the  union  were  a  finite  set  S.  Since  S  has  only  finitely  many  subsets  (the 
precise  number  is  2^),  there  could  not  have  been  infinitely  many  sets  at  the  outset. 

Impossible.  If  even  one  infinite  set  is  contained  in  a  union,  then  the  union  must  be  infinite. 

Imagine  5)  sitting  inside  .$2,  both  spheres  with  the  same  center.  Rays  emanating  from  this  common  point  establish  a 
one-to-one  correspondence  between  the  points  on  S|  and  the  points  on  .S'2. 

Let  si ,  $2.  $3.  ■  •  •  be  a  countably  infinite  subset  of  S.  Define  / :  S  —>  S  U  {x)  by 

/(■V|  )  =  x 

f(sk+i)  =  sk  for  k  >  I 

f(s)=s  if  J  ^  {j|,  •••  }• 


Then  /  is  a  one-to-one  correspondence. 


We  are  given  that  A  =  {a\ ,  a2, . . .  ,  a„)  for  some  n  e  N  and  that  B  =  {b\ ,  £2,  £3,  • . . }.  Then  A  U  B  is  countably 

infinite  because  it  is  infinite  and  its  elements  can  be  listed  a\,  02,  ■  ■  ■  ,  a„,  b\ ,  63 . The  function  / ;  N  — >  A  U  B 

Uj  if  1  <  n 


bi-n 


if  i  >n. 


corresponding  to  this  listing  is  defined  by  f(i) 
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Exercises  4.1 

1.  (a  +  b)c  =  c(a  +  b)  (by  commutativity)  =  ca  +  cb  (by  the  first  distributive  law)  =  ac  +  be  (by  commutativity  again). 

2.  (a)  True. 

4.  (a)  q  =  29;  r  =  7;  (b)  q  =  -30;  r  =  10  (c)  q  =  -29;  r  =  7  (d)  q  =  30;  r  =  10 

7.  (a)  The  domain  of  /  is  Z;  its  range  is  Z  as  well,  because  given  q  €  Z,  we  have  q  =  f(qn). 

(b)  /  is  not  one-to-one  since  /( 0)  =  /(l).  (Note  that  n  >  1  guarantees  that  the  quotient  is  0  when  either  0  or  1  is 
divided  by  n.) 

(c)  /  is  onto,  as  shown  in  (a). 

9.  By  definition,  x  <  |"x)  <  x  +  1  for  any  x.  Letting  k  —  \a/b'],  we  have  ajb  <  k  <  (a/i>)  +  1.  Multiplying  by  the 
negative  number  b,  we  obtain  a  >  kb  >  a  +  b.  Thus,  0  <  a  —  kb  <  —b.  Letting  r  =  a  —  kb,  we  have  a  =  kb  +  r 
with  0  <  r  <  \b\,  so,  by  uniqueness,  k  =  q  as  asserted. 

10.  (a)  4034  =  (11 1,1 11,000,010)2  =  (7702)8  =  (FC2)i6. 

Exercises  4.2 

1.  (a)  Not  totally  ordered;  for  example,  6  and  21  are  not  comparable. 

(b)  1  is  a  minimum  element  since  I  |  n  for  all  n  e  N,  but  there  is  no  maximum  element  since  given  any  n  e  N,  n  \  2 n,  so 
n  can’t  be  maximum. 

3.  (a)  4  6 


2  3  5  7 


5.  As  in  Problem  6,  write  a  —  nq  +  r ,  where  0  <  r  <  n.  Thus,  r  =  0,  1, 2.  ....  or  n  —  I.  If  r  =  0,  then  a  =  qn  is 

divisible  by  n.  If  r  =  1,  then  a  =  nq  +  I,  so  a  +  n  —  1  is  divisible  by  n.  If  r  =  2.  then  a  =  nq  +  2  and  a  +  n —  2  is 

divisible  by  n.  In  general,  if  r  =  k ,  then  a  +  n  —  k  is  divisible  by  n. 

6.  Assume,  to  the  contrary,  that  for  some  integer  n,  we  have  4  |  {n2  —  2);  that  is,  n2  —  2  =  4x  for  some  integer  x.  We 

derive  a  contradiction.  First  consider  the  case  that  n  is  even;  that  is,  n  =  2 y  for  some  integer  y.  Then  (2y)2  —  2  =  4x 

says  4(y2  —  x)  =  2.  Since  y2  —  x  is  an  integer,  this  says  4 1 2,  which  is  impossible. 

Next  suppose  n  is  odd;  that  is,  n  =  2y  +  1  for  some  integer  y.  Then  (2y  +  l)2  —2  =  4x  says  4(y2  +  y  —  x)  =  1 .  Since 
y2  +  y  —  x  is  an  integer,  this  says  4  1 1,  another  impossibility.  In  either  case  we  reach  a  contradiction,  so  4 1  (n2  —  2) 
can  never  be  true. 

8.  We  have  a  =  q\n  +  r  and  b  =  q2n  +r  for  some  integers  q\,  q2  and  r.  Subtracting,  a  —b  =  ( q\n  +  r)  —  (<?2«  +  r)  = 
(q\  —  qi)n.  Thus,  n  \  (a  —  b)  as  required. 

9.  (a)  True.  Since  a  |  b,  b  =  ax  for  some  integer  x.  Since  b  j  (— c),  —  c  =  by  for  some  integer  y.  Thus,  c  =  —by  =  —axy  = 

a(—xy).  Since  —xy  is  an  integer,  a  |  c. 

11.  (a)  gcd(93.  1 19)  =  1  =  (— 25)(1 19)  +  32(93). 

(b)  gcd(— 93,  119)=  1  =  (-25)(119)  +  (-32)(-93). 

(c)  gcd(— 93,-119)  =  1  =  (25)( — 119)  +  (— 32)(— 93). 

13.  Since  a  and  b  are  relatively  prime,  we  have  ma+nb  =  1  for  some  integers  m  and  n,  so  2  =  2ma  +  2nb.  Now  suppose 
x  |  (a+b)  and  x  |  (a  —b).  Then,  by  Proposition  4.2.2,  x  \[(a+b)  +  (a  —  b)\;  that  is,  x  j  2 a.  Also  x\[(a  +  b)  —  (a  —b)]; 
that  is,  x  |  2b.  Again,  by  Proposition  4.2.2,  x  |  (2 ma  +  2 nb),  so  x  |  2  and  we  conclude  that  x  =  ±1  or  x  =  ±2.  The 
result  follows. 
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14.  If  gi  and  g2  are  each  greatest  common  divisors  of  a  and  b,  then  g]  <  go  (because  go  is  greatest)  and  g2  <  g i 
(because  g]  is  greatest),  so  gi  =  g2- 

16.  (a)  17369(— 10588)  +  (5472)(33608)  =  4. 

(c)  Consider  the  possibility  that  154*  +  260v  =  3  for  certain  integers  x  and  y.  The  left  side  of  this  equation  is  even 
while  the  right  side  is  odd,  a  contradiction. 

17.  (a)  We  know  that  gcd(x,  y)  |  jc  and  gcd(x,  y)  |  y  and  so  gcd(x,  y)  |  (mx  +  ny)  (Proposition  4.2.2). 

19.  Let  gi  =  gcd(a,  b )  and  g2  =  gcd(ac,  be).  We  have  to  show  that  g2  —  eg \ .  First  note  that  cgi  |  ac  since  gi  |  a,  and, 

similarly,  cgi  |  be.  Thus,  cgi  is  a  common  divisor  of  ac  and  be,  hence,  cgi  <  g2,  the  largest  of  the  common  divisors 
of  ac,  be.  On  the  other  hand,  since  gi  =  am  +  bn  for  some  integers  m,  n,  cgi  =  acm  +  ben.  Since  g2  |  acm  and 
g2  |  ben,  it  must  be  that  gi  |  cgi,  and  so,  since  c,  gi,  gi  are  all  positive,  g2  <  cg\.  We  conclude  that  g2  =  cgi,  as 
required. 

21.  Say  x  |  n  and  x  |  (n  +  1).  Then  x  |  [(n  -P  1)  —  n\,  so  x  1 1  and  gcd(w,  n  +  1)  =  1.  We  have  (—  \)n  4-  (l)(n  +  1)  =  1. 

24.  As  suggested  by  the  hint,  consider  the  set  S  of  all  positive  linear  combinations  of  a  and  b.  Since  S  contains  a  =  \a+0b 

if  a  >  0  and  —a  otherwise,  S  is  not  empty  so,  by  the  Well-Ordering  Principle,  S  contains  a  smallest  element  g. 
Since  g  €  S,  we  know  that  g  =  ma  +  nb  for  integers  a  and  b\  hence,  we  have  only  to  prove  that  g  is  the  greatest 
common  divisor  of  a  and  b.  First  we  prove  that  g  j  a.  Write  a  =  qg  +  r  with  0  <  r  <  g.  Note  that  r  is  a  linear 
combination  of  a  and  b  since  r  —  a  —  qg  —  a  —  q(ma+nb)  —  (1  —  qa)m  +  (-qn)b.  Since  g  is  the  smallest  positive 
linear  combination  of  a  and  b,  we  have  r  =  0,  so  g  |  a  as  desired.  Similarly,  g  |  b.  Finally,  if  c  \  a  and  c  |  b,  then 
c  |  (ma  +  nb),  so  c  I  g,  so  c  <  g. 


lcm(63,  273)  =  =819;  lcm(56,  200) 


=  1400. 


26.  (a) 


30.  (a) 


lcm(93,  119)  =  (93-  119)/1  =  11067 

gcd(a,  b)  |  a  and  a  |  lcm(«,  b)  so  gcd(a,  b)  |  lcm(a,  b )  by  transitivity  of  |.  (See  Proposition  4.2.3.) 

Let  x  —  (a, ,£!„-i  . . .  ao)lO  be  the  integer.  Thus,  x  =  aq  +  1 0« i  +  •  ■  •  +  10 nan. 

Now  x  =  (ao  +  a\  4 - b  a„)  +  (9a i  +  99a2  H - f  99  ■  ■  ■  9 a„) 

n  times 

—  (cio  ci i  +  •  •  •  +  cin)  4-  3(3fl|  +  33^2  4-  •  •  •  +  33  •  •  •  3  cin). 

n  times 

Since  ao  +  ai  + - h  a„  is  divisible  by  3  and  3  clearly  divides  the  second  term  on  the  right,  3  |x  as  required. 

Reverse  the  argument  in  (a);  that  is,  note  that  if  3  lx,  then  we  can  conclude  from  the  preceding  that  3  I  (ao+a\+-  •  ■ +an ). 


33.  (b) 


34.  (a)  By  Theorem  4.2.9,  we  know  there  is  at  least  one  pair  of  integers  a.  b  such  that  g  =  am  +  bn.  Now  observe  that  for 
any  integer  k,  it  is  also  true  that  g  =  (a  +  kn)tn  +  (b  —  km)n. 


Exercises  4.3 


1.  (a)  157  is  prime;  (b)  9831  is  not  prime;  3  |  9831. 
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3.  (a)  Note  that  n  =  p(n/p).  Now  if  n/ p  is  not  prime,  then  it  has  a  prime  factor  q  <  ~Jnj p  by  Lemma  4.3.4.  Since  a 

prime  factor  of  n/p  is  a  prime  factor  of  n,  q  would  then  be  a  prime  factor  of  n  not  exceeding  «Jn / p,  contradicting 
the  fact  that  the  smallest  prime  factor  of  n  is  bigger  than  this. 

(b)  Note  that  16,773,121  =  433(38,737)  and  that  v/38,737  «  197.  Since  433,  the  smallest  prime  dividing  38,737,  is 
larger  than  197,  38,737  is  prime,  by  part  (a).  Thus,  16,773,121  =  433(38,737)  is  the  representation  of  16,773,121  as 
the  product  of  primes. 

4.  (a)  856  =  23  •  107;  (e)  (28  -  l)20  =  320  •  520  •  1720. 

5.  If  14"  terminates  in  0,  then  it  is  divisible  by  10  and,  hence,  by  the  prime  5;  that  is,  there  is  a  5  in  the  factorization 

of  14".  This  contradicts  unique  factorization  because  14"  =  2"  7"  says  that  the  only  primes  dividing  14"  are  2  and  7. 

6.  (b)  We  have  to  prove  that  a,b  e  A  implies  a  +  b  e  A.  Let  a,b  €  A.  Then  a  =  ^  and  b  =  where  3  Jf  n,  3  \l.  Then 

fn£,  |  Yik 

a  +  b  = - .  Since  3  is  prime,  if  3  I  nl ,  then  3  I  n  or  3  1 1,  neither  of  which  is  true.  Thus,  3  \  nt,  so  a  +  b  e  A. 

nl  iii  i 

8.  (b)  False.  The  prime  described  here  has  the  miraculous  property  that  it  divides  all  natural  numbers  greater  than  1.  Certainly 
this  prime  cannot  be  2  since  2^3,  neither  can  it  be  odd  since  if  p  is  an  odd  prime,  p  \  2. 

10.  (a)  This  function  is  one-to-one.  If  f(n\,  m\)  =  /(«2.  m2),  then  2m|6'!|  =  2”'26"2,  and  so  2m|2"13'i|  =  2m22"23"2.  By  the 

Fundamental  Theorem  of  Arithmetic,  the  powers  of  3  and  2  on  each  side  of  this  equation  are  the  same.  So  n\  =  nz, 
hence  m\  =  m2  too.  Therefore,  {n\,m\)  =  (n2,  mi). 

11.  (a)  jr(10)  =  4,  10/  ln  10  %  4.343  and  ^  ^  0.921. 

14.  This  is  a  special  case  of  Exercise  10,  Section  4.2  since  if  p  and  q  are  distinct  primes,  then  p  and  q  are  relatively 
prime.  The  result  also  follows  quickly  from  the  Fundamental  Theorem  of  Arithmetic  since,  writing  n  =  p\ pi  -  ■  ■  pr, 
the  hypotheses  say  that  one  of  the  p,-  is  p  and  some  other  pj  is  q. 

15.  Without  loss  of  generality,  we  may  assume  that  a  <  b.  Since  there  are  infinitely  many  primes,  there  exists  an  integer 
n  >  0  such  that  b  +  n  is  prime.  Since  a  +  n  <  b  +  n,  b  +  n  cannot  divide  a  +n,  so  a  +  n  and  b  +  n  are  relatively 
prime. 

16.  (b)  First  note  that,  if  p  is  prime,  d{p2)  =  3  since  the  only  divisors  of  p2  are  1 ,  p  and  p2.  We  claim  that  only  integers 

of  the  form  p2  satisfy  the  condition.  To  see  this,  notice  that  if  n  is  divisible  by  distinct  primes  p  and  q,  then  n  is 
divisible  by  each  of  1,  p,  q  and  pq  (see  Exercise  14)  and  so  d(n)  >  4.  Thus,  n  has  just  one  prime  factor.  Also,  if  n 
is  divisible  by  the  cube  of  a  prime,  p3,  then  n  is  divisible  by  each  of  1,  p.  P2,  p3  so  again,  d(n)  >  4.  We  conclude 
that  n  =  p2  for  some  prime  p. 

17.  (a)  No.  215  -  1  =  (2 5  -  1)(210  +  25  +  1). 

18.  (a)  26  +  1  =  (22  +  1)(24  -  22  +  1). 

20.  Suppose,  to  the  contrary,  that  p  and  q  are  consecutive  primes  (in  that  order)  such  that  p  +  q  =  2r,  where  r  is  a  prime. 
Then  p  =  =  r  <  =  q-  that  is,  p  <  r  <  q.  This  contradicts  the  fact  that  p  and  q  are  consecutive 

primes. 

22.  This  is  true,  and  the  proof  has  given  Andrew  Wiles  a  place  in  history.  This  is  Fermat’s  Last  Theorem  (4.3.14)! 


23.  The  condition  tells  us  that  in  the  prime  decompositions  of  a,  b  and  c,  no  prime  appears  in  two  of  the  decompositions. 
Flence,  a  =  p“'  p“2  •  ■  ■  pf' ,  b  =  qf 1  q^2  ■  ■  ■  ,  c  =  rf  r^2  ■  ■  ■  rf  with  the  primes  p, ,  q,,  r,  all  different.  Thus, 

ab  =  pax'  ■■■Pa/q['  ■■■qt 
ac  =  p“'  ■■■  p“rr\'  ■  ■  ■  r * 
be  =  qf1  ■■■qfsr\'  ■  ■  ■  rj' . 

Suppose  gcd(ab,  be,  ac)  ^  1.  Then  there  must  exist  a  prime  s  such  that  s  |  ab,  s  \  be,  s  |  ac.  But  there  is  no  prime 
common  to  the  decompositions  of  ab,  ac,  be.  Hence,  no  such  s  exists  and  ged {ab,  be,  ac)  =  I. 

25.  (a)  gcd(ab,  p4)  =  p3. 
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Writing  a  =  par  and  b  =  j/' s  with  gcd(p,r)  =  I  and  gcd(/?,.r)  =  1,  there  are  two  possibilities:  If  /?  =  1,  then 

gcd(fl2,  b)  =  p,  while  if  fi  >  1,  then  gcd(a2,  b)  =  p2. 

Let  the  prime  decompositions  of  a  and  b  be 

a  =  p“'  p“2  ■■■  Parr  and  b  =  <?f 1  q!p  ■  •  •  qfs . 

Since  gcd(a,  b)  =  1,  we  know  that  pt  /  qj  for  any  i  and  j.  Thus,  ab  =  p“'  •  •  ■  Pr'q^'  •  •  •  with  no  simplification 
possible.  Since  ab  =  jc2,  it  follows  that  each  a,  and  each  p ,  must  be  even.  But  this  means  that  a  and  b  are  perfect 
squares. 

Let  a  =  p“2  •  •  •  prr .  Since  a  is  a  square,  all  the  a,  are  even.  Since  a  is  a  cube,  all  the  a,-  are  divisible  by  3.  It 

follows  from  Exercise  14  that  6|a,  for  all  i.  This  means  that  a  is  a  6th  power. 

True.  Write  a  =  p“l  p“2  ■  •  ■  p“r  as  the  product  of  powers  of  primes.  Then  a11  =  p|iai  •••/?/'“''.  If  p  |  a”,  then  p 

must  be  one  of  the  p,,  so  p\a. 

Whenever  a,  b,  c  satisfy  x2  +  y2  =  z2,  so  also  do  ka,  kb,  kc  satisfy  the  equation,  for  any  integer  k.  Thus,  for  example, 
all  triples  of  the  form  3k,  4k,  5k  satisfy  a2  +  b2  =  c2. 

Write  3n  +  2  =  pipi  •  •  •  Pm  as  the  product  of  (necessarily  odd)  primes.  Each  of  the  p,  is  either  3  or  of  the  form 

3k  +  1  or  3k  +  2.  Now  we  notice  that  the  product  of  integers  of  the  form  3k  +  \  is  of  the  same  form  and  the  product 

of  3  with  such  an  integer  is  a  multiple  of  3.  So  if  none  of  the  p,  is  of  the  form  3k  +  2,  the  product  cannot  be  either. 

If  we  drop  the  word  odd,  the  result  is  false.  For  example,  3(2)  +  2  =  8  is  divisible  by  only  the  prime  2,  which  is  not 

of  the  form  3n  +  2  for  n  e  N. 

35.  (a)  /( 8)  =  64;  (b)  x  =  1000. 

Exercises  4.4 

1.  (a)  5,  12,  19,  —2,  -9, -16  are  in  5.  4,  11,  18,  -3,-10, -17  are  in  ^3. 

(b)  The  general  element  of  5  is  an  integer  of  the  form  Ik +  5  for  some  integer  k.  The  general  form  of  an  element  in  —3 
is  an  integer  of  the  form  Ik  —  3  for  some  integer  k. 

3.  (a)  1286  (mod  39)  =  38;  (c)  -545,608  (mod  51)  =  41. 

4.  (a)  False:  18  —  2  =  16  is  not  divisible  by  10.  (c)  True:  44  —  (—8)  =  52  is  divisible  by  13. 

5.  (a)  0  =  3, 1  =  4,  2  =  5. 

6.  (a)  4  (mod  6);  (e)  2,  4,  8,  5,  10,  9,  7,  3, 6,  1 

8.  Observe  that  5°  =  1  (mod  7),  51  =5  (mod  7),  52  =  4  (mod  7),  53  =  6  (mod  7),  54  =  2  (mod  7)  and  55  =  3 

(mod  7).  Thus,  each  of  the  integers  1, 2,  3, 4,  5,  6  is  congruent  mod  7  to  some  power  of  5.  By  Proposition  4.4.5,  any 

integer  a  is  congruent  mod  7  to  one  of  0,  1, 2,  3,  4,  5,  6.  So  the  result  follows. 

9.  (a)  No  x  exists.  The  values  of  3x  mod  6  are  0  and  3.  (e)  x  =  9,  34;  (f)  x  =  9;  (k)  x  —  56. 

10.  (b)  i.  x  =  2,  11.  iv.  No  x  exists.  The  values  of  x2  (mod  6)  are  0,  1,  3  and  4. 

11.  (a)  There  is  just  one  solution  mod  6;  namely,  x  =  0,  y  =  1. 

(b)  Subtracting  the  first  congruence  from  the  second  gives  3x  =  —  2  =  7  (mod  9).  But  the  values  of  3x  mod  9  are  just 
0,  3  and  6.  There  is  no  solution. 

12.  Since  gcd(c,  n)  =  1,  we  have  cx  +  ny  =  1  for  some  integers  x  and  y.  We  also  know  that  ac  —  be  =  nk  for  some 
integer  k.  So  ( a  —  b)cx  =  nkx;  that  is,  (a  —  b)(  1  —  ny)  =  nkx,  or,  a  —  b  =  n(kx  +  (a  —  b)y).  Thus,  n  j  (a  —  b)  and 
a  =  b  (mod  n),  as  desired. 

13.  Since  n  \  {a  —  b),  we  have  a  —  b  =  qn  for  some  integer  q,  hence,  a  —  qn  =  b.  Let  g  =  gcd(a,  n ).  Then  g  |  a  and 
g\n.  Therefore,  g  |  b  (and  g  |  n),  so  g  \  ged  (b,  n).  Similarly  gcd(b,  n )  |  g  and  so  these  natural  numbers  are  equal. 


27.  (a) 

28. 

30. 

31.  (a) 

32. 

33.  (a) 
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14.  (a)  This  is  false.  Consider  r  =  5,  s  =  1,  a  =  2,  n  =  4.  Then  5  =  1  (mod  4),  but  25  ^  21  (mod  4)  because  2s  —  21  =  30 

and  4  ^  30. 

15.  (a)  False!  For  example,  with  a  =  b  =  1  and  n  =  4,  we  have  a  =  b  (mod  4),  but  3a  =  3,  b2  =  1  so  3  ^  1  (mod  4). 

16.  (a)  Suppose  \pl  —  a  jb  for  some  integers  a  and  b.  If  a  and  b  have  any  factors  in  common,  these  can  be  canceled  leaving 

us  with  an  equation  of  the  form  %/2  =  a/b  where  a  and  b  have  no  common  factors  (except  ±  1 ).  So  we  now  make 
this  assumption.  Then  a 2  =  2b2,  so  a2  +  b2  =  3 b2  =  0  (mod  3).  By  Problem  23,  a  and  b  have  3  as  a  common 
factor,  which  is  not  true. 

17.  (b)  x  =  1  and  x  =  p  —  1 . 

18.  (a)  i.  x  =  1  and  x  =  24. 

20.  (a)  By  Fermat’s  Little  Theorem,  188970  =  1  (mod  8971).  So  188971  =  18  and  188972  =  182  =  324  (mod  8971). 

21.  The  key  to  this  problem  is  the  observation  that  97  is  a  prime.  Thus,  by  Fermat’s  Little  Theorem,  if  x  ^  0  (mod  97), 
then  x91  =  x  (mod  97),  so  jc97  —  jc  +  1  ==  1  ^  0  (mod  97).  Since  x  =  0  (mod  97)  is  also  not  a  solution,  there  are 
no  solutions  at  all. 


Exercises  4.5 

1.  (a)  This  number  is  valid.  (b)  This  number  is  not  valid. 

2.  (a)  A  =  9;  (d)  A  =  7. 

4.  a i  +  2 a2  H - h  9a<)  +  10am  =  0  (mod  1 1)  -o-  a\  +  2aj  H - b  9a$  =  —  1 0a j o  a\  +  2ai  4 - \-9ag  =  a\ o  since 

—  10aio  =  aio  (mod  11). 

6.  (a)  For  example,  2-419713-29-0  is  not  valid  because  a\  +  2l\2  H - F  lOaio  =  208  ^  0  (mod  11). 

7.  (a)  The  given  rule  is  w  •  a  =  0  (mod  11)  for  a  =  (ai,  aj . am)  and  w  =  (1,  1,  1,  1,  1,  1,  1,  1,  1,  1). 

8.  (a)  1-13579-02468-8;  (d)  0-63042-00635-5. 

9.  (a)  Not  valid:  3(0  +  24-44-6-4-84-0) +  (1+3  +  54-7  +  9-1-1)  =  60  4-26  =  86  #0  (mod  10). 

(c)  Valid:  3(2  +  2  +  9+  l  +  3  +  4)  +  (5  +  9  +  8  +  7  +  9  +  9)  =  63  +  47  =110  =  0  (mod  10). 

10.  (a)  x  =  1. 

11.  (a)  Suppose  the  sum  of  the  digits  in  the  odd  positions  is  a  and  the  sum  of  the  digits  in  the  even  positions  is  b.  If  the 

check  digit  in  our  bar  code  is  correct,  3a  +  b  ==  0  (mod  10).  An  error  in  an  even  position  digit  changes  b  to  c  ^  b 

(mod  10).  For  the  new  number,  we  compute  3a  +  c  ^  3a  +  b  and  thus,  3a  +  c  #  0  (mod  10).  So  we  know  an  error 

has  been  made. 

(b)  Here  is  a  valid  number:  3-12498-66132-9.  Changing  the  1  and  4  in  positions  two  and  four  to  8  and  7  respectively 
produces  the  number  3-82798-66132-9  which  is  also  valid. 

13.  (a)  1-23586-98732-6;  (b)  1-23006-98732-6. 

15.  (b)  Single-digit  errors  are  not  detected.  Since  10a3  =  0  (mod  10),  the  third  digit  can  in  fact  be  changed  arbitrarily  and 

the  new  number  will  still  pass  the  test. 

16.  (a)  If  n  <  9,  then  any  a,  in  the  range  0  <  a,-  <9  —  n  could  be  replaced  by  n  +  a,  and  the  test  would  miss  the  error. 

17.  (a)  jc  =  18;  (d)  x  =  142. 

18.  (a)  x  =  53;  (d)  x  =  260. 

19.  The  number  was  340. 
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20.  (a)  x  =  297  (mod  900).  Since  there  are  many  integers  less  than  50,000  which  are  congruent  to  297  (mod  900),  it  is 

impossible  to  determine  ab  exactly. 

(e)  ab  =  27,887  (mod  34,300);  ab  =  27,887. 

21.  First  we  address  the  hint.  Suppose  you  want  to  write  Is  4-  22 1  =  1 .  The  “obvious”  numbers  are  s  =  —3,  /  =  1 : 
(—3)7  4-  (1)22  =  1.  But  suppose  you  want  s  >  0.  This  can  be  achieved  by  adding  and  subtracting  22(7)  in  the 
equation  (—3)7  4-  1(22):  (—3  4-  22)7  +  (1  —  7)22  =  I;  that  is,  19(7)  4-  (—6)22  =  1.  In  general,  we  know  that 

1  =  sm  +  tn  for  some  integers  s  and  t  by  Theorem  4.2.9.  Now  if  s  0,  choose  k  such  that  s  +  kn  >  0  and  note  that 
Os  +  kn)m  +  (t  —  km)n  =  1. 

22.  (a)  300;  (c)  259. 

23.  (a)  Z;  (c)  G. 

24.  (a)  HI. 

Exercises  5. 1 

1.  (a)  £?=  i  i2  =  l2  4-  22  4-  32  +  42  +  52  =  55  (b)  £?=)  2‘  =  21  +  22  +  23  +  24  =  30 

(c)  5Zr=i  sin7rt  =  sinTr(l)  =  sin;r  =0 

4.  (a)  If  n  =  1,  n2  +  n  =  l2  +  1  =  2,  which  is  divisible  by  2.  Suppose  k  >  1  and  the  result  is  true  for  n  =  k;  that  is, 

2  |  (A:2 -FA:).  We  wish  to  prove  that  2  |  [(k  + 1)2  +  (/:+  1 )].  But  (k  +  l)2  +  (fc+  1)  =  k2+2k+  1  +k+  1  =  k2 +k  +  2(k  +  1). 
Since  2  |  ( k 2  +  k)  (by  the  induction  hypothesis)  and  since  2  |  2(/0  +  1),  2  divides  the  sum  of  k2  +  k  and  2 (k  +  1); 
that  is,  2  |  [(k  +  l)2  +  (k  +  1)]  as  required.  By  the  Principle  of  Mathematical  Induction,  the  result  holds  for  all 
n  >  1. 

(c)  If  n  =  1,  «3  +  (n  +  l)3  4-  (n  4-  2)3  =  l3  +  23  +  33  =  36  and  9]  36.  Suppose  the  result  is  true  for  n  =  k  >  1 ;  that  is, 
9 1  [>fe3  +  (k+  l)3  +  (k  4-  2)3].  Then  (k+  l)3  +  (it  +  2)3  4-  (k  +  3)3  =  (k+  l)3  +  (k  +  2)3  +  k2,  +  9k2  +  21k  +  27  = 
[&3  +  (k  +  1  )3  +  (k  4-  2)3]  4-  [9k2  +  21k  +  27].  Since  9  divides  the  first  term  by  the  induction  hypothesis  and  since  9 
clearly  divides  the  second  term,  9  divides  (k  +  1  )3  4-  (k  +  2)3  +  {k  +  3)3,  as  desired.  By  the  Principle  of  Mathematical 
Induction,  the  result  holds  for  all  n  >  1 . 

5.  (a)  For  n  =  1,  1  4-  2  4-  3  •  •  •  4-  n  is  1,  by  definition,  and  'l("2+l)  =  |  =  1  also.  Thus,  the  result  is  true  for  n  =  1.  Now 

suppose  that  k  >  1  and  the  result  is  true  for  k\  that  is,  assume  that  1  4-24-34----4-fc  =  . 

Then  1  4~  2  +  •  •  -  4~  {k  4~  1)  =  (1  4-  2  4~  •  •  •  4~  k)  4~  ik  4-  1) 

k(k  4-  1) 

=  - - - 1-  (k  +  1 )  (by  the  induction  hypothesis) 

_  k(k  +1)4-  2(k  4-  1)  _  (*+!)(*  + 2) 

2  2 

which  is  the  given  statement  with  n  =  k  4-  1 .  So,  by  the  Principle  of  Mathematical  Induction,  we  conclude  that 
1  4-  2  4-  •  •  •  4 -  n  =  5«(«  4-  1)  for  all  n  >  1. 

6.  (a)  If  n  =  1,  1  4-  21  4-  22  4-  23  4-  •  •  ■  4-  2"  =  14-2  =  3,  while  21  +  l  —  1  =  4  —  1  =  3,  so  the  result  is  true  if  n  =  1. 

Suppose  the  result  is  true  for  n  =  k  >  1;  that  is,  suppose  1  4-  2  4-  22  4-  23  4-  •  •  •  4-  2*  =  2*+1  —  1.  We  wish  to 
prove  the  result  for  n  =  k  4-1;  that  is,  we  wish  to  prove  that  1  4-  2  4-  22  4-  23  4-  ■  •  •  4-  2k+]  =  2(<:+1)+1  —  1.  Now 

1  4-2  4-224-234 - h2*+1  =  (1  4- 2  4- 22  4- 23  4 - h  2*>  4-  2*+1  =  (2k+x  —  1)4-  2*+l  (by  the  induction  hypothesis) 

=  2  ■  2*+l  —  1  =  2k+1  —  1  =  2^+1)+1  —  1  as  required.  By  the  Principle  of  Mathematical  Induction,  we  conclude  that 
the  given  assertion  is  true  for  all  n  >  1 . 

(b)  If  n  =  1,  l2  -22  4-32  -42  +  --  -  4-  (-l)"-'«2  =  l2  =  1,  by  definition,  while  =  Iii+12  =  i, 

and  so  the  result  is  true  for  n  =  1.  Now  suppose  that  k  >  1  and  the  result  is  true  for  n  =  k\  that  is,  suppose  that 

l2  -  22  4-  32  -  42  4-  •  •  -  4-  (-1)*-1*2  =  (-1  )k~'kik^  — . 


Section  5.1 


S-23 


We  wish  to  prove  that  l2  -  22  +  32  -  42  +  •  •  •  +  (— l)(*+l)1  (*:  +  l)2  =  (-l)u+1)_1  ^  +  1)[(/^+  1)  +  1]. 
Now  l2  -  22  +  32  -  42  +  •••  +  (-l)f*+I)”1(A:+  l)2 

=  [  1 2  —  22  +  32  —  42  -t-  -  -  -  +  (-l)<r“‘/t2]  +  (-l)(<r+1)“1(A:+  l)2 

a-iW+D 


=  (-DK 


=  (-!)' 


k- 1 


2 

A:2  +  k 


+  (—!)(£  -f  1)  (using  the  induction  hypothesis) 


+  ( — 1)*— *( —  1  )(/r2  +  2A:  +  1) 


=  <-!/-'( 


=  (-D 


k-l 


=  (-D 


k-l 


k2  +  k -2k2  -4k -2 


t-k2  —  3k  —  2\ 

( - 5 - 


,  (k  +  m  +  2) 

2 


as  desired.  By  the  Principle  of  Mathematical  Induction,  the  result  is  true  for  all  n  >  1 . 

n 

7.  (a)  J22>i 


k= 0 


1 

I 

/=) 


8.  (a)  For  n  =  1,  ^jP(/  +1)2*  =  2(2')  =  4,  while  n2"+l  =  1(22)  =  4.  Thus,  the  formula  is  correct  for  n  =  1.  Now  suppose 


that  k  >  1  and  the  formula  is  correct  for  n  =  k;  thus,  we  suppose  that  ^(i  +  1)2'  =  k2k+] . 


i=\ 


k+ 1 


We  must  prove  that  the  formula  is  correct  for  n  =  k  +  1;  that  is,  we  must  prove  that  ^(/  +1)2'  =  ( k  +  1  )2‘ 


k+2 


/=! 


k+\ 

Now  Yxi  +  1)2'  =  +  1)2'  +  (k  +  2)2*+l  =  A'2<+l  +  (k  +  2)2A  +  1  (using  the  induction  hypothesis) 

=  1 


i= 1 


=  (k  +  k  +  2)2A+I  =  (2k  +  2)2a+i  =  2  (k  +  1)2A+1  =  (k  +  1)2A+2 


as  required.  By  the  Principle  of  Mathematical  Induction,  the  given  assertion  is  true  for  all  n  >  1. 

9.  (a)  If  n  =  5,  25  =  32,  52  =  25.  Since  32  >  25,  the  result  is  true  for  n  =5.  Now  suppose  that  k  >  5  and  the  result  is 
true  for  n  —  k\  that  is,  suppose  that  2k  >  k2.  We  must  prove  that  the  result  is  true  for  n  =  k  +  1;  that  is,  we  must 
prove  that  2A+I  >  (k  +  l)2.  Now  2A+1  =  2  ■  2k  >  2 k2  by  the  induction  hypothesis,  and  2 k2  =  k2  +  k2  >  k 2  +  5k 
(since  k  >  5),  and  k2  +  5A:  =  k2  +  4k  +  k  >  k2  +  2k  +  \  (since  k  >  1),  and  k2  +  2k  +  1  =  (k  +  l)2.  By  the  Principle 
of  Mathematical  Induction,  the  result  is  true  for  all  n  >  5. 

l  i  i 

10.  (a)  For  n  =  1,  ^(jc,  +  y,)  is,  by  convention,  just  the  single  term  x\  +  yi.  Since  this  equals  ^  yi  the  formula 

1  =  1  1=1  /  =  1 
holds  for  n  =  1.  Now  suppose  k  >  1  and  the  formula  holds  for  n  =  k.  Then 
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11. 


*+i 

+-v') 

/=! 


[x>  +  >’/)]  +  (xk+ 1  +  yk+ 1) 

1  =  1 

k  k 

7>  +  Y,  yi  +  (**+i  +  y*+i )  (by  the  induction  hypothesis) 

i=  1  i  =  l 


[(i>) +**+>] +[(x»  +  yu+ 1  j 


*+1  /t+1 

i=i  i=i 


which  is  the  formula  with  n  =  k+  1.  Thus,  the  result  holds  for  all  rc  >  1  by  the  Principle  of  Mathematical  Induction. 

The  k  to  k  +  1  step  does  not  apply  to  the  case  k  =  1.  When  k  =  1,  G  =  {fli.fli}.  Observe  that  the  groups  {«i },  (02) 
have  no  member  in  common. 


12.  The  induction  was  not  started  properly.  When  n  =  1,  the  left  side  is  I,  while  the  right  side  is  9/8.  The  statement  is 
not  true  when  n  =  1 . 

15.  (a)  We  leave  the  primality  checking  of  /(I), . . .  ,  /( 39)  to  the  reader,  but  note  that  /(40)  =  412. 

(b)  f(k 2  +  40)  =  ( k 2  +  41  +  k)(k2  +  41  -  k). 

16.  If  a  set  A  contains  n  =  0  elements,  then  A  =  0  and  A  has  exactly  1  subset,  namely,  0.  Since  1  =  2°,  the  statement 
is  true  for  n  =  0.  Now  assume  that  a  set  of  k  elements  has  2k  subsets.  Let  A  =  [a\ ,  02, . . .  ,  ak+] }  be  a  set  of  k  +  1 
elements.  We  show  that  A  must  have  2*+l  subsets.  This  follows  because  every  subset  of  A  either  contains  ak+\  or  it 
doesn’t.  By  the  induction  hypothesis  there  are  2k  sets  which  do  not  contain  ak+\\  but,  by  the  induction  hypothesis, 
there  are  also  2k  subsets  which  do  contain  ak+ 1  since  these  are  precisely  those  sets  obtained  by  forming  the  union  of 
a  subset  of  (a\ , 02, . . .  ,  ak)  and  (a^+i }.  Altogether  A  has  2k  +  2k  =  2(2*)  =  2*+l  subsets.  So,  by  the  Principle  of 
Mathematical  Induction,  we  conclude  that  a  set  of  n  elements  contains  2n  subsets. 

17.  Let  S  be  a  nonempty  set  of  natural  numbers.  Let  a  be  any  element  of  S.  Then  the  smallest  element  of  S  is  the  smallest 
element  of  the  nonempty  set  {.r  €  S  \  !  <  jc  <  a}  (which  is  finite). 

19.  For  n  =  1,  P|"=)  fl,  =  fi|,  and  so  A  U  (n!=i  #;)  =  A  U  B\  =  P|"=[(A  U  S,)  and  the  result  holds  for  n  =  1.  Now 

assume  that  A  U  (p|f=i  #/)  =  H/ =  1  ( U  #<)  for  k  >  1.  Then,  given  a  set  A  and  k  +  1  sets  B\,  ZL . J3*+i,  we 

have 

A  u  (n^,1  Bt)  =  A  U  (  (  nti  Bi  )  n  Bk+l  ) 

=  (  A  U  (  rti  Bi )  )  n  (  A  u  Bk+ 1  )  since  A  U  (B  (T  C)  =  (A  U  B)  (T  (A  U  C) 

=  (ft=,(A  U  Bj))  fl  (A  U  Bk+\)  (by  the  induction  hypothesis) 


=  rt±i'(Aufl,), 

giving  the  result  for  k+  1.  By  the  Principle  of  Mathematical  Induction,  we  conclude  that  A  U(  n"=i  Bi )  =  p|"=1(  AU 
Bj ),  for  all  n  >  1 . 

23.  When  n  =  2,  gcd(ai,fl2)  =  •'‘lfli  +-^202  by  Theorem  4.2.9.  Now  suppose  that  k  >  2  and  the  statement  is  true  for 
n  =  k;  that  is,  suppose  that  the  gcd  of  any  k  >  2  integers  is  an  integral  linear  combination  of  them.  We  have  to  show 
the  statement  is  true  for  n  =  k  +  1,  that  is,  that  the  gcd  of  k  +  1  integers  is  an  integral  linear  combination  of  them. 
Now  gcd(ai,  a2, . . .  ,  ak+ 1)  =  gcd(ai,  gcd(«2, .  ■ .  ,  a*+i))-  By  the  induction  hypothesis,  there  are  k  integers  rj,  rs, 
. . .  ,  />+ 1  such  that  gcd(a2,  ....  ak+ 1)  =  Yi= 2  r‘  a‘  ■  Then  Theorem  4.2.9  tells  us  that 

gcd(ai ,  ^2+l  riai)  =xa\  +y(J2riai)  =xa\  +  (yr2)a2  H - (yrk+\)ak+\ 

as  desired  (si  =  x,  S2  =  yr 2,  . . .  ,  ^+1  =  yrk+ 1).  By  the  Principle  of  Mathematical  Induction,  the  statement  is  true 
for  all  n  >  1. 
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28.  When  n  =  2,  we  have  two  distinct  points  in  the  plane  joined  by  exactly  one  line.  Since  2(2L 1 1  =  1 ,  the  result  holds 
in  this  case.  Now  suppose  the  result  is  true  for  n  =  k  >  2;  that  is,  suppose  that  the  number  of  lines  obtained  by 
joining  k  distinct  points  in  the  plane,  no  three  of  which  are  collinear,  is  jk(k  —  1).  We  must  prove  the  result  for 
n  =  k  +  1;  that  is,  we  must  prove  that  the  number  of  lines  obtained  by  joining  k  +  1  distinct  points,  no  three  of  which 
are  collinear,  is  \(k  +  l)[(k  +  1)  —  1]  =  j k(k  +  1).  Let  P  be  one  of  the  k+  1  points.  By  the  induction  hypothesis,  the 
remaining  k  points  are  joined  by  \k{k  —  1)  lines.  On  the  other  hand,  P  can  be  joined  to  each  of  those  remaining  k 
points,  giving  k  additional  lines.  (Since  no  three  of  the  k  +  1  points  are  collinear,  these  additional  lines  are  different 
from  the  lines  determined  by  the  other  k  points.)  The  total  number  of  lines  is 

k(k  —  1)  (  k(k  —  1)  +  2k  k2  —  k  +  2k  k2  +  k  k(k  +  1) 

2  f  '  -  2  “  2  ~  2  ~  2 
as  desired.  We  conclude  that  the  given  assertion  is  true  for  all  n  >  2,  by  the  Principle  of  Mathematical  Induction. 

31.  (a)  First  set  aside  two  coins;  then  separate  the  remaining  six  into  two  piles  of  three  and  compare  weights.  If  the  weights 

are  equal,  the  only  possible  light  coin  is  one  of  the  two  set  aside  and  this  can  be  found  in  a  second  weighing.  If  one 
set  of  three  is  lighter  than  the  other,  then  there  is  indeed  a  light  coin  and  it  is  one  of  a  known  three.  Remove  one  of 
these  three  and  compare  the  weights  of  the  remaining  two  coins  (with  a  second  weighing).  Either  one  of  these  coins 
will  be  found  to  be  lighter  or  the  one  set  aside  is  the  light  one. 

32.  (a)  False.  If  n  =  3,  5"  +  n  +  1  =  129  and  7  j  129. 

1  -  x2'+'  " 

36.  lim  - =  lim  rfd+.v2' )  (using  the  result  of  Exercise  35) 

jc->±1  l-X2  i^±l  1  1 

r=\ 
n 

=  f[d  +(±l)2')  =  2n. 

r= 1 


Exercises  5.2 


1.  (a)  ai  =  l;  ak+]  —  5ak  for  k  >  1. 

2.  (a)  16,8,4,2,  1,1,1 

4.  When  n  =  1,  the  formula  becomes  ( i-D(.i+2)(i  +1+2)  __  js  fl|  Now  assume  that  the  formula  is  correct  when 


n  =  k.  When  n  —  k  +  I , 


,  ,  ,  (k- l)(k  +  2)(k2  +  k  +  2) 

ak+ 1  —  {k  +  1 )  +  (ik  —  (k  +  1  )■  H - - - 


4k3  +  12 k2  +  \2k  +  4  +  k4  +  2 k3  +  k2-  4 

__  _ 

k4  +  6k3+  13k2  +  12k 
_  4 

k(k  +  3)(k2  +  3k  +  4)  k(k  +  3)[(k  +  l)2  +  (k  +  1 )  +  2] 

4  4 

as  desired.  By  the  Principle  of  Mathematical  Induction,  we  conclude  that  the  formula  is  correct  for  all  n  >  1 . 

6.  The  first  six  terms  are  1, 3,  7,  15.  31, 63.  Our  guess  is  that  an  =  2"  —  1.  When  n  =  1,  21  —  I  =  1.  agreeing  with 
a\ .  Now  assume  that  k  >  1  and  the  result  is  true  for  n  =  k;  that  is,  assume  that  =  2k  —  1.  We  must  prove  the 
result  is  true  for  n  =  k  +  1;  that  is,  we  must  prove  that  a*+ 1  =  2k+l  —  1.  Now  a*+ 1  =  2 at  +  1  =  2(2*  —  1)  +  1  (by 
the  induction  hypothesis)  =  2*+1  -24-1=  2*+l  —  1,  as  required.  By  the  Principle  of  Mathematical  Induction,  we 
conclude  that  the  result  is  true  for  all  n  >  I . 
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a\  —  1,  a%  =  22  —  a\  =  4—  1  =  3,  03  =  32— aj  =  9—3  =  6.  <24  =  42— 123  =  16—6  =  10,  05  =  52—  <24  =  25—10  =  15, 
=  62  —  05  =  36  —  15  =  21.  Thus,  the  first  six  terms  are  1,3,6,  10,  15,21.  We  recognize  these  numbers  as  the  six 
sums  1,  1+2,  1+2  +  3,  1+2  +  3  + 4,  1+2  +  3+4  +  5  and  1+2  +  3 +  4  +  5  +  6  and  so  guess  that  an  —  n(n  +  l)/2. 
(See  Exercise  5(a)  of  Section  5.1.) 

For  n  =  1,  n(n  +  l)/2  =  l(2)/2  =  1,  which  agrees  with  a\.  Now  assume  the  formula  holds  for  n  =  k\  that  is,  assume 
at  =  k  (kt 1 1  ■  We  must  prove  the  formula  holds  for  n  =  k  +  1 ;  that  is,  we  must  prove  that  at+i  =  tt±iM+2l.  Now 


at  =  2  ’ .  We  must  prove  the  formula  holds  for  n  =  k  +  1 ;  that  is,  we  must  prove  that  at+ 1  =  -  2 


at+\  ={k+  l)2  -ak-(k  +  l)2  - 


k{k+  1) 


using  the  induction  hypothesis 


2(k+  l)2  -k(k+  1)  (Jt  +  1)[2(*  +  1)  —  *] 


(*+  l)(*  +  2) 
2 


as  required.  By  the  Principle  of  Mathematical  Induction,  we  conclude  that  the  result  is  true  for  all  n  > 

The  first  few  terms  are  0,  1,0,  4,  0,  16, ...  .  Our  guess  is  that 

[0  if  n  is  odd 
a„  =  1  n  _ , 

'  45  if  n  is  even. 


We  will  prove  this  using  the  strong  form  of  mathematical  induction.  Note  first  that  ay  =  0  and  4J_I  =  4°  =  1  =  ai 
so  that  our  guess  is  correct  for  a\  and  ao-  Now  let  k  >  2  and  assume  the  result  is  true  for  all  n,  1  <  n  <  k.  We 

must  show  that  the  result  is  true  if  n  =  k.  If  k  is  odd,  then  ak  =  4a<._2  =  4(0)  =  0  since  at- 2  =  0  by  the  induction 

hypothesis,  k  —  2  being  odd.  If  k  is  even,  then 

k-  2  I  k-2  k_, 

ak  =  4a*_2  =  4(4  2  )  (using  the  induction  hypothesis)  =  4_3_  =  42 

as  desired.  By  the  Principle  of  Mathematical  Induction,  the  result  is  true  for  all  n  >  1 . 

15.  (a)  The  first  ten  terms  are  2,  5,  8,  11,  14,  17,  20,  23,  26, 29.  The  123rd  term  is  2  +  122(3)  =  368. 

(b)  We  attempt  to  solve  752  =  2  +  (n  —  1)3  and  discover  n  —  1  =  250.  So  752  does  belong  to  the  sequence;  it  is  the 

25 1  st  term. 


(d)  The  sum  of  75  terms  is  ^[2(2)  +  (74)3]  =  8475. 

16.  (a)  an  =  —1;  ag2  =  —  38j;  (b)  S  -  — 

19.  For  11  =  1,  a  +  (1  —  1  )d  =  a,  which  agrees  with  ai  =  a.  Now  assume  that  the  formula  is  correct  for  the  integer  k, 
that  is,  that  ak  =  a  +  (k  —  l)nf.  Then  ak+ 1  =  ak  +d  =  a  +  (k  —  \)d  +  d  =  a+kd  shows  that  the  formula  is  true  also 
for  k  +  1.  So,  by  the  Principle  of  Mathematical  Induction,  the  formula  is  correct  for  all  n  >  1 . 

20.  (a)  The  first  ten  terms  are  59049,  -19683,6561,  -2187,729,  -243,81,  -27,9,  -3.  The  33rd  term  is  59049(-4)32  = 


23.  (a)  ang  =  (— .0000 1 240)(— 1 . 1 ) 128  «  -2.4643.  (b)  S  =  -  00001 2400-(.- >■>>  ~  «  -1.2908. 

24.  This  is  straightforward  to  prove  by  mathematical  induction  but  we  give  an  alternative  proof.  Letting  S  =  a  +  ar  + 

ar2  +  •  ■  •  +ar"_1,  we  have  rS  =  ar  +  ar 2  H - 1-  ar'1 .  Subtraction  gives  (1  —  r)S  —  S  —  rS  =  a  —  arn  =  a(l  —r”) 

from  which  the  formula  follows. 

I  —  rn  a 

26.  (b)  If  |r|  <  1,  then  lim^oo  /"  =  0.  Hence,  lim  a - -  - .  (c)  6. 

«-»o o  1  —  r  I  —  r 


28.  (a)  This  is  the  sum  of  an  arithmetic  sequence  with  a  =  15,  d  =  — 1.  Solving  —61  =  a  +  (n  —  l)d  =  75  +  ( n  —  1)(— 4), 
we  obtain  n  =  35.  So  the  sum  is  y[2(75)  +  34( — 4)]  =  245. 
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29.  Suppose  the  arithmetic  sequence  ao,  ao  +  d,ao  +  2d, ...  is  also  the  geometric  sequence  a i,  a\r,  a\r2, ....  Then 
«o  =  a\.  Let's  call  this  number  a.  Note  that  one  and  only  one  possibility  occurs  when  a  =  0  (namely,  0,  0,  0, . . . )  so 
assume  henceforth  that  a  /  0.  Since  the  sequence  is  both  arithmetic  and  geometric,  it  follows  that  all  terms  of  this 
sequence  are  nonzero.  We  have  a  +  d  =  ar  and  a  +  2d  =  ar2,  so 

a  +  d  ar  a  +  2d 

a  a  a  +  d 

So  (a  +  d )2  =  a(a  +  2d),  giving  a 2  +  2  ad  +  d2  =  a2  +  lad  and  hence  d  =  0.  Since  ar=a+d=a,r  =  1.  The 
only  arithmetic  sequences  which  are  also  geometric  are  constant,  of  the  form  a,  a,  a, .. .  for  some  number  a. 

31.  (a)  $1000(1. 15)2  =  $1322.50. 

(b)  We  seek  t  such  that  1000(1.15)'  =  2000.  Thus,  1.15'  —  2 ,t  =  log2/log  1.15  ^  4.9595  years. 

36.  Rewriting  fk+l  =  fk  +  /*_ i  as  /*_)  =  fk+l  -  fk,  we  see  that  /o  =  fz  -  f\  =  1  -  1  =  0.  Similarly,  /_ i  =  1, 

f—2  =  -1,  /- 3  =  2,  f—4  =  -3,  /_ 5  =  5,  /— 6  =  -8.  In  general,  /_„  =  (-I)"*1/™- 

37.  The  Fibonacci  sequence  is  the  sequence  1,  1, 2,  3,  5,  8 . . .  ,  each  term  after  the  first  two  being  the  sum  of  the 
two  previous  terms.  For  n  =  1,  /2/1  =  1(1)  =  l1  =  f2.  So  the  assertion  is  correct  for  n  =  1.  Let  k  >  1 
and  assume  the  assertion  is  correct  for  n  =  k:  that  is,  assume  that  fk+ifk  =  E?=i  f}-  ^e  wish  to  prove  the 
assertion  is  correct  for  n  =  k  +  I;  that  is,  we  wish  to  prove  that  fk+2fk+\  =  Xa=i  / } ■  But  fk+2  =  fk+ 1  +  fk>  s°. 

fk+2fk+\  =  (fk+ 1  +  /*)/*+ 1  =  fk+i  +  fk+ifk  =  fk+]  +  Ef=i  /,2  (using  the  induction  hypothesis)  =  £*=/  f?  as 
desired.  By  the  Principle  of  Mathematical  Induction,  we  conclude  that  the  assertion  is  correct  for  all  n  >  1. 

40.  The  induction  hypothesis  is  valid  only  for  n  in  the  interval  3  <n  <  k  because  all  integers  in  this  problem  are  at  least 
as  large  as  «o  =  3.  But  the  induction  hypothesis  is  applied  to  the  integer  k  —  2;  this  is  not  valid  if  k  =  4. 

42.  (a)  ai  =  1,  02  =  2  (2  and  1  +  1),  03  =  4,  as  given.  We  can  write  4  =  4  =  3+1  =  1+  3  =  2+1  +  1  =  14-2+1  = 

l  +  l+  2  =  2  +  2=  l  +  l  +  l  +  l  so  04  =  8.  Careful  counting  shows  that  05  =  16. 

44.  (a)  an  =  2an-\.  The  sequence  is  the  sequence  2, 4,  8, . . .  of  powers  of  2. 

47.  (b)  f\\)  =  /(l)  =  [4(1)  -  l]/3  =  1.  /2(  1)  =  /o/(l)  =  /(/(I))  =  /(l)  =  1.  This  continues:  /”( 1)  =  1  for  all  n. 

Exercises  5.3 

1.  an  =  3" 

3.  an  =  -5(3")  +6n(3") 

5-  an  =  ^-4+^n-^~4-Vr5)n 

7.  an  =  — 2(— 1)"  +  2(3")  =  2[(— 1)',+1  +  3"] 

10.  (a)  an  =  1(3")  +  7-5)";  (b)  an  =  2(3")  +  (-5)"  -  2. 

13.  (a)  an  =  4";  (b)  an  =  2(8")  -  4". 

(c)  For  n  =  0,  2(8°)  —  4°  =  2  —  1  =  1,  as  required. 

Let  k  >  0  and  assume  that  ak  =  2(8A)  —  Ak .  We  wish  to  prove  that  Ojt+i  =  2(8*+1)  —  Ak+] .  Now 
ak+i  =  4 ak  +  8i+1  =  4[2(8*)  —  4*]  +  8*+l  (by  the  induction  hypothesis) 

=  _  4*+'  _|_  8i+I  :  2(8t+1 )  —  4*+1 

as  desired.  By  the  Principle  of  Mathematical  Induction,  the  result  is  true  for  all  n  >  0. 

16.  (a)  a„  =  4"  +  3;  (b)  a„  =  10(4")  -  (3n  +  6)2". 

18.  The  nth  term  of  the  Fibonacci  sequence  is  o„_  1  =  —  75 ( Thus,  \an  1  -  -^(1+2V^)"I  = 

l^(1~2V^)"l-  Now  |-E(E^S)n|  <  -E  <  since  <  E  so  l«n-i  —  ( 1  )" I  <  5-  Remembering  that 
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an- 1  is  an  integer,  the  result  follows  from  the  fact  that  there  is  precisely  one  integer  within  j  of  any  given  real 
number. 

22.  (a)  Since  x  is  a  root  of  the  characteristic  polynomial,  x2  =  rx  +  s.  Hence,  ran-\  +  san- 2  =  r(cxn~l)  +  s(cxn~ 2)  = 

cxn~2(rx  +  s)  =  cxn~2(x2)  =  cx”  —  an. 

(b)  We  know  that  pn  =  rpn_\  +  spn-2  and  qn  =  rqn-\  +  sqn- 2.  Thus,  pn  +  qn  =  rp„-  \  +  sp„-2  +  rqn-\  +  sq„~2  = 
r(p„-i  +q,,-\)  +s(pn-2  +  qn-i). 

(c)  If  x\  and  x2  are  the  characteristic  roots,  part  (a)  tells  us  that  c\x"  and  cjx'^  both  satisfy  the  recurrence  relation  while 
part  (b)  says  that  cixf  +  c2x%  is  also  a  solution.  If  either  x\  or  x2  is  0,  one  initial  condition  determines  the  single 
unknown  constant.  Otherwise,  two  initial  conditions  will  determine  c\  and  c2  for  they  determine  two  linear  equations 
in  the  unknowns  c\,c2  which,  because  x\  /  x2,  must  have  a  unique  solution. 

23.  (a)  The  characteristic  polynomial  is  x2  —  rx  —  s  with  characteristic  roots  x  =  f'±  2  +4'  •  Since  there  is  only  one  root, 

we  must  have  r 2  =  — in  which  case  x  =  rj 2.  Hence,  r  =  2x  and  s  =  —r2/4  =  — 4a:2/4  =  — jc2. 

Exercises  5.4 

1.  (a)  4,  -12,  9,  0,  0, ... ;  (c)  1,  -6,  27,  -108, ...,(«  +  l)(-3)", .... 

2.  (a)  1  +  2*  +  5a:2;  (f)  - — — j. 

3.  an  =  2n 

5.  an  =  —2"  +2-3" 

7.  an  =  2n+l  +  3 

9.  (a)  an  =  2(— 5)” 

11.  a„  =  -i  +  (-i)(-l)"  +  f(2”) 

12.  a„  =  i(2n  +  l)  +  |(-l)n 

Exercises  6.1 

1.  Let  B  be  the  set  of  connoisseurs  of  Canadian  bacon,  and  A  the  set  of  those  who  like  anchovies. 

(a)  |BUA|  =  \B\  +  |A|  —  |SnA|  =  10  +  7  —  6=  11;  (b)  |B  \  A|  =  |B|  -  \B  (T  A|  =  4; 

(c)  \B  ©  A|  =  \B  U  A|  -  \B  fl  A|  =  5;  (d)  |t/|  -  \B  U  A|  =  4. 

4.  Let  A  be  the  set  of  people  with  undergraduate  degrees  in  arts,  S  those  with  undergraduate  degrees  in  science,  and  G 
those  with  graduate  degrees. 

(a)  |A  U  S  U  G|  =  300;  (b)  |S  \  (A  U  G|  =  35. 

7.  Let  D  be  the  set  of  delegates  who  voted  to  decrease  the  deficit,  E  be  the  set  of  delegates  who  voted  in  favor  of  the 
motion  concerning  environmental  issues,  and  T  be  the  set  of  delegates  who  voted  in  favor  of  not  increasing  taxes. 

(a)  |(£>UEUr)c|  =200;  (b)  \T  \  (D  U  £)|  =  316. 

9.  Let  O,  A,  G,  and  C  denote  the  sets  of  people  who  bought  orange  juice,  apple  juice,  grapefruit  juice,  and  citrus  punch, 
respectively. 

(a)  |onAncnc|  =  i 


10.  (a)  75  -  (4(28)  -  6(12)  +  4(5)  -  1)  =  75  -  59  =  16 
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11.  (a)  Let  A  and  B  be  the  set  of  integers  between  1  and  500  which  are  divisible  by  3  and  5,  respectively.  The  question  asks 
for  | A  U  B |.  This  number  is  |A|  +  |B|  -  |A  n  B\  =  L^J  =  233. 

(b)  The  question  asks  for  |A  \  (B  U  C)|  =66. 

13.  Let  A,  B,  and  C  be  the  sets  of  integers  between  1  and  250  which  are  divisible  by  4,  by  6,  and  by  15,  respectively. 
We  want  |AUBUC|  =91. 

14.  (a)  Let  A,  B,  C,  D  be  the  sets  of  integers  between  1  and  1000  (inclusive)  which  are  divisible  by  2,  by  3,  by  5,  and  by 

7,  respectively.  We  want  |A  U  B  U  C  U  D\c  =  1000  —  772  =  228. 

17.  Let  A,  B,  C,  D,  E,  F  be  the  sets  of  natural  numbers  between  1  and  200  (exclusive)  which  are  not  prime  and  divisible 
by  2,  3,  5,  7,  11,  and  13,  respectively.  Let  A,  B,  C,  D,  E,  F  be  the  sets  of  natural  numbers  between  1  and  200 
(exclusive)  which  are  not  prime  and  divisible  by  2,  3,  5,  7,  11,  and  13,  respectively.  |A  U  B  U  C  U  D  U  E  U  F\  =  152, 
so  the  number  of  primes  less  than  200  is  198  —  152  =  46. 

18.  Write  a  =  qb  +  r  with  0  <  r  <  b.  Then  |_f  J  =  q.  Clearly  the  positive  integers  b,  2b,  ...  ,  qb  are  all  less  than  or 
equal  to  a  and  are  all  divisible  by  b.  On  the  other  hand,  if  sb  <  a,  then  s  must  belong  to  the  set  {1, 2, . . .  ,q).  Hence, 
there  are  exactly  q  such  natural  numbers,  as  required. 

22.  (a)  | (A  ®  B)  n  C\  =  |((A  \  B)  U  (B  \  A))  n  Cj  =  [((A  \  B)  n  C)  U  ((B  \  A)  n  C)\ 

=  I  (A  O  C)  \  B\  +  |(fl  n  C)  \  A|  since  (A  \  5)  n  (5  \  A)  =  0 
=  |A  nc|  -  | a  n  s  nci  +  \b  nc|  -  |ao  snci 
=  |Anc|  +  |5nc|  -2|Ansnc| 


Exercises  6.2 

1.  (a)  12;  (b)  7. 

3.  10  x  9  x  8  =  720 

5.  (a)  9  x  26  x  26  x  26  x  10  x  10  x  10  =  158,184,000  (b)  9  x  26  x  105  =  23,400,000 

(c)  158,184,000  +  23,400,000=  181,584,000 
7.  (a)  13  x  6  x  2  x  4  =  624;  (b)  13  +  4  =  17;  (c)  25. 

9.  (a)  4x4=16;  (b)  4  x  4  x  4  =  64;  (c)  16  +  64  =  80. 

11.  60  x  60  x  60  =  216,000 

13.  500  +  (500)  (499)  +  (500)(499)(498)  =  1.24501  x  108 

16.  (a)  52  x  3  x  2  x  1  =  312 


Total 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

No.  of  ways 

1 

2 

3 

4 

5 

6 

5 

4 

3 

2 

1 

20.  (a)  HHHH,  HHHT,  HHTH,  HTHH,  THHH,  HHTT,  HTHT,  HTTH,  THHT,  THTH,  TTHH,  HTTT,  THTT,  TTHT,  TTTH,  TTTT.  There  are 
16  possibilities  in  all. 

22.  (a)  26  x  26  x  26  x  10  x  9  x  8  =  12,654,720 

25.  (a)  The  easiest  way  to  see  that  there  are  2"  functions  from  A  to  5  is  to  note  that  {a,  |  (a,-,  0)  e  /}  4*  /  is  a  one-to-one 
correspondence  between  the  (2")  subsets  of  A  and  the  set  of  functions  /;  A  -*■  B. 

(b)  Of  the  2"  functions  A  ->  5,  precisely  two  are  not  onto, 

l(a\ ,  0),  (a2, 0), . . .  ,  (a„,  0)}  and  {(ai,  1),  (a2,  1), . . .  ,  (a„,  1)}. 


Thus,  the  number  of  onto  functions  is  2”  —  2  as  claimed. 
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Exercises  6.3 

1.  There  are  seven  days  of  the  week.  Hence,  we  wish  to  put  eight  objects  (people)  into  seven  boxes  (days).  By  the 
Pigeon-Hole  Principle,  some  day  must  have  at  least  two  people  corresponding  to  it. 


6.  (a)  Any  given  processor  is  connected  to  at  least  one  of  19  other  processors.  There  are  20  processors,  so  the  Pigeon-Hole 
Principle  assures  us  that  at  least  two  are  connected  to  the  same  number. 

(b)  The  result  is  still  true,  though  for  a  somewhat  more  subtle  reason.  The  number  of  processors  to  which  a  given 
processor  is  connected  is  in  the  range  0-19  (inclusive).  On  the  other  hand,  if  0  occurs  (that  is,  some  processor  is 
not  connected  to  any  other),  then  19  cannot.  So,  as  in  part  (a),  there  are  at  most  19  possibilities  for  the  number  of 
processors  connected  to  a  given  processor.  By  the  Pigeon-Hole  Principle,  at  least  two  processors  are  connected  to  the 
same  number. 

9.  Mimicking  the  solution  to  Problem  14,  we  let  a,  be  the  number  of  sets  Martina  plays  on  day  i.  Then  we  have 

1  <  a\  <  ai+a2  <  ■■■  <  a\+a2-\ - 1-077  <  132.  Now  it  is  not  true  that  the  only  integer  in  the  range  1-77 

which  is  divisible  by  21  is  21  itself.  Thus,  while  two  of  these  sums  must  leave  the  same  remainder  upon  division  by 
21,  we  can  conclude  this  time  only  that  the  difference  of  these  sums  is  divisible  by  21,  not  that  the  difference  is  21. 

10.  Suppose  Brad  and  his  mother  drive  a\  quarter  hours  on  day  one,  02  quarter  hours  on  day  two,  and  so  on,  035  quarter 
hours  on  day  35.  The  list 

ai,  a i+a2,  01+02  +  03-  •  • .  ,  01  +  02  +  •  ■  ■  +  035 

consists  of  35  natural  numbers  between  1  and  15  x  4  =  60  (15  hours  =  60  quarter  hours)  and  so,  if  any  of  them 

is  divisible  by  35  (8|  hrs.  =  35  quarter  hrs.),  it  is  35  and  we  are  done.  Otherwise,  each  of  these  numbers  leaves  a 

remainder  between  1  and  34  upon  division  by  35,  and  so  two  leave  the  same  remainder.  Suppose 

a\  +  fl2  +  ■  ■  ■  +  os  =  35q  +  r 
0\  +  02  +  •  ••  +  Of  =  35q  +  r . 

Then  (assuming  s  >  t)  at+ 1  +  a, +2  +  •  •  •  +  as  is  divisible  by  and,  hence,  equal  to  35;  that  is,  Brad  drives  8|  hours 
on  days  t  +  1,  t  +  2 . s. 

13.  Let  box  1  correspond  to  days  1,  2,  and  3,  box  2  to  days  4,  5,  and  6,  box  3  to  days  7,  8,  and  9,  and  box  4  to  days  10, 
11,  and  12.  Putting  each  bill  into  the  box  corresponding  to  the  day  it  was  mailed,  we  see  that  one  box  must  contain 
at  least  f-^1  =49  bills,  by  the  general  form  of  the  Pigeon-Hole  Principle.  This  gives  the  desired  result. 

16.  Divide  the  rectangle  into  25  rectangles,  each  3  x  4.  By  the  Pigeon-Hole  Principle,  at  least  two  points  are  within,  or 

on  the  boundary,  of  one  of  these  smaller  rectangles.  This  gives  the  result  since  the  maximum  distance  between  two 
points  of  such  a  rectangle  is  V32  +  42  =  5. 

20.  As  suggested,  we  consider  the  sequence  of  natural  numbers  M\  =  3,  M2  =  33,  . . .  ,  Mn  =  33  •  •  •  3  (n  3’s).  If  one 
of  these  is  divisible  by  n,  we  have  the  desired  result.  Otherwise,  at  least  two  of  these  natural  numbers  leave  the 
same  (nonzero)  remainder  upon  division  by  n;  that  is,  for  some  i  /  j,  M,  =  qn  +  r,  Mj  =  q'n  +  r  for  the  same  r. 
Assuming,  without  loss  of  generality,  that  i  >  j,  Mi  —  Mj  =  (q  —  q')n  is  divisible  by  n  and  has  only  3’s  and  0’s  in 
its  base  10  representation. 

22.  After  finitely  many  steps  in  the  long  division  process,  we  will  be  adjoining  a  “0”  at  each  stage  to  the  remainder  from 
the  division  at  the  previous  stage.  Since  only  finitely  many  remainders  are  possible  (each  remainder  is  less  than  the 
divisor),  two  must  be  the  same  and  from  the  first  repetition,  all  steps  (and  corresponding  decimal  places)  will  repeat. 

24.  (a)  Suppose  no  two  people  have  the  same  age  so  that  there  are  at  least  51  different  ages  in  the  room.  Let  box  1  correspond 
to  integers  1  and  2,  box  2  correspond  to  integers  3  and  4,  . . .  ,  box  50  to  99  and  100.  Assigning  ages  to  boxes,  by 
the  Pigeon-Hole  Principle,  some  box  contains  two  ages.  This  says  that  ages  of  two  people  are  consecutive  integers, 
as  required. 

26.  (a)  Focus  on  one  of  the  ten  people,  say  Hilda.  Suppose  first  that  at  least  four  of  the  remaining  nine  people  are  strangers 
to  Hilda.  If  these  four  are  mutual  friends  we  are  done;  otherwise  two  of  these  are  strangers  and,  hence,  together  with 


Section  7.1  S-31 


Hilda,  we  have  three  mutual  strangers  and  again  we  are  done.  Thus,  we  suppose  that  less  than  four  people  are  strangers 
to  Hilda;  so  Hilda  has  at  least  six  friends  in  the  group.  By  Problem  17,  this  group  of  six  contains  either  three  mutual 
strangers  (in  which  case  we  are  done)  or  three  mutual  friends  who,  together  with  Hilda,  give  four  mutual  friends. 

Exercises  7.1 

1.  13  -12-  11-6=  P(13,  8)  —  51,891,840 

3.  10  ■  9  •  8  ■  7  =  PflO,  4)  =  5040 

5.  3x5!  =  360 

7.  (b)  10!4!2! 

9.  (a)  3!  x  4!  =  144;  (b)  P( 4,  2)  x  4!  =  288. 

10.  (a)  m ! 

(b)  If  m  >  n,  there  are  0  injective  functions  X  — >•  Y. 

If  m  <  n,  the  number  is  n(n  —  1  )(n  —  2)  •  •  •  (n  —  m  +  1)  =  P(n,  m). 

m  factors 

12.  Let  N  be  the  set  of  lines  in  which  the  Noseworthys  are  beside  each  other  and  A  the  set  of  lines  in  which  the  Abbotts 
are  beside  each  other. 

(a)  |7V|  =  2  x  3!  =  12;  (b)  |AC|  =  24  -  12  =  12;  (c)  | N  n  A\  =  2  x  2  x  2  =  8; 

(d)  4;  (e)  \N  U  A\  fc  |7V|  4-  |  A\  -  \N  n  A|  =  12  +  12  -  8  =  16; 

(f)  \N  ©  A\  =  |A|  +  |A|  -  2\N  n  A\  =  12  +  12  -  2(8)  =  8. 

14.  (a)  2x5!  =  240 

16.  (a)  9x9x8x7x6x5x4  =  9x  P( 9,  6)  =  9  x  60, 480  =  544,  320. 

17.  (b)  7  x  6  x  P(7,  5)  =  42  x  2520  =  105,840;  (d)  P(l,  7)  =  5040. 

Exercises  7.2 

1-  W  (2X2H2)  =  8820:  (t»  (3H2X2)  +  (2X3X2)  +  @©(3)  =  44'  10°- 

3.  (i/)  =  $  =  792 

5.  (a)  (*55)( j0);  (b)  ('/MS);  (O  3('75)(I). 

6-  (X0)  +  (‘t°)  +  (*2°)  +  C”)  =  1^6 

8.  (a)  4(‘53)  =  5148 

10.  (a)  210;  (b)  (‘4°)=210. 

11.  (a)  (255)  =  53,130;  (b)  (l3)  =  3003;  (c)  (3X2°)  =  20,475. 

13.  (a)  ®;  (b,  (>“);  (c)  ('“)  x  (>“). 

(d)  The  answer  is  ('“)  x  (”)  x  Q.  This  is  also  (l™,)(20)Q.  (Why?) 

17.  (a)  ('«)  =  45;  (c)  (3)  +  (3)(3)  =  10  +  25  =  35. 

19.  (a)  C2°)  =  45 

20.  (a)  ('32)  =  220 
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21.  (a)  (®)  -  8  =  28  -  8  =  20 

23.  The  product  of  n  consecutive  natural  numbers  starting  at  m  is 

m(m  +  i )(m  +2)  ■  ■  ■  (m  +  n  —  1)  =  P{m  +  n  —  1,  n)  = 

So  fflfm+l)(m+l,)  -(m+'|-|)  is  the  integer  (m+n"_l). 

Exercises  7.3 

i-  rtn = (30) 

3.  (6+5_l)  =  (5°)  =  252 

4-  (To"’)  =  Co)  -286 

6.  (a)  (287)  =  2,220,075 

7.  (c)  $  =  79,833,600 

8.  (a)  (l2)(5)(®)(2)  =  369,600;  (b)  ^0  =  15,400. 

12  tat  fMWSOW40!  -  &0! 

W  VlOAloAltV  -  10!  10!  I0!30! 

14-  (1,3)(,22)(,50)=niW=  216,216 

15.  (a)  =  302,400 

16  -  M 

10!5!7!8! 

19.  (a)  Given  any  solution  (jcj ,  x2>  *3,  X4)  of  (*)  with  x\  >  8,  then  (,vi  —  8,  x2,  X3,  X4)  is  a  solution  of  (**)  with  x\  — 
8,  X2,  X3,  X4  nonnegative  integers.  Conversely,  given  a  solution  (,vi ,  x2,  X3,  X4)  of  (**)  in  nonnegative  integers,  then 
(X|  +  8,  x2,  X3,  X4)  is  a  solution  to  (*)  with  xi  +  8  >  8.  The  number  of  solutions  to  (**)  in  nonnegative  integers 
corresponds  to  the  number  of  ways  to  put  1 3  identical  marbles  into  four  boxes  labeled  x\ ,  x2.  *3,  X4.  This  number  is 

(it)  =  56°- 

Exercises  7.4 

1.  D6  =  265;  D7  =  1854;  D8  =  14,833. 

3.  D,,  =  111(1  +  + 

5.  (b)  7!  -  Dv 

6.  (c)  2OD19 

7.  (a)  9!  -  [5(8!)  -  10(7!)  +  10(6!)  -  5(5!)  +  4!]  =  205,056 
10.  (a)  Using  Proposition  7.4.2, 

D„  =  «!  —  m!  +  [ra(rc  —  1)(«  —  2)  ■  •  •  3]  —  [«(«  —  l)(ra  —  2)  •  ■  ■  4]  +  •  •  •  +  (— l)"~'w  +  (— 1 )" 

=  n\  (n  -  l)(n  —  2)  ■  ■  ■  3  —  (n  -  1  ){n  -  2)  •  •  ■  4  +  •  •  •  +  (-  1 )'!~‘ (n  -  1)  ]  +  (-1)"  =  (-1)"  (mod  n). 

(b)  This  follows  immediately  from  part  (a). 
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11.  (a)  (n  -  1  )(£>„_,  +  D„_2) 

=  +  + 

r  i  i  „  ,  i 

+(n  -2)!  1 - + - +  (-l)"~2 - 

L  1!  2!  (n  —  2) ! 

r  1  1  I  1  "|  r  1  1 

=  n(n  -  1)!  i - + - +  (-1)” _1 - —  -  (w  -  1)  I - + 

L  1!  2!  (n-  1)!  J  L  1! 


lT  +  2!  +(“*)■ 


n  — J 


( n  -  1)! 


t] 


+(n-  l)![l  _i  +  i_. ..  +  (_!) 


;] 


(n  —  2) ! 

/II  „  1  \  „  I 

=  «!(1  -  —  +  — - h  (-1)"  — )  —  «!(-l)”—  -  («  -  1)!(-D 

V  1 !  2 !  n !  /  n\ 

=  A, -(-!)" -(-I)"-1  =Dn 


n~  1 


(«-D! 


Exercises  7.5 


1.  (a) 
(b) 

3.  (b) 

4. 

5.  (c) 

6.  (a) 
(b) 

8. 

9. 

11. 

14.  (a) 


16. 


(x  +  y)6  =  x6  +  6x5y  +  15x4y2  +  20x3y3  +  I5x2y4  +  6xv5  +  y6 

(2x  4-  3y)6  =  64x6  +  576x5y  +  2 1 60x4v2  +  4320x3y3  4-  4860x2y4  +  2916xy5  4-  729y6 

(2x3  -x2)8  =  256x  24  -  1024x  23  4-  1792x22  -  1792x  21  4-  1 120x20  -  448x19  4-  1 1 2x 1 8  -  16x17  +x16 

(12)(x3)9(— 2y2)3  =  —  1 760x27  y6 

The  seventh  term  is  (26°)x14y6  =  38,760xl4y6.  The  fifteenth  term  is  (^)x6y 14  =  38,760x6y14. 

17  terms 

There  is  a  middle  term  (since  16  is  even).  This  term  is  ('86)(2x)8(—  v)8  =  3,294,720x8y8. 

The  required  term  is  ('7°)(4x)3(5y)7  =  120(64x3)(78125y7).  The  coefficient  is  600.000,000. 

The  term  in  question  is  (’47)xl3(3y2)4  =  2380x13(8  tv8).  The  coefficient  is  2380(81)  =  192,780. 

The  general  term  is  (lii8)^-^  (x2/  =  (^8)3l8_<:x3/;_18.  We  want  3k  —  1 8  =  27,  so  k  =  15  and  the  coefficient  is 

(j8)33  =  ( 8 1 6)(27)  =  22,032. 

If  n  =  0,  (1  +  V2)°  =1  =  1+  0V2.  so  x0  =  I,  y«  =  0. 

If  n  =  1,  (1  +  s/2)1  =  1  4-  \/2,  so  x ]  =  1 ,  yi  =  1. 

If  n  =  2,  (1  +  v/2)2  =  1  +  2s/2  +  2  =  3  +  2\fl  so  x2  =  3,  y2  =  2. 

If  n  =  5,  (1  +  V2)5  =  1  +  5V2  +  10(v/2)2  +  lOf+'iZ)3  +  5(V2)4  +  (V2)5 

=  1  +  5V2  +  20  +  20V2  +  20  +  4\/2  =  41  +  29V2 
and  so  X5  =  4 1 ,  vs  =  29. 

With  the  given  interpretation,  row  n  of  Pascal's  triangle  becomes  the  single  number 

l°'(o)  +  lo'"(")  +  -  +  10(»-i)  +  1  =  S 


So  we  are  trying  to  prove  that  J2"=o  *0"  r(") 
"  (n' 

•r. 


11 n  =  (io+ 1)"  =  J2  (')(ioy'~r(i)r. 


r=  0 


1 1".  This  follows  immediately  from  the  Binomial  Theorem  since 
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20. 


22.  (a) 


Consider  (x  -  y)n  =  EL o  C)x"-‘(-y)*  =  ELo  ©*"-*  (-!)*/• 

Setting  x  =  y  =  1,  we  obtain  0  =  (1  -  1)"  =  ELo  =  ELo  ©(-0* 


n(n-l)(n-2)  n(n  -  1)(«  -  2)(n  -3) 

:  n  +  n(n  -  1)  +  - - - - 1 - — - E - 1-  n(n 

,  ,  ,  „  ,  (« -!)(«- 2)  ,  (n  —  l)(n  —  2)(n  —  3)  , 

1  +  (n  -  1)  H - - - 1 - — - 1 - h  n  -  1 


=  n[ 


2 

n  —  1 

2 


+ 


w  —  1 


3! 

+  •••  +  1 


=  n2”-‘ 


using  the  result  of  Problem  1 8  in  the  last  step. 


—  1 )  +  n 

+  >] 


Exercises  8.1 


1.  To  find  the  mid-point  of  a  line  segment  AB,  choose  a  radius  r  (for  example,  r  =  |Afi|)  such  that  the  arcs  with 
centers  A  and  B  and  radius  r  meet  at  two  distinct  points,  P  and  Q.  The  point  of  intersection,  M ,  of  AB  and  PQ  is 
the  required  mid-point.  ■■,  / 

To  see  why  this  works,  consider  the  diagram  to  the  right,  in  which  the  labels  are 
given  as  before  and  line  segments  AP,  BP,  AQ,  BQ  are  joined.  Note  that 
LP  AB  =  LPBA  since  A  APB  is  isosceles.  Since  ABPQ  is  congruent  to  A  APQ 

(three  pairs  of  sides  of  equal  length),  LBPQ  =  LAPQ.  Thus,  triangles  PAM  and  B 

PBM  are  congruent  (two  equal  pairs  of  sides  and  equal  contained  angles),  so 
\AM\  =  \BM\  as  required. 


3.  (a)  Let  A  be  any  point  on  l  and  let  r  =  \AP\.  Draw  the  circle  with  center  P  and  radius  r.  If  this  meets  l  in  only  the 
single  point  A,  then  PA  is  the  desired  perpendicular  to  i.  Otherwise,  the  circle  meets  i  in  two  points,  A  and  B.  Draw 
arcs  with  centers  A  and  B  and  radius  r  meeting  in  P  and  Q.  Then  PQ  is  the  desired  perpendicular  to  l. 

To  see  this,  consider  the  diagram  at  the  right,  with  M  the  point  of  intersection  of 
PQ  and  t.  Since  triangles  P AQ  and  PBQ  are  congruent,  LAPM  =  LBPM. 

Then  it  follows  that  AAPM  =  A B PM  and  thus,  LAMP  =  LBMP .  Since  these 
angles  have  sum  180°,  each  is  a  right  angle. 


Q 

X 

The  basic  idea  behind  this  is  the  simple  fact  that  xy  —  (— )(2y).  When  x  is  even,  the  product  xy  can  be  equally  well 

determined  as  the  product  of  x/2  and  2 y.  This  explains  why  one  crosses  out  a  line  where  the  first  term  is  even:  The 
product  of  the  two  numbers  in  a  crossed  out  line  equals  the  product  of  the  numbers  in  the  line  below. 
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When  x  is  odd,  however,  the  entry  below  x  is  while  the  entry  below  y  is  2y.  Their  product  is  (£yi)(2y)  =  xy—y, 
so  the  product  xy  is  the  product  of  the  numbers  in  the  line  below,  plus  y.  Note  that  y  is  the  number  in  the  right 
column  corresponding  to  the  odd  x  on  the  left.  Thus,  we  do  not  cross  out  lines  where  x  is  odd;  the  y  terms  must  be 
added  at  the  end  to  give  the  correct  product. 

7.  We  find  the  sum  of  the  numbers  and  divide  by  n.  Thus,  to  find  the  average  of  n  numbers  ai,  a2, . . .  ,  a„. 

Step  1.  set  S  =  0; 

Step  2.  for  i  =  1  to  n ,  replace  .S'  by  .S'  +  a,\ 

Step  3.  output 

8.  To  find  the  maximum  of  n  numbers  a\,  aj,  . , .  ,  an, 

Step  1.  let  M  —  a\\ 

Step  2.  for  i  =  2  to  n,  if  a,  >  M,  replace  M  by  a,-; 

Step  3.  output  M. 

The  value  of  M  output  in  Step  3  is  the  maximum  of  the  at. 

11.  If  x  <  a\,  we  wish  to  output  jc,  a [,  a2, . . .  ,  a„.  If  a\  <  x  <  02,  we  wish  to  output  ai,  x,  a2, . . .  ,  a„,  and  so  on.  If 
x  >  an ,  we  wish  to  output  a\,ct2, ...  ,  a„,  x.  Here  is  one  suitable  algorithm. 

To  insert  x  into  its  correct  position  in  the  ordered  list  a \  <  02  <  ■  ■  ■  <  an, 

Step  1.  set  F  =  0; 

Step  2.  for  i  —  1  to  n, 

if  jc  <  a,  and  F  =  0,  output  .r  and  set  F  —  1 ; 
else  output  a,-. 

Step  3.  if  F  —  0,  output  x. 

The  variable  F  introduced  here  is  a  flag ,  w  hose  purpose  is  to  tell  us  whether  or  not  x  has  been  output.  At  the  end  of 
the  loop  in  Step  2,  if  x  has  not  been  output  (that  is,  x  is  larger  than  all  the  a,  ’s),  then  we  will  know  this,  because  F 
will  not  have  changed  from  its  initial  value  of  0;  hence,  we  must  output  x  as  the  final  element.  Note  that  the  “and” 
in  Step  2  is  the  logical  "and”  introduced  in  Section  1.1.  We  output  x  and  set  F  =  1  only  if  both  x  <  a,  and  F  =  0 
are  true. 

13.  Here’s  the  idea.  Divide  a  by  2,  thereby  obtaining  a  =  2q\  +aj,  0  <  a\  <2.  Then  divide  q\  by  3,  obtaining 
<71  =  3*72  +  0.2,  0  <  U2  <  3.  Note  that,  at  this  point,  we  have  a  =  2(3^2  +  02)  +  a  1  =  3 !<72  +  2!fl2  +  «i-  Next,  divide 
q2  by  4,  obtaining  qj  —  4<?3  4-  03  and  a  =  4!^3  +  3\a$  +  2\ai  +  a\ .  Eventually  we  have 

a  =  n\q„- 1  +  an~\(n  -  1)!  H - b  a22!  +  a\, 

which,  with  a„  —  qn~\,  is  the  required  expression.  (Note  that  if  at  some  stage  £7*—  1  =  0,  we  have  a„  =  an-\  =  •  •  •  = 
ak  =  0,  but  the  procedure  is  still  valid.)  Here  is  the  algorithm  for  writing  a  =  a„n\  +a„-i(n  —  1)!  +  •  •  •  +022!  +a  1. 

Step  1.  Let  qq  —  a. 

Step  2.  For  i  =  1  to  n  —  1,  write  q-, -- \  =  (i  +  I  )q,  +  a,  with  0  <  a,  <  i  +  1. 

Step  3.  Let  a„  =  q„~  1 . 

15.  A  number  between  0  and  2"  —  1,  when  expressed  in  base  2,  is  a  string  of  at  most  n  0’s  and  l’s.  By  “padding”  with 
initial  0’s,  if  necessary,  we  can  assume  that  all  such  strings  have  length  exactly  n.  Then  such  a  string,  eie2  •  •  •  en,  each 
€j  =  0  or  1,  determines  the  subset  A  of  [a\,  a2, ...  ,an]  as  follows:  a,  e  A  -o-  e,-  =  1.  Thus,  the  string  00 ...  0  of  n 
0’s  corresponds  to  the  empty  set,  the  string  1 1  ...  1  of  n  l’s  to  the  entire  set  {a\ ,  a2,  ■  ■  ■  ,  a„],  the  string  1 1010 ...  0  to 
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the  subset  {at ,  a2,  <24),  and  so  on.  Enumerating  the  subsets  of  [a\, . . .  ,  a,,}  now  simply  amounts  to  listing  the  integers 
between  0  and  2"  —  1  in  base  2.  Here  is  an  algorithm. 

Step  1.  Let  M  =  0.  Output  the  words  “empty  set.” 

Step  2.  for  /  =  1  to  2"  —  1 

•  replace  M  by  M  +  1 ; 

•  write  M  =  eie2  ■  ■  ■  en  as  an  /; -digit  number  in  base  2; 

•  for  k  =  1  to  n,  if  e*  =  1,  output  ak- 

Each  value  of  i  in  Step  2  yields  one  subset  of  , a„ }  and,  when  Step  2  is  complete,  all  subsets  have  been 

output. 

16.  (a)  i.  S  =  1;  S  =  1  +  (— 3)(2)  =  -5;  S  =  -5  +  2(22)  =  -5  +  8  =  3. 

ii.  S  =  2;  S  =  —3  +  2(2)  =  1;  S  =  1  +  1(2)  =  3. 

17.  The  first  value  of  S  is  S  =  an . 

With  i  —  1  in  Step  2,  the  value  of  S  is  an-\  +  Sx  =  an-\  +  a„x. 

With  i  =  2  in  Step  2,  the  value  of  S  is  2  +  Sx  =  a„_ 2  +  (a„-i  +  anx)x  =  a„_2  +  a„_ ix  +  a„x 2. 

With  i  =  3  in  Step  2,  the  value  of  S  is  a„- 3  +  Sx  =  a„_ 3  +  an-2X  +  an~\x2  +  an x3. 

With  i  =  n,  the  value  of  S  is  +a„_(„_i)jc +a„_(„_2)Jt2  H - h a„xn  =  ao  +a\x  -\-a2x2  H - \-a„x"  as  desired. 


Exercises  8.2 


1.  f(n)  =  n  +  2n  +  n  —  1  +  20  =  4m  +  19 


3. 


Consider  the  division  of  a  number  a,  which  has  at  most  two 
digits,  by  a  single-digit  number  b,  where  a  <  10 b. 


q 

b  |  « 

qb 

X 


Since  the  quotient  q  has  just  one  digit,  the  division  counts  as  one  operation,  the  product  qb  counts  as  another,  and 
since  a  —  qb  <  b,  the  difference  a  —  qb  has  just  a  single  digit,  so  the  subtraction  requires  a  third  operation.  In  the 
division  of  an  n-digit  number  by  a  single-digit  number,  this  process  is  repeated  at  most  n  —  1  or  n  times,  depending 
on  whether  or  not  b  is  greater  than  the  first  digit  of  a.  So  at  most  3 n  operations  are  required. 


6.  (a)  With  reference  to  Horner’s  algorithm  as  described  in  (8.1.1),  each  iteration  of  Step  2  requires  two  operations.  Since 
Step  2  is  repeated  n  times,  this  method  requires  2 n  operations.  This  is  fewer  operations  than  are  needed  by  the 
obvious  method  of  polynomial  evaluation,  as  we  now  show. 


Assuming;  multiplications  to  compute  a,-  x' ,  1+2-1 - \-n  =  ^n(n  + 1)  multiplications  are  involved  in  the  evaluation 

n2  +  3  n 

— - - operations.  This 


n(n  +  1) 

of  ao  +  a\x  -t - 1-  anx" .  As  well,  n  additions  are  required,  for  a  total  of - 1-  n 


estimate  can  be  improved.  Problem  18  shows  that  x'  can  be  computed  with  approximately  log  i  multiplications.  So 
1  +  log  i  multiplications  are  required  for  a,x‘  and 


1  +  (1  +  log2)  +  (1  +  log 3)  H - 1-  (1  +  logn)  =  n  +  logn! 

for  the  polynomial.  Including  additions,  approximately  2 n  +  log  n !  operations  are  required. 

7.  (a)  Since  5 n  <  n 3  for  all  n  >  3,  we  can  take  c  =  1,  no  =  3,  or  c  =  5,  «o  =  1. 

(c)  For  n  >  1,  we  have  8n3  +  4 n2  +  5n  +  1  <  9 n4  +  18n2  +  24n  +  6  =  3(3n4  +  6 n2  +  8 n  +  2),  so  we  can  take  c  =  3, 
«0  =  L 

8.  (a)  By  Proposition  8.2.7,  for  instance,  5 n  x  n,  and  by  Proposition  8.2.8,  n  <  n3.  Thus,  5 n  <  n 3 . 

(c)  By  Proposition  8.2.7,  f(n)  x  n3  and  g(n)  x  n4.  By  Proposition  8.2.8,  n3  -<  u4,  so  f  g. 
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10.  (b)  Suppose  f  <  g  and  g  -<  h.  We  must  prove  f  <  h.  By  part  (a),  /  =  O(h),  so  it  remains  only  to  prove  that  h  7=  O(  f). 

Assume  to  the  contrary  that  h  —  0(/).  Then,  since  /  =  O(g),  using  part  (a)  again,  we  would  have  that  h  =  O(g), 
contradicting  g  <  h. 

11.  If  /  x  g,  then  /  =  O(g).  Since  g  <  h,  g  =  0(h)  and  so,  by  Exercise  10(a),  /  =  0(h).  To  obtain  f  <  h,  we 
have  now  to  prove  that  h  is  not  0(/).  But  if  h  =  O(f),  then,  because  /  =  0(g),  we  would  have  that  h  =  0(g), 
contradicting  g  <  h. 

12.  We  have  a"  <  bn  for  n  >  i  since  a  <  b,  so  a'1  =  0(bn).  We  must  prove  bn  ^  (D(an).  If  b"  =  0(an ),  then  there 
would  exist  a  constant  c  such  that  bn  <  can  for  all  suitably  large  n.  Dividing  by  a,  we  obtain  (|)"  <  c,  for  all 
suitably  large  n,  but  this  is  not  true  because  |  >  1  implies  that  (|)"  grows  without  bound  as  n  increases. 

13.  (a)  We  established  2”  <  n!  for  n  >  4  in  Problem  6  of  Chapter  5.  Thus,  2”  =  0(n!)  (c  =  1,  no  =  4).  On  the  other  hand, 

n  \  ^  0(2"),  for  consider 

n !  1  2  3  4  n 

¥  ~  2222"' 2' 

The  product  of  the  first  three  factors  on  the  right  is  |  and  each  of  the  remaining  n  —  3  terms  is  bigger  than  2.  So 
>  |(2"~3)  for  n  >  3.  If  n!  =  0(2”),  then  n!  <  c2”  for  some  constant  c  and  all  suitably  large  n  and  so  ^  <  c. 
We  have  shown  this  is  not  possible.  Since  2"  =  0(n\)  but  n\  ^  0(2”),  we  have  2"  n\  as  required. 

15.  For  any  n  >  1,  \kf(n)\  =  |it||/(n)|.  Taking  «o  =  1  and  c  =  |1-|  in  Definition  8.2.1,  we  see  that  kf  is  0(/).  Also, 
1/001  =  i  .ha,  /  =  0(t/)  L  so  /X  tf. 

18.  Since  log2  n  =  0(n)  by  Proposition  8.2.9  and  n  =  0(n),  we  have  n  log2  n  =  0(n2)  by  Proposition  8.2.3.  Thus, 
it  remains  only  to  show  that  n2  ^  0(n  log2  n).  Assume  to  the  contrary  that  for  some  positive  constant  c,  we  have 
n 2  <  cn  log2  n  for  sufficiently  large  n.  Then,  for  large  n,  we  would  have  n  <  clog2n.  This  says  n  —  0(log2  n ), 
contradicting  Proposition  8.2.9. 

19.  (b)  /  x  n2 

20.  (a)  n5 

21.  Since  log an  =  (loga  fi)(logfc  n),  we  have  |  loga  n \  =  1  loga  b\  \  \ogbn\  for  all  n  >  1.  With  c  =  log ab  and  no  =  1  in 
Definition  8.2.1  we  see  that  loga  n  is  0(logb  n).  By  symmetry,  logb  n  =  0( logfl  n);  hence,  loga  n  x  logb  n. 

23.  Since  n!  =  n(n  —  l)(n  —  2)  •  •  -3  •  2  •  1  <  n”,  logn!  <  nlogn.  With  c  =  1,  no  =  1  >n  Definition  8.2.1,  we  see  that 
logn!  =  0 (nlogn)  as  required. 

Exercises  8.3 

1.  (a)  Since  n  =  9  ^  1,  we  set  m  =  LjJ  =  4.  Since  2  =  x  =  04  =  4  is  false  and  x  <  <24  is  true,  we  set  n  =  m  =  4  and 
change  the  list  to  1,2,  3, 4. 

Since  4  =  n  1,  we  set  m  =  |_ 2J  =2.  Since  2  =  r  =  we  output  “true”  and  stop. 

This  search  required  three  comparisons  of  2  with  an  element  in  the  list — x  =  04,  x  <  04  and  x  =  a2-  A  linear  search 
would  have  used  two  comparisons. 

3.  (a)  To  find  the  complement  of  A  =  [a\,  a2, . . .  ,  a*}  with  respect  to  U  =  {1,  2, . . .  ,  100), 

Step  1.  For  i  =  1  to  100,  search  A  for  i  and  if  /  is  not  found,  output  i. 

The  output  numbers  are  the  elements  of  Ac. 
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4. 


2. 3.4.5  2,4,6,8,10 

3.4.5  2,4,6,8,10 

3.4.5  4,6,8,10 

4.5  4,6,8,10 

5  4,6,8,10 

5  6,  8,  10 

6,  8,  10 


1 

1,2 
1,2,2 
1,2, 2,3 
1,2, 2, 3, 4 

1.2.2,  3,4,4 

1.2,  2,  3,  4,  4,  5 

1,2,  2,  3,  4,  4,  5,  6,  8,  10 


6.  (a)  Here’s  the  bubble  sort: 


The  algorithm  required  seven  comparisons. 


k  =  5:  3J,  7,2,  5,4->  1,3/7,  2,5,4-+  1,3,  1/2,  5, 4^  1,3,  2,7/1,  4^  1,3, 2,5, 7, 4 

k  =  4:  M,  2,5,4,  7-^  l,U5,4,7->  1,2,  3/5,  4,  7^  1,2,  3,5,4,  7 

k  =  3:  /_2,  3,4,  5,7  ->  1,2,  3, 4,  5, 7  1,2,  3, 4,  5,  7 

k  =  2:  /_2,  3,  4, 5,  7  — >  1,2,  3,4,  5,7 

*  =  1:  J/2,  3,  4,  5,  7  ^  1,2,  3,4,  5,7. 

This  required  a  total  of5  +  4  +  3  +  2+  l  =  15  comparisons. 

(b)  Here’s  the  merge  sort. 

Step  2.  I;  3;  2;  7;  4;  5 

Step  3.  1.3;  2,7;  4,5 

Step  3.  1,2,  3,  7;  4,5 

Step  3.  1,2,  3,  4,  5,  7 

Merging  two  lists  of  length  1  to  one  of  length  2  requires  1  +  1  —  1  =  1  comparison.  Thus,  the  initial  merging  of  six 
lists  of  length  1  to  three  lists  of  length  2  requires  1  +  1  +  1  =  3  comparisons.  Merging  three  lists  of  length  2  to  one 
of  length  2  and  one  of  length  2  requires  2  +  2—1  =3  comparisons.  The  final  merging  of  lists  of  lengths  4  and  2 
requires  4  +  2  —  I  =  5  comparisons,  for  a  total  of  3  +  3  +  5  =  11  comparisons. 

8.  Sort  the  list  into  increasing  order  a\  <  ci2  <  •  •  ■  <  a„.  If  n  =  2m  +  1  is  odd,  the  median  is  am;  if  n  =  2m  is  even, 
the  median  is  \(am  +  am+ 1).  Since  determining  the  parity  of  n  adds  only  another  operation,  the  complexity  of  this 
algorithm  is  the  complexity  of  the  sort,  at  best  0(n  log  n). 

9.  (a)  Here's  the  bubble  sort. 


k  =  7:  a,  b,  c,  d,  u,  v,  w,  x  — >•  a,  b ,  c,  d,  u,  v,  w,  x  —*■  a ,  c,  b,  d,  u,  v,  w,  x  — ►  a,  c,  d,  b,  u,  v,  w,  x 

— >  a,  c,  d,  u ,  b ,  v,  w,  x  —>  a,  c,  d ,  u,  b ,  v,  w,  x  —>■  a,  c,  d,  m,  b,  v,  w,  x 

k  =  6:  a,  c,  d,  m,  d,  u,  .v,  w  —>■  a,  c,  d,  u,  b,  v,  x,  w  — >  a,  d ,  c,  «,  fo,  v,  x,  w  —>  a,  <7,  u,  c,  b,  v,  x ,  w 

— >  a,  d,  u,  c,  7>,  v,  x,  w  -*■  a,  d,  u,  c,  b,  v,  x,  w 

k  =  5:  a,  d,  u,  c,  T>,  x,  v,  w  — >  d,a,  u,  c,  b,  x,  v,  w  — *■  d,  a,  u,  c,  b,  x,  v,  w  —>■  d ,  a,  h,  c,  b,  x,  v,  w 

—>■  d,  a,  u,  c,  b,  x,  v,  w 

k  =  4:  d,  a,  a,  c,  x,  fr,  u,  w  —>■  d,a,  u,  c,  x,  b,  v,  w  —*■  d,  a,  u,  c,  x,  b ,  v,  w  —>■  d ,  a,  «,  c,  x,  b,  v,  w 

k  =  3:  d,  a,  u,  c,  x,  b,v,w  -*■  d ,  a,  u,  c,  x,  6,  v,  w  -»  <7,  a,  w,  c,  x,  b,  v,  w 

k  =  2;  d,  a,  «,  c,  x,  b,  u,  w  — »■  d,  a,  «,  c,  x,  7»,  u,  w 

=  1:  d,  a,  u.  c,  x,  b,  v,  w  — *■  d,  a,  «,  c,  x,  7>,  v,  w 

A  total  of7  +  6  +  5  +  4  +  3  +  2+  l  =28  comparisons  is  needed. 

10.  (a)  The  lists  1  and  2,  3,  4  can  be  merged  with  one  comparison. 

(b)  The  lists  5  and  2,  3,  4  can  be  merged  with  three  comparisons. 

(c)  The  lists  1,3,5  and  2,  4,  6  can  be  merged  with  3  +  3  —  1  =  5  comparisons. 
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11.  (a)  c,  a,  e,b,d  ->  a,  c b,d  —>■  a,  c,  e,  b ,  d  —*■  a,c,  b,  e ,  d 
a,  c,  b,d,e  —*■  a,  c,  b,  d,  e  — >  a,  b,  c,  d,  e 
a,  b,  c,  d,e  — >■  a,  b,  c.  d,  e 
a,  b,  c,d,  e  — >  a,  b,  c,  d,  e 

14.  The  answer  is  minfj,  /}.  It  is  impossible  to  have  fewer  than  this  number  of  comparisons  since  until  minj.v,  t}  of 
comparisons  have  been  made,  elements  remain  in  each  list.  To  see  that  minj.v,  r)  can  be  achieved,  consider  ordered 
lists  a\,  a2,  ■  ■  ■  ,as  and  bi,  b2,  •  •  •  ,  br  where  s  <  1  and  as  <  b\.  After  s  comparisons,  the  first  list  is  empty. 

16.  Using  the  efficient  binary  search,  0(\ogk)  comparisons  are  needed  to  search  the  predecessors  of  a^.  In  all,  the  number 
of  comparisons  is  (9(log  l+log2+-  •  -+logn).  Since  ioga+log/)  =  log  ab,  we  have  log  l+log2+-  •  +logn  =  logn!. 
The  result  follows. 

18.  This  is  less  efficient.  A  merge  sort  is  0(n  logn)  and  a  binary  search  is  O(logn).  Since  n  +  nlogn  ~  nlogn,  the 
suggested  procedure  is  0(n  logn),  while  a  linear  search  is  £>(«),  which  is  better. 


Exercises  8.4 


1.  (a)  t _ Permit) 

,  12  3  4 

n\TT2Jtin4 

2  12  4  3 

n\7t2TC^7T^ 

3  1324 

7T 1 7T27T37r4 

4  I342 

it  \  it  2x3x4 
,  14  2  3 

3  X\X2X}X4 

6  l432 

Tt)7T27t37T4 

7  2  13  4 

TV  1 7T2^37r4 

8  2  14  3 


j _ m _ _ s _ 

3  4  {1, 2,  4}c  =  {3} 

2  3  {1,  3}r  =  (2,  4) 

3  4  {1, 3,  4}c  =  {2} 

2  4  (1, 4}c  =  {2,  3} 

3  3  {l,4,3f  =  {2} 

1  2  {2}<?  =  {1, 3, 4} 
3  4  {2,  l,4)c  =  {3} 


2.  (b)  42531,43125,43152,43215,43251 

3.  (a)  We  consider  each  part  of  Step  2.  Finding  the  largest  j  such  that  Xj  <  it  j  + 1  requires  at  most  n  comparisons.  At  most 

another  n  comparisons  are  needed  to  find  the  minimum  of  {7 r,-  |  i  <  j,  tv,  >  itj ) .  To  find  the  complement  of  a  subset  A 
of  {1, 2, . . .  ,  n}  requires  searching  A  for  each  of  the  elements  1,2,...  ,  n  and  noting  those  which  are  not  in  A.  Using 
the  efficient  binary  search,  each  search  is  C,(log2  «),  adding  another  n  log2  n  comparisons.  An  efficient  merge  sort  adds 
another  n  log2  n  comparisons,  so  each  pass  through  Step  2  requires  at  most  n  +n  +n  log2  n  +n  log2  n  —  2(n+n  log2  n) 
comparisons.  Since  this  step  is  executed  n\  —  1  times  and  n  +  n  log2  n  ~  n  log2  n,  the  algorithm  is  0(n \n  log2  n ). 

5.  (a)  1234  1235  1236  1245  1246 

1256  1345  1346  1356  1456 

2345  2346  2356  2456  3456 

6.  (a)  23459,  23467,  23468  precede;  23478,  23479,  23489  follow. 

7.  (b)  First  list  the  combinations  of  1,2,...  ,  n  taken  r  at  a  time  by  the  method  of  Proposition  8.4.3.  Then,  for  each  of 

these  combinations,  enumerate  all  permutations  of  its  elements  using  Algorithm  8.4.2. 
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8.  (a)  123456  123457  123458  123467  123468  123478  123567 

123568  123578  123678  124567  124568  124578  124678 

125678  134567  134568  134578  134678  135678  145678 

234567  234568  234578  234678  235678  245678  345678 

9.  If  a\ci2  . . .  ar  is  one  of  the  combinations  of  1,2,...  ,  n  taken  r  at  a  time,  a\  cannot  be  less  than  1 ,  ai  cannot  be  less 
than  2  and,  in  general,  a,  cannot  be  less  than  i.  Thus,  123 ...  r  is  the  smallest  combination. 

Now  let  it  be  a  combination  and  it'  the  combination  determined  by  it  as  in  the  proposition.  We  show  that  it'  is  the 
immediate  successor  of  it  with  respect  to  lexicographic  order.  First,  since  all  numbers  to  the  left  of  k  —  1  are  the 
same  in  both  combinations,  while  —  1  is  increased  to  k,  we  conclude  that  tr  -<  it' .  Now  suppose  that  tt  <  <t  <  tt' 

for  some  combination  a.  All  integers  to  the  left  of  k  —  1  (in  it)  are  the  same  in  it'  as  well,  and  hence  also  in  a .  If 

k  —  1  were  also  the  same  in  er,  then  we  would  have  a  =  n  since  no  number  to  the  right  of  k  —  I  can  be  increased 
in  it.  Thus,  in  er,  k  —  1  must  be  increased  to  k  (and  no  more,  since  a  <  it').  It  then  follows  that  a  =  it'  since  it'  is 
the  smallest  sequence  whose  initial  segment  (up  to  k )  consists  of  the  integers  of  it' . 

10.  (a)  n;  n  —  r  +  1 ;  n  —  r  +  j. 

(b)  It  must  be  123  . . .  r. 

(c)  It  must  be  (n  —  r  +  1)  (n  —  r  +  2)  . . .  n. 

11.  Given  n  >  r  >  0,  to  enumerate  the  (")  combinations  of  1 , 2, ....  n  taken  r  at  a  time,  proceed  as  follows. 

Step  1.  Set  t  =  1.  Output  Comb(l)  =  123  . . .  r.  If  r  =  n,  stop. 

Step  2.  For  t  =  1  to  (")  —  I,  given  combination  Comb(/)  =  aiai . . .  ar,  determine  the  next  combination  Comb(f  +  1 ) 
as  follows. 

(i)  Find  the  largest  j  such  that  aj  <  n  —  r  +  j. 

(ii)  Output  Comb(r  +  1)  =  a\  «2  •  •  •  <*j-\  aj  +  1  aj  +  2  ...  aj  +  r  -  j  +  1 . 

Exercises 
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Here  is  the  solution  indicated. 
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(e)  Here  are  the  graph  corresponding  to  the  cubes  and  two  edge  disjoint  subgraphs. 
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Here  is  the  solution  indicated. 


F 

B 

R 

L 

Cube  4 

G 

B 

IV 

R 

Cube  3 

R 

G 

B 

W 

Cube  2 

B 

R 

G 

B 

Cube  1 

W 

W 

R 

G 

5.  (a)  Beta  index  =  g 

6. 

I 

9.  If  there  are  four  or  more  vertices  of  one  color,  say  red,  then  there  are  at  least  (3)  =  4  red  triangles.  Otherwise,  there 
are  three  red  and  three  white  vertices,  hence,  one  triangle  of  each  color. 
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1. 


v2 


Here  is  one  possibility.  t>i 


n 


3. 


5. 


t'5  v4 

a - 0 

© - i 

10  edges.  This  is  /C5,  the  complete  graph  on  five  vertices. 
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1 

0 

2 

1 
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2 

2 
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1 
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0 
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10. 


12. 


14.  (a) 

15.  (a) 


16.  (a) 

(b) 

17.  (a) 

(c) 

19.  (a) 

20.  (a) 

(c) 


21. 


Vertex 

Vi 

v2 

v3 

v4 

V5 

v6 

column  total 

8 

9 

1 

8 

10 

6 

vertex  degree 

2 

2 

3 

2 

1 

4 

accessibility  index 

4 

4.5 

2.3 

4 

10 

1.5 

City  Vf,  has  the  lowest  accessibility  index;  it  is  the  most  accessible.  City  t>5  is  least  accessible. 

iii.  Joining  v\  and  113  gives  now  eight  edges,  with  still  six  vertices. 

New  beta  index  =  |  =  1 .3. 

New  accessibility  indices  are  2.3,  4.5,  1.5,  4,  10,  1.5. 

Cities  Vf,  and  V3  are  now  tied  for  most  accessible;  115  is  least  accessible. 

iv.  Joining  i>2  and  Vf,  gives  eight  edges  and  still  six  vertices. 

New  beta  index  =  |  =  1 .3. 

New  accessibility  indices  are  4,  2.3,  2.3,  4,  9,  I. 

City  Vf,  is  most  accessible;  ns  is  least  accessible. 

/C„  has  (2)  =  n fn2~ 1 1  edges.  Each  of  the  n  vertices  has  degree  n  —  1,  so  the  sum  of  the  degrees  is  n(n  —  1).  This  is 
twice  the  number  of  edges,  as  asserted  by  Proposition  9.2.5.  If  11  is  even  there  are  n  odd  vertices;  if  n  is  odd,  there 
are  0  odd  vertices.  In  either  case  the  number  of  odd  vertices  is  even,  in  accordance  with  Corollary  9.2.7.  The  beta 
index  is  =  2^!. 

Consider  the  graph  in  which  the  vertices  correspond  to  the  people  at  the  party  and  an  edge  between  vertices  indicates 
that  the  corresponding  people  shook  hands.  The  degree  of  a  vertex  in  this  graph  is  the  number  of  hands  that  person 
shook.  Thus,  the  result  is  an  immediate  consequence  of  Corollary  9.2.7. 


e\{e} 


Degree  of  v\  is  1 ;  degree  of  v2  is  3;  degree  of  vy  is  4;  degree  of  V4  is  2. 

No.  The  maximum  degree  of  a  vertex  in  a  graph  with  four  vertices  is  3.  (Loops  are  not  allowed  in  graphs.) 

No  such  graph  exists.  The  sum  of  the  degrees  of  the  vertices  is  an  odd  number,  17,  which  is  impossible. 

Impossible.  A  vertex  of  degree  5  in  a  graph  with  six  vertices  must  be  adjacent  to  all  other  vertices.  Two  vertices  of 
degree  5  means  all  other  vertices  have  degree  at  least  2,  but  the  given  degree  sequence  contains  a  1. 

12 


This  is  not  bipartite  because  it  contains  a  triangle. 

This  is  bipartite  with  bipartition  sets  indicated  R  and  W. 


At  least  two  of  the  three  vertices  must  lie  in  one  of  the  bipartition  sets.  Since  these  two  are  joined  by  an  edge,  the 
graph  cannot  be  bipartite. 


Section  9.3  S-43 


23.  Let  x  be  the  number  of  vertices  in  one  of  the  bipartition  sets.  Then  n  —  x  is  the  number  of  vertices  in  the  other. 
The  largest  number  of  edges  occurs  when  all  x  vertices  in  one  set  are  joined  to  all  n  —  x  vertices  in  the  other;  so 

the  number  of  edges  is  at  most  x{n  —  x).  The  function  fix)  =  x(n  —  x)  (whose  graph  is  a  parabola)  has  a  unique 

2  2 

maximum  at  (|,  so  x(n  —  x)  <  ^  for  all  x,  and  the  result  follows. 

25.  (a)  2®  (c)  g)2©-3 

27.  23 

29.  J2  deg  Vj  =  k\V\.  But  also,  J2  deg  u,-  =  2\£\.  Therefore,  2\£ |  =  X:|V|  and  so  k  divides  2|£j.  But  k  is  odd,  so  k  |  \£ |. 

30.  There  are  n  vertices  and  n  possible  degrees  for  the  vertices;  namely,  0,  1 , 2,  . . .  ,  n  —  1 .  If,  however,  we  have  a  vertex 

of  degree  0,  then  it  is  not  possible  to  have  another  vertex  of  degree  n  —  1 .  Hence,  there  are  really  only  n  —  1  possible 
“holes”  into  which  the  n  vertices  can  fit.  Hence,  some  vertex  degree  is  repeated. 


Exercises  9.3 


(i)  and  (ii)  are  not  isomorphic  because  (i)  has  five  edges  and  (ii)  has  four, 
(i)  and  (iii)  are  isomorphic,  as  shown  by  the  labeling. 

A  B  A  D  a  D  a  B 


C  D  B  C  C  D  D 

(i)  and  (iii)  (i)  and  (v) 

(i)  and  (iv)  are  not  isomorphic  because  (iv)  has  a  vertex  of  degree  1  and  (i)  does  not.  (Also,  (iv)  has  only  four  edges.) 

(i)  and  (v)  are  isomorphic,  as  shown  by  the  labeling. 

(ii)  and  (iii)  are  not  isomorphic  because  (ii)  has  four  edges  and  (iii)  has  five  edges.  ~ 

(ii)  and  (iv)  are  not  isomorphic  because  (iv)  has  a  vertex  of  degree  1  and  (ii)  does  not. 

(ii)  and  (v)  are  not  isomorphic  because  (v)  has  five  edges  and  (ii)  has  four  edges. 

(iii)  and  (iv)  are  not  isomorphic  because  (iv)  has  a  vertex  of  degree  1  and  (iii)  does  not. 

(iii)  and  (v)  are  isomorphic,  as  shown  by  the  labeling  to  the  right. 

(iv)  and  (v)  are  not  isomorphic  because  (iv)  has  a  vertex  of  degree  1  [or  because  (v)  has  five  edges]. 


3.  (a) 


o  o 

0,  0,0 


1,1,0  2,1,1  2,2,2 

4.  (b)  These  graphs  are  isomorphic.  One  possible  isomorphism  is  given  by 


as  illustrated. 


<p(A)  =  p,  <p{B)  —  f,  < p(C)  =  u,  <p(D)  =  v ,  tp(E)  =  Q ,  (p{F) 

P 


<p(G) 


C 


D  F 


C  u  v  D 

5.  (a)  No;  it  has  a  vertex  of  degree  5. 

6.  Any  graph  Q  with  n  vertices  is  a  subgraph  of  IC„,  as  is  easily  seen  by  joining  any  pair  of  vertices  of  Q  where  there 
is  not  already  an  edge. 
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7.  (a)  Suppose  that  Q  and  ‘H  are  isomorphic  graphs  and  y>:  V(Q)  — »•  V(H)  is  the  isomorphism  of  vertex  sets  given 
by  Definition  9.3.1.  Label  the  edges  of  Q  arbitrarily,  ,gm,  and  then  use  <p  to  label  the  edges  of  H  as 

hi,  I12 . hm\  that  is,  if  edge  gi  has  end  vertices  v  and  w  in  Q,  let  h\  be  the  edge  joining  <p(v)  and  <p( w)  in  H. 

Repeat  to  obtain  I12 . hm.  Now  a  triangle  in  Q  is  a  set  of  three  edges  {g,-,  gj,  gk}  each  two  of  which  are  adjacent. 

It  follows  from  the  way  we  labeled  the  edges  of  H  that  {g;,  gj,  gk)  is  a  triangle  in  Q  if  and  only  if  [hi,  hj,  hk]  is  a 
triangle  in  'H.  Thus,  the  number  of  triangles  in  each  graph  is  the  same. 


10.  (a)  No. 


H 


Exercises  10.1 


1.  (a) 

2. 


3.  (a) 


4.  (a) 


6.  (a) 


7.  (a) 


(b) 


9. 


11.  (a) 

(b) 


14. 


0 - Q 

Since  the  graph  describing  the  Konigsberg  Bridge  Problem  (Fig  9.2)  has  several  odd  vertices,  it  is  not  Eulerian.  It  is 
not  possible  to  walk  over  the  bridges  of  Konigsberg  exactly  once  and  return  to  the  starting  position. 

The  graph  is  Eulerian  because  it  is  connected  and  each  vertex  has  even  degree  4.  Piecing  the  circuits  ABCDEFGA 
and  ACFBDGEA  together  at  A  gives  the  Eulerian  circuit  ABCDEFGACFBDGEA. 


Yes,  there  is  because  A  and  B  are  the  only  vertices  of  odd  degree. 


No.  In  the  graph  representing  the  modified  Konigsberg  Bridge  Problem 
there  are  two  vertices  of  odd  degree. 


D 


This  question  asks  about  the  possibility  of  an  Eulerian  trail  and  there  is  one,  between  the  two  vertices  A  and  B  of 
odd  degree.  One  possibility  is  AC BDADCAB. 

Yes.  In  this  case,  both  Q  and  H  must  be  cycles. 

K,n  is  Eulerian  -o-  n  is  odd. 


/C„  has  an  Eulerian  trail  if  and  only  if  n  =  2.  For  n  =  2,  certainly  o — o  has  an  Eulerian  trail.  For  n  >  2,  if  two 
vertices  have  odd  degree,  then  there  are  other  vertices  of  odd  degree  so  no  Eulerian  trail  can  exist. 

Suppose  the  vertices  of  the  circuit  are  vq,v\,  . . .  ,v„,  vo-  Consider  all  subcircuits  of  the  form  u,-,  it,-+i,  u,-+  2,  ...  ,  v,-. 
(There  is  at  least  one  such,  taking  i  =  0.)  That  subcircuit  u,-,  v,-+i,  u,+2, ...  ,  it;  which  uses  the  fewest  number  of 
vertices  is  a  cycle.  If  the  original  circuit  was  not  a  cycle,  then  those  vertices  not  on  the  first  chosen  subcircuit 
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15. 


20. 


V,-,  l’,+2.  • .  •  ,  Vj,  together  with  v, ,  form  another  subcircuit,  and  the  same  argument  as  before  shows  that  this 

contains  a  second  cycle. 


False.  In  the  graph  shown,  V1V2V3V2V]  is  a  closed  walk  that  does  not  contain  a  cycle. 


By  definition,  u  ~  u,  so  ~  is  reflexive.  If  u  ~  v,  then  there  is  a  walk  u  =  uq,  u\,  . . .  ,  Uk  =  v  from  u  to  v.  But  then 
v  =  Uk,  Uk-\ .  •  •  •  ,u\,uq  =  u  is  a  walk  from  v  to  u,  so  v  ~  u  and  ~  is  symmetric.  Finally,  if  u  ~  v  and  v  ~  w, 
then  there  is  a  walk  u  =  i<o,  «i , . . .  ,Uk=v  from  u  to  v  and  a  walk  v  =  t>o,  iq , . . .  ,vg  =  w  from  v  to  w.  But  then 
u  =  uq,  u\, . . .  ,  Uk  =  v  =  vq,  «i, . . .  ,  vi  =  w  is  a  walk  from  u  to  w,  proving  u  ~  w  and  establishing  transitivity. 


We  must  show  that  there  is  a  walk  between  any  two  vertices  x  and  y  of  Q.  We  show  that  there  is,  in  fact,  a  walk  of 
length  at  most  2  between  x  and  y.  If  xy  is  an  edge,  then  obviously  there  is  a  walk  from  x  to  y,  so  suppose  that  xy 
is  not  an  edge.  Let  S  be  the  set  of  vertices  adjacent  to  x  and  T  be  the  set  of  vertices  adjacent  to  y.  Thus  x  £  S  U  T 
and  y  £  S  U  T  so  the  number  of  vertices  in  Q  is  20  >  |<S  U  T\  +  2.  Using  the  Principle  of  Inclusion-Exclusion, 
20  >  |<S|  +  \T\  —  |<S  fl  T\  +  2  =  degx  +  degy  —  |5  fl  T|  +  2  >  21  —  |5n  T\.  It  follows  that  5  n  T  ^  0,  so  there  is  a 
vertex  u  in  both  S  and  T  and,  thus,  a  walk  xuy  from  x  to  y. 


21.  (a)  Since  Q  is  connected  and  n  >  1,  no  vertices  have  degree  zero.  Therefore,  if  there  are  no  vertices  of  degree  one,  every 
vertex  of  Q  has  degree  at  least  2.  Using  £  to  denote  the  set  of  edges  of  Q,  it  follows  that  2\£\  =  ]T  deg  Vj  >  2 n  (by 
Proposition  9.2.5)  and  the  number  of  edges  \£\  >  n  as  required. 


24.  Suppose  that  Q\  has  n \  components  and  that  Gi  has  ni  <  n\  components.  Let  v\ . v„,  be  vertices  of  Q\  each  in 

a  different  component.  Then  there  is  no  walk  between  any  pair  of  these  vertices.  On  the  other  hand,  if  u>i, . . .  ,  w„ , 
are  any  n\  vertices  of  Qi,  at  least  two  of  these  must  lie  in  the  same  component  (by  the  Pigeon-Hole  Principle)  and 
hence  there  is  a  walk  between  these  two.  Thus,  the  vertices  v,  in  Q\  do  not  correspond  to  any  n\  vertices  of  Gi,  so 
these  graphs  cannot  be  isomorphic. 


Exercises  10.2 

1.  This  graph  is  not  Hamiltonian.  To  see  this,  suppose  H  were  a  Hamiltonian  cycle. 
Since  vertices  A  and  B  have  degree  2,  the  two  edges  incident  with  each  of  these 
vertices  would  be  in  H.  Thus,  H  would  contain  the  cycle  ACBDA,  which  cannot 
be  the  case  since  this  does  not  contain  all  vertices  of  the  graph.  The  graph  is  not 
Eulerian  because  it  contains  vertices  of  odd  degree. 


2.  (b)  This  is  not  Hamiltonian,  since  it  isn’t  connected. 


4.  (a)  Yes,  it  is  Hamiltonian:  ABDCEA  is  a  Hamiltonian  cycle. 

(c)  The  graph  is  not  Eulerian.  Vertices  A  and  E  have  odd  degree. 


7.  In  the  graph  to  the  right,  vertices  correspond  to  rooms  and  edges  to  doorways. 
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(a)  No,  because  the  graph  is  not  Hamiltonian.  Since  vertices  A,  C,  D,  and  E  have  degree  2,  the  edges  labeled  3,  4,  5, 
6,  1,  and  2  would  have  to  be  part  of  any  Hamiltonian  cycle.  These  edges,  however,  define  a  proper  cycle,  which  is 
not  allowed. 

8.  The  result  is  obvious  if  n  =  1  since  in  this  case  there  are  just  two  people  who  are  friends.  So  we  assume  that  n  >  1. 
Consider  the  graph  whose  vertices  correspond  to  people  and  where  an  edge  between  vertices  v  and  u  signifies  that 
v  and  li  are  friends.  The  question  asks  us  to  prove  that  this  graph  is  Hamiltonian.  This  is  an  immediate  consequence 
of  Dirac’s  Theorem  since  the  graph  in  question  has  2n  >  3  vertices  each  of  degree  d  >  n  = 

9.  (a)  n  edges. 

(c)  ICn  has  ”^'l2~1)-  edges,  so  the  maximum  is  edge  disjoint  cycles. 


10.  The  cube  is  indeed  Hamiltonian;  the  labels  1, . . .  ,  8  on  the  vertices 
exhibit  a  Hamiltonian  cycle. 


4  5 


13.  (a)  As  suggested,  add  an  extra  vertex  v  to  Q  and  join  it  to  all  other  vertices.  Then  deg  v  —  n  >  iL^,  and  deg  w  > 

4- 1  =  for  all  other  vertices.  By  Theorem  10.2.4,  this  new  graph  with  n  4- 1  vertices  has  a  Hamiltonian  cycle. 
Deleting  v  and  all  the  new  edges  incident  with  v  leads  to  a  Hamiltonian  path  in  our  original  graph. 

(e)  o — o  is  an  example! 

14.  As  in  the  proof  of  Dirac’s  Theorem  all  vertices  adjacent  to  tq  are  in  V. 

(a)  If  tq  and  v,  are  adjacent,  C\ :  tqtq  •  •  •  v,v\  is  a  cycle.  If  t  <  n,  then  there  exists  a  vertex  w  not  in  C\.  As  noted,  w  is 
not  adjacent  to  tq,  hence  the  hypothesis  tells  us  that  deg  tq  4-  deg  to  >  n.  It  follows  that  there  exists  some  vertex  u 
which  is  adjacent  to  both  tq  and  to;  otherwise,  the  graph  has  at  least  n  4-2  vertices,  counting  tq  and  to  and  the  disjoint 
sets  of  vertices  adjacent  to  v\  and  to  to.  Since  u  is  adjacent  to  tq,  u  =  v,  for  some  i.  Now  100,0, +i  •  ■  •  vtv\  ■  ■  ■  o,_i 
is  a  path  longer  than  V,  a  contradiction.  Thus  t  =  n,  so  C\  is  a  Hamiltonian  cycle  and  we’re  done. 

15.  Add  a  new  vertex  adjacent  to  all  existing  vertices  and  apply  the  result  of  Exercise  14. 


16.  (c)  False. 


is  Eulerian  but  not  Hamiltonian. 


17.  (a)  Yes!  Since  there  is  a  path  between  any  two  vertices,  the  graph  is  connected.  Thus,  there  exists  an  edge  e  in  the  graph, 
joining,  say,  vertices  o  and  to.  Now  let  V  be  a  Hamiltonian  path  from  o  to  to.  Then  starting  at  0,  following  V  to  to 
and  then  e  to  0  produces  a  Hamiltonian  cycle  from  v  to  v. 

Exercises  10.3 

"010100* 

10  110  0 
010111 
*•  A  -  1110  0  0 

0  0  10  0  1 

0  0  10  10. 
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3.  (a)  The  (3,  5)  entry  of  A3  is  5.  The  (2,  2)  entry  of  A3  is  2. 

4.  Each  1  represents  an  edge.  Each  edge  VjVj  contributes  two  1  ’s  to  the  matrix,  in  positions  (/',  y)  and  (y,  7).  The  number 
of  l’s  is  twice  the  number  of  edges. 

6.  (a)  The  (7,  y)  entry  in  A2  is  the  number  of  walks  of  length  2  from  7  to  y.  Hence,  the  sum  of  all  such  entries  is  the  total 
number  of  walks  of  length  2. 

-  o  i  o  i  i  -|  roiooi- 

10  10  0  10  0  1  0 

8.  (a)  A,  =  0  10  0  1  ,  A2  =  0  0  0  1  0. 

10  0  0  0  0  110  1 

_  1  0  1  0  0  J  L  1  0  0  1  0  _ 

(b)  The  function  <p  is  an  isomorphism  because,  if  the  vertices  of  G\  are  relabeled,  t;,-  being  replaced  by  <p(u,)  =  then 
the  adjacency  matrix  of  Q\  relative  to  the  w,  ’s  is  A2.  (See  Theorem  10.3.3.) 

'  0  0  1  00- 

0  0  0  0  1 

(c )  P  =  0  0  0  10 

1  0  0  0  0 

.  0  1  0  0  0  . 

[??'■] 

11.  (a)  PA  =  x  y  z  is  A,  but  with  rows  written  in  the  order  2,3,  1,  the  order  in  which  the  rows  of  7  were 
a  b  c 

rearranged  to  give  P. 

The  matrices  are  the  adjacency  matrices  of  graphs  Gu  62 ,  respectively.  Since  G2  has  three  vertices  of  degree  1  while 
G\  has  only  one,  the  graphs  are  not  isomorphic,  so  no  such  P  exists,  by  Theorem  10.3.4. 

The  7th  entry  on  the  diagonal  of  A37  is  the  number  of  walks  of  length  37  from  v,  to  itself.  But  in  a  bipartite  graph, 

you  can  only  get  from  Vj  back  to  itself  in  an  even  number  of  steps.  Hence,  the  entry  is  0. 

A2  is  an  adjacency  matrix  A  is  the  0  matrix. 

Proof.  For  A2  to  be  an  adjacency  matrix,  it  must  have  all  diagonal  entries  equal  to  0.  But  the  7th  diagonal  entry  of 
A2  is  the  number  of  walks  of  length  2  from  v,  to  itself.  Now,  if  u,t>y  is  an  edge  of  Q,  then  v,  v j  ty  is  a  walk  of  length 
2  from  Vj  to  itself,  and  the  7th  diagonal  entry  would  not  be  0.  We  conclude  that  Q  cannot  have  any  edges;  that  is,  A 

is  the  zero  matrix.  On  the  other  hand,  if  A  is  the  zero  matrix,  certainly  A2  =  A  is  an  adjacency  matrix. 

18.  We  could  store  the  incidence  matrix,  whose  columns  correspond  to  edges  and  rows  to  vertices.  The  (7,  j)  entry  is  1 
if  vertex  i  is  incident  with  edge  y,  and  otherwise  0.  We  could  also  simply  store  the  edges  as  a  linear  list,  perhaps  of 
numbers:  Assuming  less  than  100  edges,  we  could  store  the  edge  ij  as  1007  +  j  and  recover  7  as  L10^7  J  and  j  as 
(1007  +  y)  -  1007. 

Exercises  10.4 

2.  (a)  [BB]  The  final  labeling  starting  at  A  is  shown  to  the 
right.  The  shortest  path  from  A  to  E  is 
ABC  J  IGF  KE  and  has  length  13. 


13.  (a) 

14. 

15.  (a) 


H(A,  1)  9  G(7,  9)  1  F(G,  10) 


S-48  Solutions  to  Selected  Exercises 


3.  If  we  start  at  A,  permanent  labels  will  be  assigned  in  the  order  A,  H,  B,  C,  D,  J ,  I,  G,  F,  K,  E. 

5.  A  shortest  path  has  length  11.  One  way  in  which  permanent  labels  might  be  assigned  is  in  the  order  A ,  B ,  D,  C,  S, 

J ,  /,  H,  E,  L,  G,  F,  K,  M,  O ,  P ,  A,  0,  /?. 

7.  The  shortest  path  has  length  13,  as  shown. 


9.  In  each  case  the  answer  is  yes,  if  A  and  E  were  in  different  components  of  a  graph  which  was  not  connected. 

11.  (a)  Assign  each  edge  a  weight  of  I. 

12.  As  explained  in  the  text,  the  complexity  function  for  determining  the  shortest  distance  from  a  given  vertex  to  each 
of  the  others  is  (D(n2);  that  is,  for  sufficiently  large  n  and  some  constant  c,  the  algorithm  requires  at  most  cn2 
comparisons.  Applying  the  algorithm  to  each  of  the  n  vertices  (after  which  all  shortest  distances  are  known)  requires 
at  most  n(cn2)  =  cn 3  comparisons.  Thus,  this  process  is  <D(n2). 

13.  (b)  d(  1,5)  =  27;  d(  1,6)  is  still  oo;  ef(3,4)  is  8;  d( 8,5)  =  24. 

16.  It  is  indeed  necessary  to  continue.  An  identical  set  of  d(i,  j)  for  k  —  r  and  k  =  r  +  1  simply  u2 
indicates  that  the  shortest  path  from  each  u,  to  each  vj  passing  through  iq , . . .  ,  vr  has  the  f 

same  length  as  the  shortest  path  through  v\,...  ,  vr+\.  In  the  graph  shown  to  the  right,  for  l 

instance,  the  values  of  d(i ,  j )  do  not  change  until  k  =  3  since  the  shortest  path  between 
pairs  of  vertices  cannot  be  reduced  until  vertex  v 3  is  used.  js 

Exercises  11.1 

1.  (a) 


2.  /C5  is  Eulerian,  so  no  additional  edges  are  needed. 


3.  Here  is  the  unique  solution.  4 


4. 


We  show  one  of  several  solutions  in  each  case. 
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6. 

9. 


The  shortest  route  from  X  to  X  is  a  circuit  (perhaps  in  a  pseudograph)  passing  through  Y.  Hence,  it  can  also  be 
viewed  as  a  circuit  from  Y  to  Y .  Any  other  route  from  Y  to  Y  could  also  be  viewed  as  a  route  from  X  to  X.  So  there 
cannot  be  any  shorter  route  from  Y  to  Y. 


Each  odd  vertex  in  Q  is  the  end  vertex  of  precisely  one  new  path  constructed  by  the  algorithm,  so  it  is  even  in  Q' . 
Each  even  vertex  in  Q  either  has  unchanged  degree  in  Q'  or,  as  an  intermediate  point  on  one  or  more  paths  between 
odd  vertices,  has  its  degree  increased  by  a  multiple  of  2,  and  so  remains  even  in  Q' . 


Exercises  11.2 

1.  (a)  These  digraphs  are  not  isomorphic  because  one  has  six  vertices  while  the  other  has  five. 

3.  Every  arc  comes  out  of  one  vertex  and  goes  into  another,  hence,  adds  one  to  the  sum  of  all  indegrees  and  one  to  the 
sum  of  all  outdegrees. 

5.  The  answer  is  yes.  Since  Q  is  an  Eulerian  graph,  there  exists  an  Eulerian  circuit.  Now  just  orient  the  edges  of  this 
circuit  in  the  direction  of  a  walk  along  it. 

0  110' 

10  10 
0  10  1 
1  I  I  0  _ 

(b)  The  (3,  3)  entry  of  A2  is  2  because  there  are  two  directed  walks  of  length  2  from  vertex  3  to  vertex  3;  namely,  323 
and  343.  The  (1,4)  entry  of  A2  is  1  because  the  only  directed  walk  of  length  2  from  vertex  1  to  vertex  4  is  134. 

(c)  The  (4,  2)  entry  of  A2  is  4  because  there  are  four  directed  walks  of  length  3  from  vertex  4  to  vertex  2;  namely,  4212, 
4232,  4132,  and  4342.  The  (1,3)  entry  of  A4  is  6.  There  are  six  walks  of  length  4  from  vertex  1  to  vertex  3;  namely, 
13423,  13413,  13213,  12123,  12323,  and  12343. 

(d)  The  digraph  is  strongly  connected;  12341  is  a  directed  circuit  which  permits  travel  in  the  right  direction  between  any 
two  vertices. 

(e)  The  digraph  is  not  Eulerian.  Vertices  2,  3,  and  4  have  different  indegree  and  outdegree;  vertex  3,  for  instance,  has 
indegree  3  and  outdegree  2. 

10.  (a)  With  the  graphs  labeled  as  shown,  the  adjacency  matrices  are 


-  010  0  0  0  " 

0  0  10  0  0 

„  0  0  0  10  0 

Ai  -  0  0  0  0  10  and  a2 

0  0  0  0  0  1 

.1  0  0  0  0  0  . 
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11.  (a)  A\ 


0  10  1“ 
0  0  10 
0  0  0  1 
1  1  0  0  _ 


and  A  2  = 


0  0  0  1' 
10  10 
110  0 
0  0  10. 


(b)  With  the  vertices  of  Q\  relabeled  according  to  (p,  its  adjacency  matrix  becomes  that  of  Cn_- 

-0100- 

(c)  P-  1000 
^  0  0  0  1 

.0010. 


(d)  The  digraphs  are  strongly  connected:  In  Q 1,  for  instance,  viv2v3V4.vi  is  a  circuit  which  respects  arrows  and  Q2  is 
isomorphic  to  Q\  hence,  also  strongly  connected. 

(e)  The  digraphs  are  not  Eulerian.  In  Q\,  for  instance,  vertex  v2  has  indegree  2  but  outdegree  1. 

14.  (a)  Each  of  these  graphs  is  strongly  connected;  each  is  a  cycle. 


15.  (a)  There  are  just  two  possibilities  for  the  outdegree  sequence;  1,  1,  1  and 
2, 1,0.  The  corresponding  graphs  are  shown  at  the  right. 


16. 


No.  K 3  is  connected,  but 


is  not  strongly  connected. 


18.  True.  Let  v  be  a  vertex.  Since  Q  has  at  least  two  vertices  and  Q  is  strongly  connected  there  is  some  arc  of  the  form 
vw.  Since  Q  is  strongly  connected,  there  is  a  path  from  w  to  v.  This  path  does  not  use  arc  vw  since  all  vertices  of  a 
path  are  distinct.  For  the  same  reason,  all  arcs  on  this  (or  any)  path  are  distinct.  Thus,  arc  vw  followed  by  the  path 
from  w  to  v  gives  the  desired  circuit. 


1 

2 

Max.  no.  of  arcs 

3  4 

5 

6 

V2  1,  Uj 

1.  tfi 

1,  U] 

L  V\ 

1.  vi 

I.  v\ 

v3  8,  v\ 

2,  V2 

2,  V2 

2,  l>2 

2,  l>2 

2,  v2 

V4  00 

11,  U3 

5,  v 3 

4,  v5 

4,  l>5 

4,  U5 

V5  OO 

4,  V2 

3,  D3 

3,  U3 

3,  U3 

3,  U3 

V6  7,  V[ 

7,vi 

7,  vt 

6,  v5 

6,  V5 

6,  v5 

20.  Bellman-Ford  works  fine  on  undirected  graphs  without  negative  edges.  It  wouldn’t  make  sense  to  apply  Bellman-Ford 
to  an  undirected  graph  with  a  negative  edge  weight,  since  any  walk  could  be  shortened  by  passing  up  and  down  that 
edge  as  often  as  desired. 

21.  (a)  Dijkstra  incorrectly  determines  that  the  length  of  a  shortest  path  to  V2  is 

1 .  Dijkstra  does  not  always  work  when  applied  to  digraphs  which  have 
arcs  of  negative  weight. 

(b)  No  shortest  path  algorithm  will  work.  There  is  a  negative  weight  cycle, 
hence  no  shortest  distance  to  V2,  for  example. 

23.  (a)  As  Step  2  of  the  algorithm  shows,  for  each  j,  the  values  d, ( j )  depend  only  on  the  values  dj-\(k)  and  the  arc  weights. 
It  follows  that  if  the  values  dj-\ (j)  =  dj ( j )  are  identical,  then  dj-\(j)  =  di (j)  =  di+\{j)  =  ■  •  •  =  dn-\ (j). 


v2(v\,  1) 


va  (ui .  1) 
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Exercises  11.3 


1.  (a)  Since  the  chain  ends  UC.  the  given  G-fragments  arise  from  any  of  3!  =  6 
chains.  The  given  U,C-fragments  arise  from  any  of  5!/2!  =  60  chains.  The 
better  estimate  is  6.  The  abnormal  fragment  is  UC,  and  this  ends  the  chain. 
The  interior  extended  bases  are  U,  G.  The  unsplittable  fragments  are  G,  C, 

C,  U.  The  chain  starts  and  ends  with  C.  The  only  answer  is  CAAGCUGGUC. 


CAAG 


uc*c  1  GGU 


AAG 


(e)  Since  the  chain  ends  CC,  the  G-fragments  arise  from  any  of  4!  =  24  chains. 
The  U,C-fragments  arise  from  any  of  6!/2!  =  360  chains.  The  better  estimate 
is  24.  The  only  interior  extended  base  is  C.  The  unsplittable  fragments  are  G, 
C,  C.  This  chain  starts  with  G  and  ends  with  C.  There  are  two  answers: 
GUGAUGACCAGCC  and  GAUGUGACCAGCC. 


GU  GAU 


2.  (a)  The  last  letter  in  the  chain  is  U,  C,  G,  or  A.  If  it  is  U  or  C,  the  last  G-fragment  will  be  abnormal.  If  it  is  G,  the  last 

U,C-fragment  will  be  abnormal.  If  it  is  A,  the  last  G  and  the  last  U.C-fragments  will  each  be  abnormal. 

(b)  There  are  two  abnormal  fragments  if  and  only  if  the  chain  ends  with  A. 

3.  The  chains  GGUGU  and  GUGGU  have  the  same  G-  and  U.C-fragments.  We  leave  it  to  you  to  check  that  no  shorter 
chains  exist. 

4.  (a)  Let  B  be  an  interior  extended  base.  Then  B  came  from  some  fragment,  say  a  U,C-fragment.  (The  argument  which 

follows  easily  adapts  to  the  case  of  a  G-fragment.)  Thus  our  fragment  has  the  form  XBY,  and  the  only  U  or  C  here 
is  possibly  at  the  end  of  Y.  Since  the  G-enzyme  splits  off  B,  both  X  and  B  end  with  a  G  and  B  contains  no  other  G. 
Thus  B  is  itself  a  G-fragment.  Since  B  contains  no  U,  C  or  G,  it’s  unsplittable. 

(b)  The  first  extended  base  is  certainly  an  unsplittable  fragment:  It  ends  at  the  first  U,  C  or  G.  Clearly,  it  isn’t  interior. 
(It’s  first!)  The  last  extended  base  contains  at  most  one  U,  C  or  G  (and  this  at  its  end)  so  it  is  also  unsplittable.  Again, 
it  is  clearly  not  interior.  (It  is  last!) 


Exercises  11.4 

1.  (a)  See  Exercise  15(a)  of  Section  11.2.  The  score  sequences  (left  to  right)  are  1,1,1  and  2,  1,0.  The  tournament  on  the 

left  is  not  transitive;  the  one  on  the  right  is. 

2.  Since  for  each  pair  of  (distinct)  vertices  v,,  vj  precisely  one  of  VjVj,  Vj v,  is  an  arc,  in  the  adjacency  matrix  A,  for 
each  i  ^  j,  precisely  one  of  a,j,  aji  is  1.  Thus,  A  +  AT  has  0’s  on  the  diagonal  and  l’s  in  every  off-diagonal  position. 

5.  (a)  The  sum  of  the  scores  is  the  number  of  arcs,  by  Proposition  11.2.2;  thus,  YH=is>  =  (2)  =  \n(n  ~  O-  Since 

2^"=1(n  —  1  —  St)  —  j (n  —  1)  —  ]C"_  1  Si  =  n(n  —  1)  —  j«(n  —  1)  =  j«(n  —  1),  we  have  the  desired  result.  In  a 
tournament  with  n  players,  each  player  plays  n  —  1  games,  so  if  a  player  wins  s,  games,  he  loses  n  —  1  —  s;  games. 
The  result  says  that  the  sum  of  the  numbers  of  wins  equals  the  sum  of  the  numbers  of  losses. 

6.  Let  w  be  any  other  vertex.  If  v  beats  w,  there  is  a  path  of  length  1  from  v  to  w  and  we’re  fine.  Hence,  assume  that  w 
beats  v ,  that  is,  w  — >  v.  Among  those  vertices  that  v  beats  there  must  be  one,  say  x,  which  beats  w,  since  otherwise 
s(w)  >  s(v)  +  1  (recall  that  w  beats  v),  contradicting  the  maximum  score  of  v.  Hence,  for  some  x,  we  have 

V 

,  but  then  there  is  a  path  of  length  2  from  v  to  w. 

X 

7.  (a)  No.  The  sum  of  the  scores,  s(y),  is  the  number  of  arcs.  Here,  1  +  1  +  2  +  3  =  7,  but  the  number  of  arcs  is  (t)  =  6. 
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11.  Suppose  T  is  a  transitive  tournament  that  contains  the  cycle  V1V2 . . .  v„vi.  Since  T  contains  no  3-cycle,  upon 
considering  the  vertices  v\,  V2  and  113,  we  see  that  iq  1)3  must  be  an  arc.  Then,  considering  v\,  1)3,  14,  we  see  that  v \  14 
is  an  arc.  Continuing,  we  eventually  have  an  arc  v\vn,  contradicting  the  fact  that  vnv\  is  an  arc.  (In  a  tournament, 
for  any  pair  «,  o  of  vertices,  precisely  one  of  uv,  vu  is  an  arc.) 


Exercises  11.5 

1.  With  reference  to  Fig  11.18,  the  arc  BD  now  has  weight  4.  This  does  not  change  any  of  the  labels  shown,  but  it  does 
eliminate  the  possibility  of  a  second  shortest  path.  Now  the  only  shortest  path  is  SADFT,  requiring  11  units  of  time 
as  before. 

5.  The  new  digraph  is  shown.  The  project  now  requires  18  units  of  time.  The  critical  path  is  SFEW PaDVCT .  The 
slack  in  PI  is  now  1;  the  slack  in  the  installation  of  kitchen  cabinets  is  3.  All  other  tasks  have  slack  0  since  they  lie 
on  the  critical  path. 


K(Pa,  15) 


(5,4)  (£,  8)  (£,12) 


T(C,  18) 


£(£,  9) 


V(D,  16) 


C(V.  18) 


6. 


(a) 


Since  the  time  required  for  a  task  does  not  depend  on  which  other  tasks  have  been 
can  occur  simultaneously,  this  a  Type  II  scheduling  problem. 


completed  and  since  some  tasks 


T(Pa,  27) 


(b)  The  only  critical  path  is  SGShH PaT,  taking  a  total  of  27  months. 

(c)  The  slack  of  Po  is  4.  This  job  could  take  as  much  as  9  units  of  time  without  affecting  the  time  (21)  by  which  the  task 
PI,  subsequent  to  Po  and  on  the  critical  path,  would  be  accomplished.  There  is  no  slack  in  Sh.  It’s  on  the  critical 
path.  The  slack  in  A  is  3.  If  this  task  took  7  units  of  time,  its  label  would  be  (Sh,  23)  and  the  label  on  R  would 
become  (A,  27).  There  would  be  another  critical  path,  SGShART,  but  the  time  for  the  job  would  be  unaffected.  The 
slack  in  Tw  is  4.  Twisting  could  take  as  long  as  6  units  of  time  and  the  project  would  still  be  completed  in  27  units. 
The  slack  in  R  is  3. 

(d)  No,  the  project  will  not  be  delayed.  Delaying  Po  by  three  months  changes  Po's  label  to  (G,  15),  but  this  does  not 
affect  any  other  labels  on  the  digraph.  Delaying  A  by  three  months  changes  A’s  label  to  (Sh,  23),  but  does  not  affect 
the  critical  path,  or  its  length. 

10.  (a)  This  type  I  scheduling  problem  is  described  by  the  following  digraph. 
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(b)  The  shortest  time  is  12  units. 

(c)  One  of  several  ways  in  which  this  time  can  be  achieved  is  to  first  conduct  the  library  search,  then  do  the  field  work, 
then  the  laboratory  analysis,  then  create  the  database  and  finally  do  the  write-up. 

Exercises  12.1 


7. 


We  think  of  a  path  as  a  sequence  of  vertices.  There  are  at  least  two  vertices  of  Vi  which  are  also  in  V\  ( v  and  w, 
for  example).  Let  u  w  be  the  first  vertex  of  V\  which  is  encountered  on  the  reversed  path  Pi  from  w  back  to  v. 
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(It  is  possible  that  u  might  equal  v.)  It  follows  that  the  path  from  u  to  w  along  V\  followed  by  the  path  back  to  u 
along  V2  in  reverse  from  ui  is  a  cycle. 

Exercises  12.2 


? 

!•  (a)  g — e — e — e — e  0 

A 

(b)  Since  each  tree  with  five  vertices  has  all  vertices  of  degree  at  most  four,  there  is  one  isomer  for  each  such  tree,  the 
C  atoms  corresponding  to  the  vertices.  There  are  three  isomers  of  C5//12. 

3.  A  beta  index  less  than  1  says  that  there  are  fewer  edges  than  vertices.  One  possibility  is  that  the  graph  is  not 
connected;  in  other  words,  there  exist  two  cities  such  that  it  is  impossible  to  fly  from  one  city  to  the  other.  If  the 
graph  is  connected,  then  it  must  be  a  tree  by  Theorem  12.2.3.  This  means  there  is  a  unique  way  of  flying  from  any 
city  to  any  other  city. 

5.  (— »)  A  tree  with  n  vertices  has  n  —  1  edges  by  Theorem  12.2.3  and  no  cycles  by  Proposition  12.1.2. 

(«— )  Suppose  G  is  an  acyclic  graph  with  n  vertices  and  n  —  1  edges.  Since  Q  has  no  cycles,  Proposition  12.1.2  shows 
that  we  have  only  to  prove  that  Q  is  connected.  Let  then  C],  C 2,  ■  ■  •  ,  C*  be  the  connected  components  of  G  and 
suppose  that  C,  has  n,-  vertices.  (Thus,  ]T  n,  =  n.)  Since  Q  has  no  cycles,  there  are  no  cycles  within  each  C, .  It  follows 
that  each  C;  is  a  tree  with  «,•  -  1  edges.  The  number  of  edges  in  Q  is,  therefore,  A  A  —  1)  =  (A  «;)  —  k  =  n  —  k. 
So  n— k  =  n— \,k  =  l,  G  has  only  one  component;  that  is,  G  is  connected. 

10.  There  are  no  circuits  in  the  subgraph  since  there  are  no  circuits  in  CkHjk+i-  Also,  given  any  two  C  vertices,  there  is  a 
path  between  them  in  CkHik+2  (because  Q lhk-\2  is  connected).  Any  H  vertex  on  this  path  would  have  degree  two. 
Thus,  there  is  none;  the  path  consists  entirely  of  C  vertices  and  hence  lies  within  the  subgraph.  Thus,  the  subgraph 
is  connected,  hence  a  tree. 

11.  (a)  Let  x  be  the  number  of  H  vertices  adjoined.  Since  T  had  k  —  1  edges,  and  one  new  edge  is  added  for  each  H,  Q 

has  (k  —  1)  +  x  edges.  Therefore,  deg  v,  =  2(k  —  1  +  x ).  But  J2  deg  v,  =4  k+x  since  each  C  has  degree  4  and 
each  H  has  degree  I .  Therefore,  4k  +  x  =  2k  —  2  +  2x  and  x  =  2k  +  2. 

14.  A  tree  is  a  complete  bipartite  graph  if  and  only  if  it  is  fC]  n  for  some  n. 

Proof.  Certainly  K. i,„  is  a  tree.  Conversely,  if  a  tree  is  complete  bipartite,  then  it  is  K.m  n  for  some  m  and  n.  But  such 
a  graph  has  no  vertices  of  degree  one  unless  rn  or  n  is  1 .  The  result  follows. 

15.  (a)  By  Corollary  1 2.2.4,  each  of  the  c  components  has  at  least  two  vertices  of  degree  1 .  So  there  are  at  least  2c  vertices 

of  degree  1  altogether. 

16.  (a)  Using  Corollary  12.2.4,  we  have  ^deg(u,)  >  8,  so  the  tree  has  at  least  four  edges  and  hence  at  least  five  vertices.  If 

the  result  is  not  true,  then  there  are  at  most  three  vertices  of  degree  one  while  the  rest  have  degree  at  least  two.  Then 

A  deg  Vi  >  2(3)  +  3(1) +  (n- 5)2  =  2n-l, 
contradicting  the  fact  that  ^degu,-  =  2 (n  —  1). 

Exercises  12.3 


1.  We  show  T  and  two  other  spanning  trees  found  by 
adding  a  and  then  successively  deleting  /  and  g. 


T 


a 


a 
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4.  By  Theorem  12.3.3,  the  numbers  are  1  1  =  1,  2°  =  1,  31  =  3,  42  =  16,  53  =  125,  and  64  =  1296. 


A  BA  BA  BA  BA  BA  BA  BA  B 


C  DC  DC  DC  DC  DC  DC  DC  D 


A  BA  BA  BA  BA  BA  BA  BA  B 


C  DC  DC  DC  DC  DC  DC  DC  D 

7.  (a)  K-2,2  has  four  spanning  trees  (obtained  by  deleting  each  edge  in  succession).  They  are  all  isomorphic  to  o  o  o  o. 

9.  The  edge  in  question  is  a  bridge  (see  Definition  13.2.2);  that  is,  its  removal  disconnects  the  graph.  To  see  why.  call 
the  edge  e.  If  Q  \  [e]  were  connected,  it  would  have  a  spanning  tree.  However,  since  Q  \  { e )  contains  all  the  vertices 
of  Q ,  any  spanning  tree  for  it  is  also  a  spanning  tree  for  Q.  We  have  a  contradiction. 

12.  (a)  Say  the  edge  is  e  and  T  is  any  spanning  tree.  If  e  is  not  in  T,  then  T U  jc)  must  contain  a  circuit.  Deleting  any  edge 

of  this  circuit  other  than  e  gives  another  spanning  tree  which  includes  e. 

(c)  No.  If  the  three  edges  form  a  circuit,  no  spanning  tree  can  contain  them. 

14.  (a)  The  subgraph  To  U  {e}  is  connected  because  %  is.  Since  it  has  the  same  number  of  vertices  and  edges  it  cannot  be  a 

tree;  therefore,  it  contains  a  circuit.  Let  /  be  an  edge  on  this  circuit  such  that  /  £  T.  (Since  T  does  not  contain  a 
circuit,  this  is  possible.)  Then  (To  U  { e })  \  {/}  is  still  connected  (you  can  go  either  way  around  a  circuit),  and  it  has 
n  vertices  and  n  —  1  edges,  so  it's  a  spanning  tree. 

16.  There  are  (j)  possible  edges  from  which  we  choose  n  —  1.  The  number  of  graphs  is,  therefore.  (^).  The  number 
of  trees  on  n  labeled  vertices  is  n"~2.  The  number  of  trees  on  n  labeled  vertices  is  nll~2.  For  n  <  6.  the  table  shows 
the  numbers  of  trees  vs.  graphs. 


Exercises  12.4 

1.  (a)  We  want  five  edges  (since  there  are  six  vertices).  Choose  BC ,  then  AD,  FE,  and 
DE.  We  would  like  next  to  choose  AE,  but  this  would  complete  a  circuit  with  AD 
and  DE,  so  we  choose  AC  and  obtain  the  spanning  tree  shown,  of  weight  13. 


2.  (a)  The  edge  of  least  weight  incident  with  E  is  EF .  The  least  weight  of  those  edges 
adjacent  to  EE  is  3;  we  choose  one  of  them,  say  AE  (in  an  effort  to  obtain  a 
different  tree  from  before).  There  is  just  one  edge  of  least  weight  among  whose 
which,  together  with  EE  and  AE,  form  a  tree;  namely,  AD.  Now  those  edges 
which  together  with  EF,  AE,  and  AD  form  a  tree  have  weights  5  and  6.  We 
choose  one  of  least  weight,  say  CE.  Finally,  we  choose  BC,  obtaining  the  tree  of 
weight  1 3  shown  to  the  right. 
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3.  We  need  five  edges  because  there  are  six  vertices  in  the  graph.  We  use  Kruskal’s 

algorithm.  First  select  BC  of  weight  5,  then  DF  of  weight  6.  Next  we  select  BF  and 
AB,  both  of  weight  7.  We  would  like  to  select  AF  next,  but  cannot  since  it  completes 
the  circuit  ABF  A,  so  we  select  EF  of  weight  9  next.  The  five  edges  we  have  selected 
comprise  a  minimum  spanning  tree.  The  smallest  length  of  pavement  required  is 
5  +  6  +  7  +  7  +  9  =  34. 


4.  (b)  i.  Choose  B  D,  BE,  and  CE  each  of  weight  6.  The  remaining  edge,  CD,  of  maximum 
weight  (6)  cannot  be  chosen  because  it  would  complete  the  circuit  BDCEB  with  the 
previously  chosen  edges.  So  we  choose  DF  and  then  AB  obtaining  the  spanning  tree 
shown  at  the  right  of  maximum  weight  28. 

6.  Assign  all  edges  of  the  graph  a  weight  of  I .  Then  carry  out  the  algorithm. 

8.  (a)  If  the  graph  is  unweighted,  put  a  weight  of  1  on  edge  e  and  2  on  every  other  edge.  If  the  graph  is  weighted,  ensure 
(by  temporarily  changing  weights  if  necessary)  that  the  weights  of  the  edges  different  from  e  are  all  larger  than  the 
weight  of  e.  In  either  case,  Kruskal’s  algorithm  will  select  e  first. 

10.  Let  d  be  the  lowest  weight  among  the  edges  incident  with  vertex  v.  The  first  time  that  an  edge  incident  with  v  is 
considered  for  selection  (because  it  has  least  weight  among  remaining  edges),  no  edge  incident  with  v  will  complete 
a  circuit  with  edges  previously  selected.  Always  seeking  edges  of  lowest  weight,  the  algorithm  must  select  an  edge 
of  weight  d. 

11.  (b)  If  the  weights  of  a  graph  with  n  vertices  are  distinct,  then  Kruskal's  algorithm  selects  at  each  stage  a  unique  edge 

of  lowest  weight.  Since  Kruskal’s  algorithm  yields  only  one  tree,  the  graph  has  only  one  minimum  spanning  tree, 
by  (a). 

12.  (b)  Each  time  a  vertex  is  relabeled,  the  component  to  which  it  belongs  contains  at  least  twice  as  many  vertices  as  before. 

Thus,  if  the  label  on  a  vertex  changes  1  times,  2‘  <  n;  so  t  <  log2n  as  required. 

(c)  Since  the  initial  sorting  of  edges  requires  0(N  log  N)  comparisons,  we  have  only  to  show  that  the  relabeling  process 
described  in  (a)  and  (b)  can  also  be  accomplished  within  this  bound.  By  (b),  the  total  number  of  vertex  relabelings 
is  0(n  logn).  If  n  <  N,  we  are  done.  If  n  =  N  +  1,  then  n  log n  <  (2 N)  log  N 2  =  4 N  log  N  and  again  we  are  done, 
except  when  n  =  1  or  2  when  no  steps  are  needed.  Finally,  if  N  <  n  —  2,  no  spanning  tree  is  possible  and  a  check 
for  this  could  be  included  at  the  beginning  of  the  algorithm. 

15.  (a)  The  graph  is  complete,  so  an  obvious  approach  is  to  try  to  choose  the  lowest  weight  available  edge  at  each  vertex. 
Such  a  cycle  is  ADEF  BC  A,  which  has  weight  2  +  3  +  2  +  54-  1  +  5=  18. 


(b)  As  shown  on  the  left,  the  minimum  weight  of  a  spanning  tree  after  A  is  removed  is  11.  The  two  edges  of  least  weight 
at  A  have  weights  2  and  3  so  we  obtain  an  estimate  of  1 1  +  2  +  3  =  16  as  a  lower  bound  for  the  weight  of  any 
Hamiltonian  cycle. 


D  D 

(c)  As  shown  on  the  right,  the  minimum  weight  of  a  spanning  tree  after  B  is  removed  is  12.  The  two  least  edges  at  B 
have  weights  I  and  5,  so  we  obtain  12+1+5=  18  as  a  lower  bound. 
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Exercises  12.5 

1.  (a)  This  is  acyclic:  F,  B,  D,  H,  A,  E,  C,G  is  a  canonical  labeling. 

3.  Let  Q  be  a  digraph  with  n  vertices  and  let  A  be  the  adjacency  matrix  of  Q.  The  indegree  of  vertex  i  is  the  sum  of  the 

entries  in  column  i.  This  requires  n  —  1  two-number  additions.  Repeating  for  n  vertices  involves  n(n  —  1)  =  (D(n2) 

additions. 

5.  (a)  We  use  the  notation  and  ideas  of  the  proof  of  Theorem  12.5.3.  If  Q  has  a  cycle,  then  we  can  construct  a  (possibly 

empty)  set  of  vertices  {ho,  tti, . . .  ,  u*}  such  that  some  <S  =  V  \  (vo,  «i ,  i>2 . %}  is  nonempty  and  has  the  property 

that  Sv  7^  0  for  all  i>  e  5;  that  is,  for  every  v  e  S,  there  exists  an  arc  of  the  form  xv  with  x  e  S.  Let  uq  e  S 
be  arbitrary  and  choose  u\  e  S  such  that  u\uo  is  an  arc.  Choose  «2  €  S  such  that  «2« i  is  an  arc.  If  a 2  =  «o. 
then  U2U\uq  is  a  cycle.  Otherwise,  choose  t/3  e  5  so  that  uyuj  is  an  arc.  In  general,  having  chosen  distinct  vertices 
«o,  mi,  . . .  ,  m*  e  S  such  that  m,+|M;  is  an  arc  for  i  =  0,  1, . .  .k  —  1,  choose  Uk+ 1  e  S  such  that  itk+\Uk  is  an  arc. 
Since  S  is  finite,  there  exists  k  such  that  Uk+ 1  =  ut  with  l  <  k.  Then  Uk+\UkUk-\  •  •  •  is  a  cycle. 

7.  Since  the  undirected  graph  is  a  tree  with  n  vertices,  T  has  n  —  1  arcs,  by  Theorem  12.2.3. 

9.  (a)  Here  are  the  distances  dt  and  the  corresponding  values  of  p, . 


11.  The  arc  vPtvt  is  the  last  on  a  shortest  path  which  the  algorithm  has  found  from  vo  to  vt.  Let  Q  be  the  digraph  which 
consists  of  precisely  those  arcs  (and  their  end  vertices).  Vertex  vq  has  indegree  0  since  there  is  no  arc  v/Vq  with 
i  >  0.  Every  other  vertex  in  Q  has  indegree  1  since  it  is  the  last  on  a  shortest  directed  path.  Suppose  Q  has  r  vertices. 
Then  it  has  r  —  1  edges,  one  for  each  of  its  vertices  other  than  vq.  By  Theorem  12.2.3,  it  suffices  to  show  that  Q 
is  connected.  This  will  certainly  be  the  case  if  we  show  that  every  vertex  of  Q  is  connected  to  vo  by  some  path  in 
Q.  Suppose  this  is  not  the  case  and  let  t  be  minimal  with  the  property  that  there  is  no  directed  path  from  vo  to  v, . 
Clearly  t  >  0.  Then  Q  contains  an  arc  vPl  v,,  the  last  on  a  directed  path  to  vt.  Since  pt  <  t,  there  is  a  directed  path 
from  vo  to  vPl ,  and  the  arc  vPl  vt  is  directed  vPl  -*■  v,.  Thus  there  is  a  path  from  vo  to  vt. 

13.  It  is  sufficient  simply  to  modify  Step  1  by  setting  dk  =  0  and  d ,  =  00  for  i  <  k.  (We  could  also  modify  Step  2  by 
starting  the  for  loop  at  t  =  k  +  1,  but  there  is  little  change  in  efficiency  unless  k  is  roughly  the  size  of  n.) 

14.  (a)  i.  Here  are  the  distances  d,  from  v\  and  the  corresponding 

values  of  pt. 


ii.  Here  are  the  distances  dt  from  V2  and  the  corresponding 
values  of  p,. 
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1.  (a)  The  final  backtracking  is  7.  2,  I 


5  4  17  5  4  17 
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if  ft  ?2  6n 


The  final  backtracking  16,  15,  14,  13,  12,  11,  6,  5,  4,  3,  2,  1. 


4.  (a)  Consider  the  status  of  the  algorithm  at  the  time  that  v  is  labeled.  If  there  are  no  unlabeled  vertices  adjacent  to  v  at 
this  stage,  we  have  nothing  to  prove.  So  we  assume  that  there  are  some.  Let  Q'  denote  the  subgraph  of  Q  consisting 
of  those  vertices  as  yet  unlabeled  which  are  connected  to  v  via  a  path  of  unlabeled  vertices,  and  all  the  edges  among 
these  vertices.  After  v  is  labeled,  the  algorithm  moves  to  an  unlabeled  vertex  adjacent  to  v  and  hence  into  the  subgraph 
Q'.  It  then  either  returns  to  v  or  moves  to  another  vertex  in  Q'  and  gives  it  a  label. 

Now  the  depth-first  search  algorithm  covers  any  edge  at  most  twice,  once  when  it  labels  a  vertex  and  once  when 
it  backtracks.  Since  there  are  only  finitely  many  edges  in  Q' ,  the  algorithm  must  eventually  backtrack  to  v.  If  there 
remain  any  adjacent  unlabeled  vertices,  the  algorithm  chooses  one  to  label  and  moves  again  into  Q' .  As  before,  it 
must  eventually  backtrack  to  v.  The  process  continues  until  all  vertices  adjacent  to  v  are  labeled. 


6. 


if  g  is 


,  one  spanning  tree,  namely 


,  is  not  obtainable  by  depth-first  search. 


7.  (a)  Each  time  the  procedure  terminates,  start  it  again  with  an  unlabeled  vertex.  The  number  of  connected  components  is 
equal  to  the  number  of  times  the  procedure  terminates. 

9.  No.  For  instance,  you  can  get  from  (7,  1,0)  to  (3,  5,  0)  in  one  step,  but  not  back. 

10.  Breadth-first  search  is  another  way  of  systematically  moving  through  all  the  vertices  of  a  graph.  Suppose  a  graph  has 
n  vertices.  The  procedure  assigns  to  these  vertices  labels  from  the  set  { 1 , 2, . . .  ,  n  j.  Assign  label  1  to  any  vertex.  Then 
label  all  vertices  adjacent  to  vertex  1  with  consecutive  labels  starting  at  2.  Then  pass  to  vertex  2  and  label  all  vertices 

adjacent  to  2  which  have  not  yet  been  labeled  with  consecutive  labels  from  the  unused  members  of  {1,2 . n }, 

always  using  the  smallest  available  integer  first.  Then  pass  to  vertex  3  and  so  on,  at  each  stage  moving  to  the  next 
highest  labeled  vertex  and  labeling  all  adjacent  vertices  not  yet  labeled  with  the  unused  members  of  {1,2,...  ,  n }, 
smallest  unused  integer  first.  Continue  until  some  vertex  acquires  label  n  or  all  vertices  adjacent  to  the  vertex  with 
the  highest  label  are  already  labeled. 

11.  (a) 


5  3  12 


Exercises  13.2 


1.  (a)  Not  strongly  connected;  edge  36  is  a  bridge. 


5 


-6- 

4 


1 


7 
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3.  (a)  A  strongly  connected  orientation  for  the  Petersen  graph  is  shown  to  the  right. 


5.  No,  it  cannot.  Every  edge  of  an  Eulerian  graph  is  part  of  a  circuit;  the  removal  of  an  edge  of  a  circuit  certainly  does 
not  disconnect  a  graph. 

6.  If  e  is  a  bridge,  we  can  let  u  and  v  be  its  ends.  If  there  were  a  path  from  u  to  v  which  did  not  require  e,  together  with 
e,  we  would  have  a  circuit  containing  e.  Since  the  deletion  of  an  edge  which  is  part  of  a  circuit  cannot  disconnect  a 
graph,  we  have  a  contradiction. 

7.  (a)  Choose  a  vertex  v.  For  each  vertex  u  adjacent  to  v,  orient  the  edge  uv  in  the  direction  u  -+  v. 

8.  Yes.  To  get  from  a  to  &  in  the  new  orientation,  just  find  the  path  from  b  to  a  in  the  old  orientation  and  follow  it  in 
reverse. 


9.  (a)  False.  The  graph  shown  at  the  right  cannot  be  given  a  strongly  connected  orientation,  yet 
every  edge  is  part  of  a  circuit. 


,A>,A> 


10.  (a)  If  there  were  a  path  V  between  u  and  v  in  Q  \  [e],  then  V  U  (e)  would  be  a  circuit,  contradicting  the  fact  that  e  is  a 
bridge. 


11.  Yes  it  can.  Cutting  all  the  bridges  divides  the  city  into  connected  components  without  bridges.  Each  component, 
therefore,  has  a  strongly  connected  orientation  (by  Theorem  13.2.3),  so  it  is  possible  to  assign  directions  to  the  streets 
of  each  component  which  allows  the  possibility  of  (legal)  travel  between  any  two  points.  Now  note  that  making  each 
bridge  a  two-way  street  allows  arbitrary  travel  between  components. 


Exercises  14.1 

1.  (a)  We  draw  the  graph  quickly  as  a  planar  and  then,  after  some  thinking,  as  a  plane  graph  with  straight  edges. 


(b)  There  are  seven  regions,  numbered  1,2 . 7,  with  boundaries  afg,  ghe,  hbi ,  icd,  bjc,  fedk ,  and  ajk,  respectively. 

(c)  E  =  11,  V  =  6,  R  =  7,  N  =  22;  so  V  —  E  +  R  =6  —  11  +7  =  2;  N  =  22  <  22  =  2E  and  E  =  1 1  <  12  =  3  V  —6. 
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3. 


5.  (a) 


(e) 


7. 


8.  (a) 

10.  (a) 

11. 


14.  (a) 


15. 

17.  (b) 


19.  (a) 

20.  (a) 

21.  (a) 


Solid 

V 

E 

F 

V-E  +  F 

tetrahedron 

4 

6 

4 

2 

cube 

8 

12 

6 

2 

octahedron 

6 

12 

8 

2 

dodecahedron 

20 

30 

12 

2 

icosahedron 

12 

30 

20 

2 

This  is  planar.  Here  it  is  drawn  as  a  plane  graph  (with  straight  edges). 


This  is  not  planar.  There  is  a  subgraph  homeomorphic  to  AC3  3  as  shown. 


We  know  from  Theorem  14.1.4  that  £  <  3L  —  6.  Substituting  in  E  =  V  +  R  —  2,  we  obtain  V  +  R  —  2  <  3L  —  6, 
or  R  <  2V  —  4,  as  required. 

(Ai  E  =  3  =  3(3)  —  6  =  3L  -6. 

K.„  is  planar  if  and  only  if  n  <  4. 

Introduce  B 


12  12 


Assume  the  result  is  not  true.  Then  there  is  some  counterexample  Q  and  subgraph  H;  that  is,  for  these  graphs, 
E\  —  V\  <  E 2  —  V2-  (In  particular,  V\  ^  V2 - )  Choose  H  such  that  V\  —  V->  is  as  small  as  possible.  Since  Q  is 
connected,  we  can  find  a  vertex  v  which  is  in  Q ,  but  not  'H,  which  is  joined  to  some  vertex  in  H.  Let  K.  be  that 
subgraph  of  Q  consisting  of  H,  v  and  all  edges  joining  v  to  vertices  in  H.  Letting  V3  and  E 3  denote  the  numbers  of 
vertices  and  edges,  respectively,  in  1C ,  we  have  V3  =  V2  +  L  while  £3  >  £2  +  L  Hence,  £2  —  Vl  S  £3  —  V3  and  so 
£1  —  Vi  <  £3  —  V3.  Thus,  Q  and  its  subgraph  K,  provide  another  counterexample,  but  this  contradicts  the  minimality 
of  V\  —  Vi  since  Vj  —  V3  <  V\  —  Li. 

Yes.  An  example  is  shown  to  the  right.  The  graphs  are  homeomorphic  since  the  one 
on  the  right  is  obtainable  from  the  other  by  adding  a  vertex  of  degree  2. 

False,  as  shown  by  the  homeomorphic  graphs  at  the  right.  The  graph  on  the  left  is 
Hamiltonian,  but  the  one  on  the  right  is  not. 

Let  G\,Gi,...  ,  Qn  be  the  connected  components  of  Q.  Since  Q,  has  at  least  three  vertices,  we  have  Egt  <  3Vgi  —  6. 
Hence,  J2  E Qi  —  3  Vgi  —  6n,  so  £  <  3L  —  6 n  as  required. 

We  may  assume  that  Q  is  connected.  Say  there  is  only  one  vertex  of  degree  at  most  5.  Then  W  deg  1;,  >  6(L  —  1)  = 
6L  —  6,  contradicting  =  2£  <  6L  —  12. 

By  Theorem  14.1.4,  £  <  3L  -  6,  so  £  <  3(20)  -  6  =  54. 
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22.  (a)  Say  at  most  one  region  has  at  most  five  edges  on  its  boundary.  Then,  with  N  as  in  the  proof  of  Corollary  14.1.3, 

N  >  6{R  —  1).  But  N  <  2 E,  so  2 E  >  6R -6,  3 R  <  E  +  3.  Since  V  -  E  +  R  =  2,  6  =  3V -3E  +  3R  <  3V -2£  +  3; 

that  is,  2E  <  3V  —  3.  But  2E  =  £]degu,  >  3V  by  assumption,  and  this  is  a  contradiction. 

24.  Letting  x  denote  the  number  of  connected  components  of  Q,  we  have  V  —  E  +  R  =  \  +  x. 

Proof.  For  each  component  C ,  Vc  —  Eq  +  Rc  =  2.  Adding,  we  get  J^Vc  —  J2  Ec  +  J2  Rc  =  2x.  We  have 
Vc  =  V  and  Eq  =  £,  but  ^  7?c  =  /?  +  (jc  —  1 )  since  the  exterior  region  is  common  to  all  components.  Thus, 
V  —  E+R+x— \  =  2x,  V  —  E  +  R=  x+  \. 


Exercises  14.2 


2.  (a) 


(b) 


We  show  the  graph  superimposed  over  the  given  map.  Since  this  graph  contains  triangles, 
at  least  three  colors  are  necessary.  A  3-coloring  is  shown,  so  the  chromatic  number  is  3. 


False!  The  Four-Color  Theorem  says  that  the  chromatic  number  of  a  planar  graph  is  at  most 
(a)  has  chromatic  number  3. 


3. 


4.  (b) 


(e) 


6.  (a) 


In  neither  case,  will  four  colors  necessarily  suffice  as  the  pictures  to  the 
right  demonstrate. 


A  5-coloring  is  shown;  hence,  -  5-  Since  K. 5  is  a  subgraph,  x(Q) 


A  4-coloring  is  shown;  hence,  x(S)  5  4.  Since  Q  contains  1C 4  as  a  subgraph.  x(&)  =  4. 


The  graph  Q\  on  the  left  has  a  3-coloring,  as  shown.  Since  it  contains  a 
triangle,  x(Gi)  =  3-  The  graph  Q2  on  the  right  has  a  3-coloring,  as 
shown.  By  trying  to  label  alternately  the  vertices  of  the  outer  pentagon, 
we  see  that  two  colors  will  not  suffice,  so  x(Gi)  =  3  too. 


1 


1 


(b)  The  converse  of  the  Four-Color  Theorem  states  that  if  the  chromatic  number  of  a  graph  is  at  most  four,  then  the 
graph  is  planar.  This  result  is  false.  The  Petersen  graph  is  not  planar  (Exercise  4  of  Section  14.1),  but,  as  we  saw  in 
part  (a),  x  =  3. 
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7.  (a) 

9.  (a) 

(c) 

(d) 

(e) 

10.  (a) 

11.  (a) 

14.  (a) 

(d) 

15. 

18. 

19. 

24.  (a) 

25.  (a) 


26. 


Yes,  a  tree  is  planar  and  we  can  prove  this  by  induction  on  n,  the  number  of  vertices.  Certainly  a  tree  with  one  vertex 
is  planar.  Then,  given  a  tree  with  n  vertices,  removing  a  vertex  of  degree  1  (and  the  edge  with  which  it  is  incident) 
leaves  a  tree  with  n  —  1  vertices  which  is  planar  by  the  induction  hypothesis.  The  deleted  vertex  and  edge  can  now 
be  reinserted  without  destroying  planarity. 

(b) 


The  graph  isn’t  planar,  by  Kuratowski’s  Theorem,  as  the  results  of  (a)  and  (b)  each  illustrate. 

A  3-coloring  is  shown.  Since  the  graph  contains  triangles,  fewer  than  three  colors  will 
not  suffice.  The  chromatic  number  is  three. 


2  1 

The  converse  of  the  Four-Color  Theorem  says  that  a  graph  with  x  <  4  is  planar.  This  is  not  true,  as  this  graph 
illustrates:  The  chromatic  number  is  3,  but  the  graph  is  not  planar. 

By  Pause  7  of  this  section,  for  any  n,  x(/C„)  =  n  and  for  any  m,  n,  x(K-m.n)  =  2.  Thus,  x(/Ci4)  =  14  and 
X(7Qi,i4)  =  2. 

False.  If  Q  is  a  5-cycle,  then  x(G)  —  3  since  5  is  odd  but  Q  contains  no  triangle. 

False.  1C 3,3  is  Hamiltonian  and  has  chromatic  number  two  (by  Pause  7)  but  is  not  planar. 

False.  K, 7  is  Eulerian  (since  the  degree  of  every  vertex  is  six)  and  Hamiltonian  (since  every  complete  graph  is 
Hamiltonian),  yet  x(^-i)  =  7. 

Three  exam  periods  are  required  since  this  is  the  chromatic  number  of  the  graph. 

We  draw  a  graph  Q.  The  vertices  are  courses,  and  two  vertices  are  joined  if  somebody  is  taking  both  courses.  Since 
XiQ)  =  4,  four  time  periods  are  required. 

This  means  we  can  delete  the  edge  joining  “Econ”  to  “Stat,"  so  “Stat”  can  be  colored  3,  x(C?)  =  3  and  three  time 
periods  now  suffice. 

We  assume  Q  has  more  than  three  vertices  for  otherwise  there  is  nothing  to  prove.  Let  T  be  a  spanning  tree  for  Q. 
By  Exercise  7(c)  we  know  that  x(7”)  =  2.  Since  Q  has  n  vertices,  Q  —  T  U  {e}  for  some  edge  e.  If  the  ends  of  e 
have  different  colors  in  T,  then  x(G)  =  2;  otherwise,  one  of  these  ends  must  be  given  a  third  color  and  x(f?)  =  3. 
In  either  case,  x(G)  <  3  as  required. 

The  dual  of  the  cube  is  the  octahedron  whose  graph  appears  in  Fig  14.5. 

Yes,  it  is.  Both  the  tetrahedron  and  its  dual  are  K.4. 


Exercises  14.3 


1.  (a) 


(b) 

(d) 


36  nodes,  14  grid  segments,  6  nets. 


V\  and  V3,  for  example. 


(c)  V, 


We  know  x(G)  >  4  since  Q  contains  K, 4  (vertices  H ,  V2,  V3,  V4).  On  the  other  hand,  a  4-coloring  is  given  by 
{ Vi ,  V3),  {V2,  V5 },  {F4I,  {//).  Hence,  x(*3)  =  4.  The  above  4-coloring  is  also  a  partition  of  the  nets. 
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3.  False.  An  example  is  the  net  pattern  .  .  ^  for  which  the  line-of-sight  graph  is  o  o. 

4.  fa)  ~  need  not  be  reflexive.  Consider  *  * 


7.  False.  The  graph  in  Exercise  5  isn’t  planar.  The  subgraph  with  vertices  N i,  (V3,  N4,  1V5,  Nf,,  N-j  and  with  edge  N3N4 
removed  is  isomorphic  to  £3,3. 

8.  (a)  Imagine  the  possible  shorts  between  nets  as  the  edges  in  a  graph:  They  won’t  cross  since  they're  all  parallel.  Now 

compress  each  net  to  a  point,  and  we  have  a  plane  graph.  By  the  Four  Color  Theorem,  xU?)  5  4. 


(b)  Draw  separate  graphs  Q(V,  £3),  <J(V,  £2)  for  horizontal  lines  of  sight  fas  in  (a)]  and  vertical  lines  of  sight.  These 
graphs  have  the  same  vertex  set,  so  combine  them.  If  an  edge  is  repeated  (that  is,  both  horizontal  and  vertical  lines 
exist  between  nets),  then  omit  one  of  the  original  occurrences  of  it. 


9. 


10. 


It  is  possible  for  a  short  to  exist  between  any  pair  of  nets  here.  Hence,  Q  =  and  x(Q)  =  as  required. 


H  cannot  be  2-colored  since  it  is  a  cycle  of  odd  length.  If  we  delete  any  vertex,  however, 
the  resulting  graph  can  be  2-colored.  If  we  delete  e.  for  example,  we  obtain  the  graph  at 
the  right.  Shown  are  Q'  and  a  corresponding  floor  plan. 


13.  (a)  We  can  immediately  draw  Q \  for  this  problem  (using  the  obvious  Hamiltonian  cycle  that  a  golfer  must  follow).  Both 
Q 1  and  H  are  shown. 


Since  H  contains  a  triangle,  xW)  =  3,  so  such  a  course  is  impossible. 


14.  Deleting  vertex  2  (and  the  three  edges  incident  with  2)  leaves  /C5. 

15.  Since  the  graph  is  not  /C„  for  any  n,  nor  an  odd  cycle,  Brooks's  Theorem  says  x(S)  <  A(Q).  Here  A(Q)  =  3,  so  we 
have  x(G)  5  3.  But  this  graph  contains  cycles  of  odd  length,  so  x(0)  >  3.  We  conclude  that  x(G)  =  3. 
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Exercises  15.1 

1.  (a)  At  a,  f va  =  f.v«  =  2  and  Et,  =  f«c  +  foe  =  2  +  0  =  2.  At  e,  Et.  =  f«e  +  =0+1  =  1  and 

El.  =  f«c  +  fed  =  0  +  1  =  I.  At  d,  Ev  f»d  =  f6<i  +  fed  =  3  +  1  =  4  and  E„  fdv  =  fd/  =  4. 

(b)  The  value  of  the  flow  is  6. 

(c)  The  capacity  of  the  cut  is  cac  +  cae  +  c  be  +  Cbd  =  3+1+4  +  3=11. 

(d)  No.  Arc  dt  is  saturated. 

(e)  The  flow  is  not  maximum.  For  instance,  it  can  be  increased  by  adding  1  to  the  flow  in  the  arcs  along  sact. 

(f)  (fflc  —  ffo)  +  (ffie  -  f«i)  +  (f be  ~  f eb)  +  (f bd  ~  f db)  =  (2  —  0)  +  (0  —  0)  +  (1  —  0)  +  (3  —  0)  =  6. 

2.  (b)  The  value  of  the  flow  is  14. 

3.  (c)  cac  +  Che  +  cdr  =  3  +  9  +  9  =  21. 

5.  (a)  i.  Send  one  unit  through  the  path  sbet. 

ii.  The  flow  in  (a)  has  a  saturated  arc,  be. 

iii.  Here  is  a  maximum  flow,  of  value  6.  (Did  you  get  it?)  To  see  that  the 
flow  is  maximum,  consider  the  cut  S  =  {+  a ,  b ),  T  =  {c,  d.  e,  /,  t ). 


c 


7.  Let  (Si.Ti)  be  any  cut.  By  Theorem  15.1.4,  vaU^)  <  cap(Si.Tj).  But  val(:F)  =  cap(S.  T).  So  cap(5.  T)  < 
cap(<S| ,  T\).  Since  (<S| .  T\ }  was  an  arbitrary  cut.  the  result  is  proven. 


a  3, 3  d 


Exercises  15.2 


1.  (a)  i.  One  flow-augmenting  chain  is  sbadt  in  which  the  slack  is  I. 

ii.  Here  is  a  maximum  flow,  of  value  7.  We  can  see  this  is  maximum  by 
examining  the  cut  S  =  (.?,  a,  b),  T  =  {c,  d.  e ,  /,  t  j,  of  capacity 
csc  +  cad  +  Cbe  =  3  +  3  +  1  =  7,  the  value  of  the  flow. 


(d)  i.  One  flow-augmenting  chain  is  sacdfgt ,  which  has  slack  1. 

ii.  Here  is  a  maximum  flow,  of  value  20.  We  can  see  this  is  maximum 
by  examining  the  cut  S  =  {.v,  a.  b.  c},  T  =  [d,  e.  /,  g.  h.  r).  This  has 
capacity  cse  +  c +  c,,/  =  7  +  4  +  9  =  20.  the  value  of  the  flow. 


3.3  d 


a 


e  6, 6  /  7. 7  g 
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2.  (a) 


3.  (a) 


A  flow  of  value  6  is  given  in  Exercise  5(a)  of  Section  15.1.  To  see  this  is  maximum,  consider  the  cut  S  =  {.v,  a.  b ), 
T  =  (c,  d,  e,  /,  r}. 


To  see  that  the  pictured  flow  is  maximum,  consider  the  cut  S  =  j.v,  b }, 
T  =  {a,c,d,e,f,t}. 


a  7,4  c  11,4  e 


4.  (a)  We  multiply  the  capacities  by  10,  the  greatest  common  divisor  of  the  denominators,  and  find  a  maximum  flow  in  the 
new  network,  as  shown  on  the  left.  Then,  dividing  by  10,  we  find  a  maximum  flow  in  the  given  network,  of  value 
67/10,  as  shown  on  the  right.  (Only  the  flow  in  each  arc  is  shown.) 


Exercises  15.3 


1. 


The  demand  cannot  be  met;  the  best  that  can  be  done  is  as  shown.  Warehouse  b  sends  out  20  Klein  bottles  and  the 
others  send  out  none.  Five  go  to  retail  outlet  d  and  15  go  to  e. 

3.  The  needs  of  the  retail  outlets  cannot  be  met.  They  can  receive  at  most  14  units  a  month,  as  shown.  To  see  that  the 
flow  there  is  a  maximum,  consider  the  cut  in  which  T  =  («.  F,  G.  ?!  and  S  is  the  set  of  all  other  vertices. 
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5.  (a)  A  maximum  flow  is  shown,  with  direction  added  to  the  undirected  edges. 

5 


7.  Four  messengers  can  be  sent. 

9.  Let  u  and  v  be  two  vertices  of  K.„.  If  x  is  any  other  vertex,  u,  x,  v  is  a  path  from  u  to  v,  while  u,  v  is  also  a  path. 
These  n  —  1  paths  are  edge  disjoint.  Also,  there  cannot  be  more  than  n  —  1  paths  from  u  to  v  since  deg  u  =  n  —  1  and 
edge  disjoint  paths  must  start  with  different  edges.  Thus,  the  maximum  number  of  edge  disjoint  paths  between  u  and 
v  is  n  —  1.  But  this  is  also  the  minimum  number  of  edges  which  must  be  removed  in  order  to  sever  all  paths  from  it 
to  i>  (in  accordance  with  Menger’s  Theorem).  To  see  this,  note  that  deleting  all  edges  incident  with  u  certainly  severs 
all  paths,  so  at  most  n  —  1  edges  need  be  deleted.  On  the  other  hand,  if  we  remove  fewer  than  n  —  1  edges,  then 
some  path  from  u  to  v  remains  because  we  have  already  exhibited  n  —  1  edge  disjoint  paths. 


Exercises  15.4 


1. 


3. 

4.  (a) 

6.  (a) 

7.  (a) 

9.  (a) 

10.  (a) 

12.  (a) 


14. 


Karen  Mary  Aurie  Pamela  Lynne 

One  possible  matching  is  shown. 

David  Stuart  Paul  Roger  Philip 

{12,  34,  56,  78}  is  a  perfect  matching. 

Let  X  =  (Angela,  Brenda,  Christine.  Margaret,  Phyllis,  Renee).  Then  A(X)  =  {A,  B,  C,  E ,  F],  so  |X|  >  jA(X)|  and 
the  condition  fails. 

Bruce  -<->•  Maurice,  Edgar  Michael,  Eric  Roland,  Herb  •<->  Richard. 

Construct  a  bipartite  graph  with  vertex  sets  Vi  and  VS,  where  Vi  has  n  vertices  corresponding  to  A|, . . .  ,  A„,  V2  has 
one  vertex  for  each  element  of  S  and  there  is  an  edge  joining  A,  to  .v  if  and  only  if  ,v  e  A-,.  Given  a  subset  X  of 
V\,  the  set  A (X)  (notation  as  in  the  text)  is  precisely  the  set  of  elements  in  (JA.gX  Ax.  Thus  this  question  is  just  a 
restatement  of  Hall’s  Marriage  Theorem. 

Exchange  the  roles  of  V\  and  V2  in  Theorem  15.4.2.  It  may  be  possible  to  find  a  matching  which  saturates  V2-  Here’s 
a  necessary  and  sufficient  condition  for  this  to  happen:  If  X  is  any  subset  of  V2  and  A(X)  is  the  set  of  all  vertices 
of  V)  which  are  adjacent  to  some  vertex  of  X,  then  |A(X)|  >  |X|. 

K.m<n  has  a  perfect  matching  if  and  only  if  m  =  n.  To  see  this,  first  assume  that  m  =  n  and  let  the  vertex  sets  be 

Vi  =  [U] ,  «2 . um)  and  V2  =  [v\,  V2 . vm}.  Then  [u\ iq,  M2P2, . . .  ,  umvm }  is  a  perfect  matching.  Conversely, 

say  we  have  a  perfect  matching  and  m  <  n.  Since  each  edge  in  a  matching  must  join  a  vertex  of  V]  to  a  vertex  of 
V2,  there  can  be  at  most  m  edges.  If  m  <  n,  some  vertex  in  VS  would  not  be  part  of  any  edge  in  the  matching,  a 
contradiction.  Thus,  m  =  n. 

By  Proposition  15.4.3,  if  has  a  perfect  matching,  then  n  must  be  even.  Conversely,  assume  that  n  =  2m  is  even 
and  label  the  vertices  « 1 ,  «2,  •  •  •  , 11 2m-  Since  every  pair  of  vertices  is  joined  by  an  edge,  [u\U2-  U3U4,  •  •  •  ,  U2m-lU2m  1 
will  give  us  a  perfect  matching. 


No.  The  graph 


has  |V|  even,  but  it  has  no  perfect  matching. 


15. 


No.  The  graph 


has  a  perfect  matching,  but  each  vertex  has  degree  2,  which  is  less  than  i|V|  =  3. 


Glossary 


Without  a  firm  grasp  of  vocabulary,  it  is  impossible  to  speak,  let  alone  understand,  a  language.  The  same  applies  to  any 
science.  If  you  aren't  comfortable  with  its  terminology,  you  can’t  expect  to  understand  the  subject. 

What  follows  is  a  vocabulary  list  of  all  the  technical  terms  discussed  in  this  book  together  with  the  page  where  each  was 
first  introduced.  In  most  cases,  each  definition  is  followed  by  an  example. 


A 

absolute  value  The  absolute  value  of  a  number  x,  denoted 
|jc[,  is  defined  to  be  x  if  x  >  0  and  —  x  if  x  <  0.  For 
example,  |0|  =  0,  |5|  =  5  and  |  —  5|  =  5.  p.  76 

acyclic  A  graph  is  acyclic  if  it  contains  no  cycles,  p.  369 

adjacency  matrix  The  adjacency  matrix  of  a  graph  with  n 
vertices  v\,  V2,  ■ . .  ,  v„  is  the  n  xn  matrix  A  =  [<2,y]  whose 
(i,  j)  entry  a,y  is  l  if  vertices  v,  and  vj  are  adjacent  and 
otherwise,  a/j  =  0.  p.  318 

adjacent  Two  vertices  t>,  w  of  a  pseudograph  are  adjacent 
if  (v,  wj  is  an  edge.  p.  286 

algorithm  An  algorithm  is  a  procedure  for  carrying  out 
some  process.  Its  ingredients  are  an  input,  an  output, 
and  a  sequence  of  precise  steps  for  converting  the  input 
to  the  output.  For  example,  the  method  by  which  a 
perpendicular  to  a  line  can  be  constructed  at  a  speci¬ 
fied  point  is  a  familiar  algorithm  of  Euclidean  geometry. 
p.  240 

antisymmetric  A  binary  relation  ~  on  a  set  A  is  antisym¬ 
metric  if  a  ~  b  and  b  ~  a  imply  a  =  b.  For  example,  < 
is  antisymmetric  on  R  but  ~  defined  by  a  ~  b  if  ab  >  0 
is  not.  p.  53 

arc  An  arc  is  a  directed  edge  in  a  digraph.  p.  342 

argument  An  argument  is  a  finite  collection  of  statements 
Ai,A2,-..  ,  A„  called  premises  or  hypotheses  followed 
by  a  statement  23  called  the  conclusion.  Such  an  argument 
is  valid  if,  whenever  A  i ,  Ai, . .  ■  ,An  are  all  true,  then  23 
is  also  true.  p.  28 

arithmetic  sequence  The  arithmetic  sequence  with  first 
term  a  and  common  dijference  d  is  the  sequence  a , 
a+d ,  a+2d . For  example,  11,8,5,2,...  is  an  arith¬ 

metic  sequence  with  first  term  1 1  and  common  difference 
—3.  p.  166 

B 

base  b  representation  The  base  b  representation  of  a  natu¬ 
ral  number  N  is  the  expression  (an-\an-2  ■  •  •  no )b,  where 

the  integers  ao,  a  \ . a„~\  are  all  in  the  range  0  <  a,-  < 

b  and  N  =  an~ibn~l  +  an-2bn ~2  +  •  •  •  +  a\b  +  ao-  For 


example,  100  =  (10201)3  because  100  =  l(34)  +  0(33)  + 
2(32)  +  0(31)+ 1(3°).  p.  102 

Big  Oh  For  functions  /,  g :  N  — ►  R,  we  say  that  /  is  Big 
Oh  of  g,  written  /  =  0(g),  if  there  is  an  integer  no  and 
a  positive  real  number  c  such  that  \f{n)\  <  cjg(n)|  for 
all  n  >  wo-  For  example.  Inn  =  <D(n).  p.  247 

bijection  A  bijection  is  a  one-to-one  onto  function.  For 
example,  n  h*  2 n  defines  a  bijection  from  Z  to  21.  p.  73 

binary  relation  A  binary  relation  from  a  set  A  to  a  set 
B  is  any  subset  of  A  x  B.  A  binary  relation  on  A  is  a 
binary  relation  from  A  to  A.  For  example,  {(1,  a),  (2,  a)) 
is  a  binary  relation  from  N  to  the  letters  of  the  English 
alphabet,  and  {(x.y)  e  R2  |  x2  +  y2  =  1 )  is  a  binary 
relation  on  R.  p.  51 

bipartite  graph  A  bipartite  graph  is  a  graph  whose  vertices 
can  be  partitioned  into  two  disjoint  sets  in  such  a  way  that 
every  edge  joins  a  vertex  in  one  set  to  a  vertex  in  the  other. 

p.  288 

bipartition  set  The  bipartition  sets  of  a  bipartite  graph  are 
the  two  disjoint  sets  into  which  the  vertices  are  partitioned 
so  that  any  edge  joins  a  vertex  in  one  set  to  a  vertex  in 
the  other.  p.  288 

bridge  An  edge  e  of  a  connected  graph  Q  is  a  bridge  if  and 
only  if  the  subgraph  Q  \  (ej  is  not  connected,  p.  406 

C 

canonical  labeling  A  labeling  no,  ifi , . . .  >  vn- 1  of  the  ver¬ 
tices  of  a  digraph  is  canonical  if  every  arc  is  of  the  form 
Vj  Vj  with  i  <  j.  p.  392 

capacity  The  capacity  of  an  (,s,  f)-cut  {5,  T]  (s  e  S,  t  e 
T)  is  the  sum  of  the  capacities  of  all  arcs  from  S  to  T. 
p.  444 

cardinality  The  cardinality  of  a  finite  set  is  the  num¬ 
ber  of  elements  in  the  set.  While  this  book  does  not 
define  the  term  cardinality  for  infinite  sets,  it  does  dis¬ 
cuss  the  concept  of  same  cardinality  (see  definition). 
pp.  88  and  89 

Cartesian  product  The  Cartesian  product  of  sets  A  and 
B — denoted  AxB  or  A2  if  B  =  A — is  the  set  of  ordered 
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pairs  (a,  b),  where  a  e  A  and  b  6  B.  For  example, 
{1, 2}  x  {x}  =  {(1,  x),  (2,  x)}.  p.  48 
ceiling  The  ceiling  of  a  real  number  x,  denoted  {x"|,  is 
the  least  integer  greater  than  or  equal  to  x.  For  example, 
r3|l  =  4,  roi  =  0  and  r~3|l  =  -3.  p.  76 
chain  A  chain  is  a  trail  in  a  directed  network,  that  is,  a 
walk  in  which  the  arcs  are  distinct  and  can  be  followed 
in  either  direction.  p.  448 

chromatic  number  The  chromatic  number  of  a  graph  is  the 
smallest  natural  number  n  for  which  an  n -coloring  exists. 
p.  423 

circuit  A  circuit  in  a  pseudograph  is  a  closed  trail. 
p.  303 

coloring  A  coloring  of  a  graph  is  an  assignment  of  col¬ 
ors  to  vertices  so  that  adjacent  vertices  have  different 
colors.  An  n-coloring  is  a  coloring  which  uses  n  colors. 
p.  423 

combination  A  combination  of  objects  is  a  subset  of  them. 
An  r-combination  is  a  subset  of  r  objects.  For  example, 
{Charles,  Andrew}  is  a  2-combination  of  the  members  of 
the  British  royal  family,  p.  218. 
common  difference  See  the  definition  of  arithmetic 
sequence. 

common  ratio  See  the  definition  of  geometric  sequence. 
comparable  Two  elements  a  and  b  are  comparable  with 
respect  to  a  partial  order  <  if  and  only  if  either  a  <  b 
or  b  <  a.  For  example,  with  respect  to  “divides”  on  N,  2 
and  6  are  comparable,  but  2  and  7  are  not.  p.  64 
complement  The  complement  of  a  set  A,  written  Ac,  is 
the  set  of  elements  which  belong  to  some  universal  set, 
defined  by  the  context,  but  which  do  not  belong  to  A.  For 
example,  if  the  universal  set  is  R,  then  {x  |  x  <  1}C  = 
(x  |  x  >  1}.  p.  45 

complete  bipartite  The  complete  bipartite  graph  on  two 
sets  Vi  and  V2  is  that  bipartite  graph  whose  vertices  are 
the  union  of  Vi  and  V2  and  whose  edges  consist  of  all 
possible  edges  between  these  sets.  p.  288 
complete  graph  The  complete  graph  on  n  vertices  is  that 
graph  which  has  n  vertices,  each  pair  of  which  are  adja¬ 
cent.  p.  288 

complexity  The  complexity  of  an  algorithm  is  a  function 
which  gives  an  upper  bound  for  the  number  of  operations 
required  to  carry  out  the  algorithm.  It  is  usually  specified 
in  general  terms,  by  giving  another  function  of  which  it 
is  Big  Oh.  The  binary  search  algorithm,  for  example,  has 
complexity  O(log2  n).  p.246 

component  A  component  of  a  graph  is  a  connected  sub¬ 
graph  which  is  properly  contained  in  no  connected  sub¬ 
graph  with  a  larger  vertex  set  or  larger  edge  set.  p.  310 
composite  A  composite  number  is  a  natural  number  larger 
than  1  which  is  not  prime;  for  example,  6  and  10. 
p.  114 


composition  The  composition  of  functions  / :  A  — >  B  and 
g:  B  — x  C  is  the  function  g  o  f :  A  — >  C  defined  by 
g  o  f(a)  =  g(f(a))  for  a  £  A.  For  example,  if  f  and  g 
are  the  functions  R  — »•  R  defined  by  /(x)  =  x  +  2  and 
g(x)  =  2x  —  3,  then  g  o  f(x)  =  2x  +  1 .  p.  82 
conclusion  See  argument. 

congruence  class  The  congruence  class  of  an  integer  a 
(mod  n)  is  the  set  {b  €  Z  |  a  =  b  (mod  «)}  of  all  integers 
to  which  a  is  congruent  mod  n\  for  example,  the  congru¬ 
ence  class  of  5  (mod  7)  is  the  set  7Z  +  5  of  integers  of 
the  form  Ik  +  5.  p.  126 

congruence  mod  n  Integers  a  and  b  are  congruent  modulo 
n ,  where  n  >  1  is  a  natural  number — and  we  write  a  =  b 
(mod  n) — if  a—b  is  divisible  by  n.  For  example,  —2=10 
(mod  4),  but  7^—19  (mod  12).  p.  126 
connected  A  pseudograph  is  connected  if  there  is  a  walk 
between  any  two  vertices,  p.  305 
contradiction  A  contradiction  is  a  compound  statement 
that  is  always  false.  For  example,  ((-•p)  Aq)  A(p\>  (—■£/)) 
is  a  contradiction.  (Examine  the  truth  table.)  p.  20 
contrapositive  The  contrapositive  of  the  implication  “72  — > 
q”  is  the  implication  ->  (->/?).”  For  example,  the 

contrapositive  of  “If  a  graph  is  planar,  then  it  can  be  col¬ 
ored  with  at  most  four  colors"  is  “If  a  graph  cannot  be 
colored  with  at  most  four  colors,  then  it  is  not  planar.” 
p.  5 

converse  The  converse  of  the  implication  p  — x  q  is  the 
implication  q  — ►  p.  For  example,  the  converse  of  “If  a 
graph  is  planar,  then  it  can  be  colored  with  at  most  four 
colors”  is  "If  a  graph  can  be  colored  with  at  most  four 
colors,  then  it  is  planar.”  p.  4 
countable  A  set  is  countable  if  and  only  if  it  is  finite  or 
countably  infinite.  For  example,  the  rational  numbers  and 
the  set  of  natural  numbers  not  exceeding  1000  are  both 
countable  sets.  p.  90 

countably  infinite  A  countably  infinite  set  is  an  infinite  set 
which  has  the  cardinality  of  the  natural  numbers.  For 
example,  the  rational  numbers  form  a  countably  infinite 
set.  p.  90 

cut  An  (s,  t)-cut  in  a  directed  network  with  vertex  set  V, 
two  of  whose  members  are  s  and  t .  is  a  pair  of  dis¬ 
joint  sets  S  and  T,  x  e  S,t  €  T,  whose  union  is  V. 
p.  444 

cycle  A  cycle  in  a  pseudograph  is  a  circuit  in  which  the 
first  vertex  appears  exactly  twice  (at  the  beginning  and 
the  end)  and  in  which  no  other  vertex  appears  more  than 
once.  An  n-cycle  is  a  cycle  with  n  vertices,  p.  303 

D 

degree  The  degree  of  a  vertex  in  a  pseudograph  is  the  num¬ 
ber  of  edges  incident  with  that  vertex.  p.  286 
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degree  sequence  The  degree  sequence  of  a  pseudograph  is 
a  list  of  the  degrees  of  the  vertices  in  nonincreasing  order. 
p.  290 

derangement  A  derangement  of  n  distinct  symbols  which 
have  some  natural  order  is  a  permutation  of  them  in  which 
no  symbol  is  in  its  correct  position.  For  example,  312  is 
a  derangement  of  1 , 2,  3.  p.  228 
digraph  A  digraph  is  a  pair  (V,  £)  of  sets,  V  nonempty  and 
each  element  of  £  an  ordered  pair  of  distinct  elements  of 
V.  The  elements  of  V  are  called  vertices,  and  the  elements 
of  £  are  called  arcs.  p.  342 
direct  product  This  is  another  name  for  the  Cartesian 
product ,  whose  definition  appeared  previously.  p.  48 
directed  network  A  directed  network  is  a  digraph  with  an 
integer  weight  assigned  to  each  arc.  p.  360 
disjunctive  normal  form  A  compound  statement  based  on 
variables  x\ ,  X2,  .  ■  ■  ,  xn,  n  >  1,  is  in  disjunctive  normal 
form  if  and  only  if  it  has  the  form  (a n  Aa\2  A-  •  •  V 
(a2\  022  A  •  ■  •  A  ajn)  V  •  ■  •  V  (am\  A  (lm 2  A  •  •  ■  A  af,ln ), 

where,  for  each  i  and  /,  1  <  i  <  m,  1  <  j  <  n,  either 
ajj  =  xj  or  atj  =  -*Xj  and  all  minterms  anAa^A-  ■  -Aa,,, 
are  distinct.  For  example,  the  statement  (x[  Axj  A  *3)  v 
(x\  A  (-•X2)  A  ( — ‘JC3))  is  in  disjunctive  normal  form  on  the 
variables  X] ,  X2,  *3.  p.  26 

divides  Given  integers  a  and  b,b  0,  we  say  that  b  divides 
a — written  b  |  a — and  say  that  b  is  a  factor  or  a  divisor  of 
a  if  and  only  if  a  —  qb  for  some  integer  q.  For  example, 
3  1 18  but  5  1 18.  p.  104 
divisor  See  the  definition  of  divides. 
domain  The  domain  of  a  function  /:  A  — >  B  is  the  set  A. 
p.  73 

E 

edge  An  edge  in  a  graph  is  a  set  of  two  distinct  vertices. 

p.  286 

equivalence  class  If  ~  denotes  an  equivalence  relation  (see 
definition)  on  a  set  A  and  a  e  A,  then  the  equivalence 
class  of  a  is  the  set  of  all  elements  x  e  A  with  x  ~  a.  For 
example,  for  the  equivalence  relation  “congruence  mod 
2”  on  Z,  the  equivalence  class  of  17  is  the  set  of  all  odd 
integers.  p.  57 

equivalence  relation  An  equivalence  relation  on  a  set  is  a 
binary  relation  which  is  reflexive,  symmetric,  and  transi¬ 
tive.  For  example,  congruence  modulo  a  natural  number 
n  >  1  is  an  equivalence  relation  on  Z  but  divides  is  not. 
p.  56 

Eulerian  circuit  An  Eulerian  circuit  in  a  pseudograph  is  a 
circuit  which  contains  every  vertex  and  every  edge  of  the 
pseudograph.  p.  304 

Eulerian  pseudograph  An  Eulerian  pseudograph  is  a 
pseudograph  which  possesses  an  Eulerian  circuit. 
p.  304 


even  cycle  An  even  cycle  in  a  pseudograph  is  a  cycle  which 
contains  an  even  number  of  edges,  p.  303 
even  vertex  An  even  vertex  of  a  pseudograph  is  one  of 
even  degree,  p.  286 

F 

factor  See  the  definition  of  divides. 

factorial  For  a  natural  number  n  >  1,  n  factorial  means 
the  product  of  the  first  n  natural  numbers.  It's  denoted 
n\.  For  example,  3!  =  6  and  5!  =  120.  By  convention, 
0!  =  1.  p.  155 

Fibonacci  sequence  The  Fibonacci  sequence  is  the 
sequence  which  begins  1,  1,  and  each  of  whose 
subsequent  terms  is  the  sum  of  the  preceding  two. 
Thus,  the  first  ten  terms  are  1,  1, 2,  3,  5,  8,  13,  21, 34,  55. 

p.  168 

floor  The  floor  of  a  real  number  x,  denoted  |xj-  is  the 
greatest  integer  less  than  or  equal  to  x.  For  example, 
L3|J  =  3,  LOJ  =  0,  and  L-3$J  =  -4.  p.  76 
flow  Given  a  directed  network  with  vertex  set  V,  the  weight 
(or  capacity)  of  arc  uv  denoted  cuv,  and  two  distinguished 
vertices  s  and  t  called  the  source  and  the  sink,  respec¬ 
tively,  an  (s,  t)-flow  is  a  set  of  numbers  {fMU)  satisfying 
1.  0  <  fuv  <  cuv  for  all  u,  v  €  V 
2*  EveV  =  Eyev  fvu  for  all  u  e  V\{s,  r).  p.  442 
flow-augmenting  chain  A  flow-augmenting  chain  in  a  net¬ 
work  with  (s.  f)-flow  {fM„}  is  a  chain  in  which  fuv  <  c„v 
for  each  forward  arc  uv  of  the  chain  and  fuv  >  0  for  each 
backward  arc.  p.  449 

function  A  function  from  a  set  A  to  a  set  B  is  a  binary 
relation  /  from  A  to  B  with  the  property  that  for  every 
a  €  A,  there  is  exactly  one  b  e  B  such  that  (a,  b)  e  /. 
We  write  / :  A  —r  B  to  indicate  that  /  is  a  function  from 
A  to  B.  For  example,  a  sequence  of  integers  is  just  a 
function  N  — >  Z.  p.  71 

G 

generating  function  The  generating  function  of  a  sequence 
a 0,  a i,U2,  ■  ■  ■  is  the  expression  f(x)  =  ao+a\.x+a2X2  + 

■  ■  ■ .  The  generating  function  of  the  natural  numbers,  for 

instance,  is  the  expression  1  +  2x  +  3x2  H - .  p.  178 

geometric  sequence  The  geometric  sequence  with  first 
term  a  and  common  ratio  r  is  the  sequence 
a,  ar,  ar2 ,  ar3 ,  . . .  .  For  example,  l,j,|,g,...  is  the 
geometric  sequence  with  first  term  1  and  common  ratio  5 . 
p.  167 

graph  A  graph  is  a  pair  (V,  £)  of  sets,  V  nonempty  and 
each  element  of  £  an  unordered  pair  of  distinct  elements 
of  V.  The  elements  of  V  are  called  vertices',  the  elements 
of  £  are  called  edges.  p.  286 
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greatest  common  divisor  The  greatest  common  divisor 
(abbreviated  gcd)  of  integers  a  and  b  not  both  of  which 
are  zero  is  the  largest  common  divisor  of  a  and  b ;  that  is, 
a  number  g  such  that 

1.  g  I  a,  g  |  b  (it’s  a  common  divisor)  and, 

2.  if  c  j  a  and  c  \  b.  then  c  <  g  (it’s  the  largest  common 
divisor). 

In  general,  for  n  >  2,  the  gcd  of  n  nonzero  integers 
a\ ,  a2,  • . .  ,an  can  be  defined  either  as  the  largest  of  the 
common  divisors  of  these  integers  or,  inductively,  by 

gcd(ai,a2 - ,a„)  =  gcd(ai,  gcd(«2 - - «„))• 

For  example,  gcd(4,  18)  =  2,  gcd(  1 8,  —  30)  =  6  and 
gcd(— 6,  10,  — 15)  =  I.  p.105 
greatest  lower  bound  A  greatest  lower  bound  (abbreviated 
gib)  for  two  elements  a,  b  in  a  partially  ordered  set  (A,  <) 
is  an  element  g  €  A  satisfying 

1.  g  <  a,  g  <  b\  and 

2.  if  c  <  a  and  c  <  b  for  some  c  e  A  then  c  <  g. 

If  a  and  b  have  a  gib,  then  this  element  is  unique  and 
denoted  a  A  b.  For  example,  the  greatest  lower  bound  of 
two  natural  numbers  with  respect  to  ’’divides”  is  just  their 
greatest  common  divisor.  p.  66 


H 

Hamiltonian  cycle  A  Hamiltonian  cycle  in  a  graph  is 
a  cycle  in  which  every  vertex  of  the  graph  appears. 
p.  310 

Hamiltonian  graph  A  Hamiltonian  graph  is  one  which  has 
a  Hamiltonian  cycle,  p.  310 

homeomorphic  Two  graphs  are  homeomorphic  if  and  only 
if  each  can  be  obtained  from  the  same  graph  by  adding 
vertices  to  edges.  p.  417 

hypothesis  See  argument. 


I 

ideal  An  ideal  of  Z  is  a  subset  A  of  Z  with  three  properties: 
(i)  0  e  A;  (ii)  a  e  A  implies  —a  e  A;  (iii)  a,b  e  A 
implies  a  +  b  e  A.  For  example,  the  set  of  even  integers 
is  an  ideal  of  Z.  p.  162 

identity  function  The  identity  function  on  a  set  A  is  the 
function  i:  A  ->  A  defined  by  t(a)  =  a  for  any  a  e  A. 
Sometimes  we  write  la  to  emphasize  that  the  domain  of 
this  function  is  the  set  A.  p.  77 
incident  A  vertex  v  of  a  pseudograph  is  incident  with  an 
edge  e  if  and  only  if  e  =  {t>,  w }  for  some  vertex  w. 

p.  286 

indegree  The  indegree  of  a  vertex  in  a  directed  graph 
is  the  number  of  arcs  directed  towards  that  vertex. 
p.  343 


injective  This  is  another  word  for  one-to-one  (see  defini¬ 
tion).  p.  73 

intersection  The  intersection  of  sets — denoted  A  (T  B  for 
two  sets,  AnfiflC  for  three,  and  n"_ l  A,-  for  n  sets — 
is  the  set  of  elements  which  belong  to  each  of  the  sets. 
For  example,  {0,  1 , 2)  IT  {0,  1 , 3}  =  {0,  I };  {0|  n  N  =  0. 
p.  43 

inverse  function  The  inverse  of  a  one-to-one  onto  function 
f :  A  —r  B  is  the  function  B  — >  A  obtained  by  reversing 
the  pairs  of  /.  (Only  a  one-to-one  onto  function  has  an 
inverse.)  For  example,  if  /  =  {(1,  a),  (2,  £>)},  then  f~l  = 
{(a,  l),(b,2)j.  The  inverse  of  the  exponential  function 
R  ->  (0,  oo)  defined  by  / (jc)  =  ex  is  the  natural  logarithm 
(0,  oo)  — >  R  defined  by  f~ 1  (x)  =  lnx.  p.  80 
isolated  vertex  An  isolated  vertex  in  a  pseudograph  is  one 
of  degree  0.  p.  286 

isomorphic  Graphs  are  isomorphic  if  and  only  if  there  is 
an  isomorphism  (see  definition)  from  one  to  the  other. 
p.  295 

isomorphic  labeled  trees  Labeled  trees  T j  and  T2  are  iso¬ 
morphic  if  and  only  if  they  have  the  same  number  of 
vertices,  they  are  labeled  with  the  same  set  of  labels,  and 
vertices  labeled  v  and  w  in  7j  are  adjacent  if  and  only  if 
v  and  w  are  adjacent  in  7?.  p.  375 
isomorphism  An  isomorphism  from  a  graph  G\  (V) ,  £\ )  to 
a  graph  <52(V2,£2)  is  a  one-to-one  function  <p  from  Vi 
onto  V2  such  that  if  vw  is  an  edge  in  E\ ,  then  i p(v)<p(w) 
is  an  edge  is  E 2 ,  and  such  that  every  edge  in  £2  has  the 
form  <p(v)<p(w )  for  some  edge  vw  €  £ i.  p.  295 


L 

lattice  A  lattice  is  a  partially  ordered  set  in  which  every 
two  elements  have  a  greatest  lower  bound  and  a  least 
upper  bound.  For  example,  the  set  of  subsets  of  a  set  is  a 
lattice.  p.  67 

leaf  A  leaf  in  a  graph  is  a  vertex  of  degree  1.  p.372 

least  common  multiple  The  least  common  multiple  (1cm) 
of  nonzero  integers  a  and  b  is  the  smallest  of  the  positive 
common  multiples  of  a  and  b;  that  is,  it’s  an  integer  f  >  0 
such  that 

1.  a  \  t,  b  1 1  (it’s  a  common  multiple)  and, 

2.  if  m  >  0,  a  |  m  and  b\m,  then  £  <m  (it’s  the  smallest 
positive  common  multiple). 

For  example,  lcm(10,  12)  =  60  and  lcm(18,  30)  =  90. 

p.  110 

least  upper  bound  A  least  upper  bound  (abbreviated  lub) 
for  two  elements  a ,  b  in  a  partially  ordered  set  (A,  <)  is 
an  element  £  e  A  satisfying 

1.  a  <  l,  b  <  £  and, 

2.  if  a  <  c  and  b  <  c  for  some  c  e  A,  then  l  <  c. 


If  a  and  b  have  a  lub,  then  this  element  is  unique  and 
denoted  a  v  b.  For  example,  the  least  upper  bound  of  two 
natural  numbers  with  respect  to  "divides”  is  just  their  least 
common  multiple.  p.  66 

length  The  length  of  a  walk  in  a  pseudograph  is  the  number 
of  edges  it  contains.  p.  303 
logically  equivalent  Statements  A  and  23  are  logically 
equivalent  and  we  write  “A  23”  if  A  and  23  have 

identical  truth  tables.  For  example,  A :  p  -*■  (—'ey )  is  log¬ 
ically  equivalent  to  23:  — •(/?  Aq).  p.  20 
loop  A  loop  is  an  edge  of  a  pseudograph  which  is  incident 
with  only  one  vertex.  p.  286 

M 

map  This  is  another  word  for  function ,  which  was  defined 
previously,  p.  71 

matching  A  matching  in  a  graph  is  a  set  of  edges  with 
the  property  that  no  vertex  is  incident  with  more  than  one 
edge  in  the  set.  p.  459 

maximal  A  maximal  element  of  a  partially  ordered  set 
(A,  <)  is  an  element  a  with  the  property  that  a  <  b 
implies  a  =  b.  For  example,  4  is  a  maximal  element  of 
{ 1 , 2,  3,  4,  5,  6}  with  respect  to  "divides.”  p.  66 
maximum  A  maximum  element  of  a  partially  ordered  set 
(A,  <)  is  an  element  a  with  the  property  that  a  >  b  for 
every  b  e  A.  For  example,  N  is  a  maximum  element  with 
respect  to  c  on  N;  also,  10  is  not  a  maximum  element 
with  respect  to  “divides”  on  N.  p.  65 
minimal  A  minimal  element  of  a  partially  ordered  set 
(A,  <)  is  an  element  a  with  the  property  that  b  <  a 
implies  a  =  b.  For  example,  3  is  a  minimal  element  of 
{2,  3,  4,  5,  6}  with  respect  to  "divides."  p.  66 
minimum  A  minimum  element  of  a  partially  ordered  set 
(A,  ;<)  is  an  element  a  with  the  property  that  a  <  b  for 
every  b  e  A.  For  example,  1  is  a  minimum  element  with 
respect  to  “divides”  on  N  but  3  is  not.  p.  65 
Minimum  Connector  Problem  Find  the  spanning  tree  of 
least  weight  in  a  weighted  connected  graph,  p.  377 
minimum  spanning  tree  A  minimum  spanning  tree  in  a 
weighted  graph  is  a  spanning  tree  for  which  the  sum  of 
the  weights  of  the  edges  is  least  among  all  spanning  trees. 
p.  378 

multiple  edges  Multiple  edges  are  edges  (by  implication, 
more  than  one)  in  a  pseudograph  which  are  incident  with 
the  same  two  vertices.  p.  286 
multiplicity  See  the  definition  of  prime  decomposition. 

N 

n-coloring  See  coloring, 
n -cycle  See  cycle. 

negation  The  negation  of  the  statement  p  is  the  statement 
—•p  which  asserts  that  p  is  not  true.  For  example,  the 
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negation  of  “jt  equals  4”  is  the  statement  “x  does  not 
equal  4.”  p.  5 

O 

odd  cycle  An  odd  cycle  is  a  cycle  which  contains  an  odd 
number  of  edges,  p.  303 

odd  vertex  An  odd  vertex  in  a  pseudograph  is  a  vertex  of 
odd  degree,  p.  286 

one-to-one  A  function  /:  A  implies  B  is  one-to-one ,  or 
1-1,  if  and  only  if  f(a\)  =  f  {af)  implies  a \  =  02-  For 
example,  the  function  /:  Z  implies  Z  defined  by  /(n)  = 
In  is  one-to-one,  but  the  floor  function  |_  J :  R  implies  Z 
is  not.  p.  73 

one-to-one  correspondence  This  is  the  same  as  bijection 
and  is  the  preferred  term  in  the  context  of  cardinality. 
The  function  x  ex  defines  a  one-to-one  correspondence 
between  the  reals  and  the  positive  reals  showing  that  these 
sets  have  the  same  cardinality.  p.  89 

onto  A  function  /:  A  — >•  B  is  onto  if  and  only  if  for 
each  b  e  B,  there  exists  a  e  A  such  that  b  =  f(a). 
For  example,  the  function  / :  (0,  oo)  — ►  R  defined  by 
f(x)  =  lnx  is  onto.  p.  73 

order  See  same  order  or  smaller  order  or  total  order. 

orient  To  orient  an  edge  in  a  pseudograph  is  to  assign  a 
direction  to  it.  p.  405 

outdegree  The  outdegree  of  a  vertex  in  a  directed  graph 
is  the  number  of  arcs  directed  away  from  that  vertex. 
p.  343 


P 

partially  ordered  set  A  partially  ordered  set  is  a  pair  (A,  < 
),  where  A  is  a  set  on  which  there  is  defined  the  partial 
order  ;<.  For  example,  (R,  <)  is  a  partially  ordered  set. 
p.  63 

partial  order  A  partial  order  on  a  set  A  is  a  binary  relation 
on  A  which  is  reflexive,  antisymmetric,  and  transitive.  For 
example,  <  defines  a  partial  order  on  R.  p.  63 
partition  A  partition  of  a  set  A  is  a  collection  of  disjoint 
subsets  (sometimes  called  cells)  of  A  whose  union  is  A. 
For  example,  the  positive  integers,  the  negative  integers, 
and  {0}  form  a  partition  of  Z.  p.  59 
path  A  path  in  a  pseudograph  is  a  walk  in  which  all  ver¬ 
tices  are  distinct,  p.  303 

perfect  matching  A  perfect  matching  in  a  graph  is  a  set  of 
edges  which  have  the  property  that  each  vertex  is  incident 
with  exactly  one  edge  in  the  set.  p.  459 
permutation  A  permutation  of  a  set  of  distinct  symbols 
is  an  arrangement  of  them  in  a  line  in  some  order.  For 
example,  cab  is  a  permutation  of  a ,  b ,  c.  p.  213 
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planar  A  planar  graph  is  one  that  can  be  drawn  in 
the  plane  in  such  a  way  that  no  two  edges  cross. 
p.  413 

poset  This  is  an  abbreviation  for  partially  ordered  set. 
p.  63 

power  set  The  power  set  VIA)  of  a  set  A  is  the  set  of  all 
subsets  of  A.  For  example,  if  A  =  {1,2},  then  V{A)  = 
{0,A,{1},{2}}.  p.41 

premise  See  argument. 

prime  A  prime  or  prime  number  is  a  natural  number  p  >2 
whose  only  positive  divisors  are  itself  and  1 ;  for  example, 
2,  3,  and  5.  p.  114 

prime  decomposition  The  prime  decomposition  of 
an  integer  n  >  2  is  its  representation  in  the  form 
n  =  </“' qf1  ■  ■  ■  </“' ,  where  q\,  ...  ,qs  are  distinct  primes 
and  ,as  are  natural  numbers  called  the  multi¬ 

plicities  of  <?],...  ,<7j,  respectively.  For  example,  the 
prime  decomposition  of  200  is  200  =  2352.  The  prime 
factors  of  200  are  2  and  5,  with  multiplicities  3  and  2, 
respectively,  p.  118 
prime  divisor  See  prime  factor. 

prime  factor  The  prime  factors  (or  prime  divisors)  of  an 
integer  n  >  2  are  the  prime  numbers  which  divide  it. 
These  are  the  p,  or  qj  when  n  is  expressed  in  the  form  n  = 
P\P2  •  •  •  Pr  or  n  =  q^'qj2  ■  •  •  q?s  by  the  Fundamental 
Theorem  of  Arithmetic.  For  example,  the  prime  factors 
of  15  are  3  and  5;  the  prime  factors  of  36  are  2  and  3. 

p.  118 

Principle  of  Mathematical  Induction  Given  a  statement  CP 
concerning  the  integer  n,  suppose 

1.  CP  is  true  for  some  particular  integer  «o>  and 

2.  if  CP  is  true  for  some  particular  integer  k  >  no,  then  it 
is  true  for  the  next  integer  k  +  1 . 

Then  CP  is  true  for  all  integers  n  >  no-  p.  151 
proper  subset:  A  set  A  is  a  proper  subset  of  a  set  B  and 
we  write  A  ^  5  if  and  only  if  every  element  of  A  is 
also  an  element  of  B,  but  A  ^  B.  For  example,  N  C  z. 
p.  40 

pseudograph  A  pseudograph  is  like  a  graph,  but  it  may 
contain  multiple  edges  or  loops.  p.  287 


Q 

quotient  If  a  and  b  are  integers,  b  /  0,  and  a  =  qb  +  r 
with  0  <  r  <  b,  then  q  is  called  the  quotient  and  r  the 
remainder  when  a  is  divided  by  b.  For  example,  when  26 
is  divided  by  7,  the  quotient  is  3  and  the  remainder  is  5, 
because  26  =  3(7)  +  5.  p.  99 
quotient  set  The  quotient  set  of  an  equivalence  relation  is 
the  set  of  all  equivalence  classes.  For  example,  if  the 
equivalence  relation  on  Z  is  defined  by  a  ~  b  if  and  only 
if  a  —  b  is  divisible  by  2,  there  are  two  equivalence  classes. 


the  even  integers  and  the  odd  integers,  so  the  quotient  set 
is  the  set  {evens,  odds}.  p.  57 

R 

range  The  range  of  a  function  / :  A  — >  B  is  the  set  of 
elements  b  €  B  which  are  of  the  form  b  =  f(a)  for 
some  a  €  A.  For  example,  the  range  of  the  floor  function 
[  J :  R  — ►  R  is  the  set  of  integers.  p.  73 
r -combination  See  combination. 

reflexive  A  binary  relation  ~  on  a  set  A  is  reflexive  if  and 
only  if  a  ~  a  for  all  a  e  A.  For  example,  <  is  reflexive 
on  R  but  <  is  not.  p.  52 

relatively  prime  Integers  are  relatively  prime  if  and  only 
if  they  are  nonzero  and  the  only  positive  integer  which 
divides  each  of  them  is  1 ;  equivalently,  their  greatest  com¬ 
mon  divisor  is  1.  For  example,  2  and  3  are  relatively 
prime,  as  are  the  pairs  —16,  21  and  —15,  —28.  p.  108 
remainder  See  the  definition  of  quotient. 
r-permutation  An  r -permutation  of  n  symbols  (r  and  n 
positive  integers,  r  <  n)  is  an  arrangement  of  r  of  them  in 
a  line  in  some  order.  For  example,  caz  is  a  3-permutation 
of  the  letters  of  the  English  alphabet.  p.  213 
root,  rooted  tree  A  tree  is  rooted  if  it  comes  with  a  spec¬ 
ified  vertex,  called  the  root.  A  digraph  is  a  rooted  tree  if 
the  underlying  graph  is  a  tree  and  there  is  a  unique  vertex 
called  the  root,  of  indegree  0.  p.  370  and  p.393 

S 

same  cardinality  Two  sets  A  and  B  have  the  same  cardi¬ 
nality  if  and  only  if  there  is  a  one-to-one  correspondence 
between  them.  For  example,  the  integers  and  the  even 
integers  have  the  same  cardinality,  p.  89 
same  order  Functions  /  and  g :  N  ->  R  have  the  same 
order,  and  we  write  /  x  g,  if  and  only  if  each  is  Big  Oh 
of  the  other.  For  example,  15n3  x  4 n3.  p.  248 
saturated  A  saturated  vertex  in  a  graph  is  a  vertex  which 
is  incident  with  one  of  the  edges  in  a  given  matching. 
p.  459 

score  The  score  of  a  vertex  in  a  tournament  is  the  outdegree 
of  that  vertex.  p.  355 

score  sequence  The  score  sequence  of  a  tournament  is  the 
list  of  outdegrees,  in  nonincreasing  order.  p.  355 
sequence  A  sequence  is  a  function  with  domain  some  set 
of  integers  (often  N).  It  is  usually  described  by  listing  the 
elements  of  its  range.  For  example,  the  sequence  which 
is  the  function  / :  N  —*■  R  defined  by  f(n)  =  n1  would 

likely  be  described  by  the  list  1 . 4,  9,  16 .  p.  163 

set  difference  The  set  difference  A  \  B  of  sets  A  and  B 
is  the  set  of  elements  which  are  in  A  but  not  in  B .  For 
example,  {0,  1, 2,  3}  \  {0,  1, 4}  =  {2,  3}.  p.  45 
sink  This  is  a  distinguished  vertex  in  a  directed  network. 
p.  441 


smaller  order  A  function  / :  N  —*■  R  has  smaller  order 
than  another  function  g,  written  f  <  g,  if  and  only  if 
/  is  Big  Oh  of  g  but  g  is  not  Big  Oh  of  /.  For  example, 
n1  <  2”.  p.  248 

source  This  is  a  distinguished  vertex  in  a  directed  network. 
p.  441 

spanning  tree  A  spanning  tree  for  a  connected  graph  Q  is  a 
subgraph  which  is  a  tree  and  which  includes  every  vertex 

of  Q.  p.  378 

strongly  connected  A  digraph  is  strongly  connected  if  and 
only  if  there  is  a  walk  from  any  vertex  to  any  other  vertex 
which  respects  orientation  of  arcs.  p.  343 
strongly  connected  orientation  A  strongly  connected  ori¬ 
entation  of  a  graph  is  an  assignment  of  orientations  (direc¬ 
tions)  to  all  edges  in  such  a  way  that  the  resulting  digraph 
is  strongly  connected.  p.  405 
subgraph  A  subgraph  of  a  pseudograph  Q  is  a  pseudograph 
whose  vertices  and  edges  are  subsets  of  the  vertices  and 
edges,  respectively,  of  Q.  p.  287 
subset  A  set  A  is  a  subset  of  a  set  B,  denoted  A  c  B,  if 
and  only  if  every  element  of  A  is  also  an  element  of  B. 
For  example,  {0,  1}  c  (0,  1,2)  but  Z  g  N.  p.  40 
surjective  Another  word  for  onto.  p.  73 
symmetric  A  binary  relation  ~  on  a  set  A  is  symmetric  if 
and  only  if  a  ~  b  implies  b  ~  a.  For  example,  a  ~  b  if 
and  only  if  ab  =  1  defines  a  symmetric  relation  on  Q,  but 
<  does  not.  p.  52 

symmetric  difference  The  symmetric  difference  of  sets  A 
and  B,  denoted  A  ©  B,  is  the  set  of  elements  which 
belong  to  precisely  one  of  the  sets.  More  generally, 
the  symmetric  difference  A\  ©  At  ©  A3  ©  •  •  •  0  A„ 
of  n  sets  A\,  A  2,  A3,...  ,  A„  is  the  set  of  those  ele¬ 
ments  which  are  members  of  an  odd  number  of  the  sets 
A,-.  For  example,  {0,  1, 2,  3}  ©  {0,  1, 4,  5}  =  (2,  3,4,  5). 
pages  47  and  161. 


T 

tautology  A  tautology  is  a  compound  statement  that  is 
always  true.  For  example,  (p/\q)  —*■  (pvq)  is  a  tautology. 
(Examine  the  truth  table.)  p.  20 

term  The  terms  of  a  sequence  are  the  numbers  of  its  range 
or,  thinking  of  a  sequence  as  a  list,  the  numbers  in  the  list. 
For  example,  the  first  five  terms  of  the  Fibonacci  sequence 
are  1,  I,  2,  3,  5.  p.  164 

total  order  A  total  order  on  a  set  is  a  partial  order  in  which 
every  two  elements  are  comparable.  For  example,  <  is 
a  total  order  on  R,  but  C  is  not  a  total  order  on  'P(R). 
p.  64 

tournament  A  tournament  is  a  digraph  with  the  property 
that  for  every  two  distinct  vertices  u,v,  exactly  one  of 
uv,  vit  is  an  arc.  p.  355 
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trail  A  trail  in  a  pseudograph  is  a  walk  in  which  all  edges 
are  distinct,  p.  303 

transitive  A  binary  relation  ~  on  a  set  A  is  transitive  if 
and  only  if  a  ~  b  and  b  ~  c  implies  a  ~  c.  For  example, 
the  binary  relation  ~  defined  by  a  ~  b  if  ab  >  0  is  a 
transitive  relation  on  Z  \  {0}  but  not  on  Z.  p.  54 
transitive  tournament  A  transitive  tournament  is  a  tour¬ 
nament  with  the  property  that  whenever  uv  and  vw  are 

arcs,  so  is  uw  an  arc.  p.  357 
Traveling  Salesman’s  Problem  The  Traveling  Salesman's 
Problem  is  to  find  a  Hamiltonian  cycle  of  least  weight  in 
a  weighted  connected  graph,  p.  325 
tree  A  tree  is  a  connected  graph  which  contains  no  circuits. 
p.  369 

triangle  A  triangle  in  a  graph  is  a  3-cycle,  p.  299 

U 

uncountable  An  uncountable  set  is  an  infinite  set  which 
does  not  have  the  cardinality  of  N.  For  example,  the  real 
numbers  form  an  uncountable  set.  p.  90 
union  The  union  of  sets — denoted  A  U  B  for  two  sets, 
AUBUC  for  three,  and  |J"=i  A,-  for  n — is  the  set  of  ele¬ 
ments  which  belong  to  at  least  one  of  them.  For  example, 
{0,  1,2)  U{0,  1,3)  =  (0,  1,2,  3).  p.  43 

V 

valid  argument  See  argument. 

value  of  a  flow  The  value  of  a  flow  {f„v )  'n  a  directed  net¬ 
work  with  source  s,  sink  t,  and  vertex  set  V  is  the  integer 

valCF)  = 

veV  ueV 

equivalently, 

valCF)  =  ^  fv,  -  ^  f,v.  p.  443 
veV  veV 

W 

walk  A  walk  in  a  pseudograph  is  an  alternating  sequence  of 
vertices  and  edges,  beginning  and  ending  with  a  vertex, 
in  which  each  vertex  (except  the  last)  is  incident  with 
the  edge  which  follows  it  and  the  last  vertex  is  incident 
with  the  preceding  edge.  It  is  closed  if  and  only  if  the 
first  vertex  is  the  same  as  the  last  and  otherwise  open, 
p.  303 

weighted  graph  A  weighted  graph  is  a  graph  in  which 
there  is  a  nonnegative  real  number  associated  with  each 
edge.  p.  325 

Well-Ordering  Principle  This  principle  asserts  that  every 
nonempty  set  of  natural  numbers  has  a  smallest  element. 
p.  98 
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accessibility  index,  292 
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digraph,  392 
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of  a  digraph,  343 
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Adleman,  Leonard,  143 
algorithm,  240 
Bellman’s,  394 
Bellman-Ford,  345 
binary  search,  261 
complexity,  262 
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improved,  328,  329 
division,  100 
edge  disjoint  paths,  457 
enumerating  permutations,  272 
Euclidean,  106 
complexity,  257 
Eulerian  circuit,  307 
exponential,  253 
Floyd-Warshall,  330 
complexity,  333 
greedy,  383 
Homer’s,  243 
complexity,  258 
Kruskal’s,  383 
complexity,  384 
linear  search,  260 
logarithmic,  253 
merge  sort,  267 
merging,  265 
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backward  arc,  449 
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base,  101,  102 
Bellman,  Richard,  345,  393 
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binary,  102 
relation,  51 
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Cayley,  Arthur,  374 
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flow-augmenting,  449 
rule,  31 
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polynomial,  174 
roots,  174 
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Chen,  J.,  123 
Chinese 

Postman  Problem,  338 
Remainder  Theorem,  140,  141 
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circuit,  303 

Eulerian,  304,  338,  343,  351 
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how  to  order,  273 
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difference,  166 
ratio,  167 
commutative,  97 
commutativity,  22 
comparable,  65 
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complement  of  a  set,  45 
complete,  288 
bipartite,  288 
complex  numbers,  39 
complexity 

binary  search,  262 
bubble  sort,  264 
depth-first  search,  401 
Dijkstra,  327 
improved,  329 
enumerating  permutations, 
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Euclidean  algorithm,  257 
Floyd-Warshall,  333 
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Kruskal,  384 
merge  sort,  268 
Prim,  387 
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composite,  114 
composition  of  functions,  82 
compound  statement,  2,  17 
disjunctive  normal  form,  26 
conclusion,  3,  28 
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necessary,  9 
sufficient,  9 
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necessary  and  sufficient,  10 
congruence 
class,  126 
to  solve,  129 
congruent,  126 
connected,  297,  305 
strongly,  343,  405 


conservation  of  flow,  442 
continuum  hypothesis,  93 
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proof  by,  13,  14,  40,  59,  92, 

115,  117,  312,  358,  392, 
409,  416,  460 
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206 

converse,  4,  23,  133,  392 
convex,  162,  415 
Cook,  Stephen,  254 
coordinate,  48 
countable,  90,  125 
counter,  242 

counterexample,  10,  44,  74 
covers,  66 

CPM,  see  critical  path  method 
critical  path,  361 
method,  361 
cryptology,  142 
cube,  317,  415,  430 
cut 

(s,  t)-,  444 
capacity  of,  444 
edge,  406 
cycle,  303 

299,  303 
even,  303 

Hamiltonian,  310,  311,  343,  368, 
462 

odd,  303 
proper,  311 
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Dijkstra,  Edsger,  326 
algorithm,  326 
complexity,  327 
improved,  328,  329 
Dirac,  G.  A.,  314 
theorem  of,  314 
direct 

product,  48 

proof,  12,  84,  130,  248, 
250 

directed  network,  360,  441 
disjunctive 

normal  form,  26 
syllogism,  31 
distributive,  98 
distributivity,  22 
divides,  104 
divisible,  104 
division  algorithm,  100 
divisor,  104 
prime,  118 

dodecahedron,  314,  415,  430 
domain,  73 
restrict  the,  75 
dot  product,  137 
double  implication,  4 
dual  graph,  430 
Dudeney,  H.  E.,  337 

E 

edge,  286 
cut,  406 
delete,  288 
disjoint,  317,  456 
multiple,  286 
empty  set,  39 
enzyme  digest,  350,  353 
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equivalence 
class,  57,  126 
logical,  20,  21 
relation,  56,  251,  297 
equivalent 

statements,  11 
the  following  are,  358,  369 
Eratosthenes,  1 1 6 
Sieve  of,  1 16 
Euclid,  106,  115 
Euclidean  algorithm,  106 
complexity,  257 
Euler,  Leonhard,  120,  161,  277, 
304,  415 

Eulerian 

circuit,  304,  338,  343,  351 
pseudograph.  304,  306,  339 
even  cycle,  303 
event,  196 
events 

mutually  exclusive,  196 
exclusive  or,  21,  28 
existential  quantifier,  6 
exponential  time,  253 
extended  bases,  351 

F 

factor,  104 
prime,  1 1 8 

factorial,  155,  163,  211 
Faltings,  Gerd,  122 
feasible  relationships,  433 
Fermat,  Pierre  de,  48,  120,  121 
Last  Theorem  of,  121 
Little  Theorem  of,  133 
number,  120 
prime,  119,  163 
Fibonacci,  Leonardo,  168 

sequence,  168,  173,  232,  237, 
257 

Fields  Medal,  93 
finite,  88 

Five-Color  Theorem,  425 
flag,  268,  S-36 
floor,  76,  100 
flow 

(s,  t)-,  442 
conservation  of,  442 


maximum,  443 
unit,  445 
value,  443 

flow-augmenting  chain,  449 
Floyd,  R.  W„  330 
Floyd-Warshall  algorithm,  330 
complexity,  333 
for  all,  6 

Ford  Jr.,  Lester  Randolph,  345,  450 
forest,  372,  377 
forward  arc,  449 

Four-Color  Theorem,  46,  421,  424 
fragment,  350 
unsplittable,  352 
Fulkerson,  D.  R.,  450 
function,  71 
3  m  +  1,  173 
bijective,  73 
ceiling,  76 
complexity,  246 
composition,  82 
floor,  76 
generating,  375 
greatest  integer,  see  floor 
identity,  77 
inverse,  80 
powers  of,  172 
Fundamental  Theorem  of 

Arithmetic,  118,  157 

G 

Gage,  Paul,  1 14 

gcd,  see  greatest  common  divisor 
generality 

without  loss  of,  14 
generating  function,  178,  375 
geometric  sequence,  167 
sum  of,  167 

gib,  see  greatest  lower  bound 
Goldbach,  Christian,  123 
conjecture  of,  123 
golden  mean,  171 
graph,  286 

acyclic,  369,  377 
bipartite,  288 
complete,  288 
connected,  305 
dual,  430 


finite,  286 
Hamiltonian,  310 
line-of-sight,  432 
Petersen,  312,  419 
planar,  278,  413 
plane,  413 
relationship,  433 
weight  of,  325 
weighted,  325 
greatest 

common  divisor,  105,  113 
lower  bound,  67 
greedy,  383 
grid  segment,  430 
Guthrie,  Francis,  421 

H 

Hadamard,  Jacques,  121 
Hajratwala,  Nayan,  115 
Haken,  Wolfgang,  422 
Hall,  Philip,  460 

Marriage  Theorem,  460, 

461 

Hamilton,  William  Rowan,  313, 
374,  415 
Hamiltonian 
circuit,  3 1 1 

cycle,  310,  311,  343,  368, 

462 
graph,  310 

path,  316,  356,  358,  462 
Hasse,  Helmut,  66 
diagram,  66,  111 
Heawood,  Percy,  422 
hexadecimal,  102 
homeomorphic,  417 
homogeneous,  see  recurrence 
relation 

Horner,  William,  243 
algorithm,  243 
hundreds  digit,  101 
hypothesis,  3,  28 

I 

icosahedron,  317,  415 
ideal,  162 
idempotence,  22 
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identity,  98 
function,  77 
matrix,  319 

if  and  only  if,  4,  10,  59,  306,  372, 
407 

image,  72,  73 
implication,  3,  17 
double,  4 
implies,  3 

incidence  relations,  296 
incident,  286 
inclusion-exclusion 
principle  of,  190 
indegree,  343 
sequence,  343 
Indelkofer,  Karl-Heinz,  122 
induction 

hypothesis,  151 
principle  of  mathematical,  1 5 1 
proof  by,  356,  372,  415,  424, 
425 

strong  form,  157,  408 
infinite,  88 
countably,  90 
initial  condition,  164 
injective,  73 
Instant  Insanity,  279 
intersection,  43 
interval 
closed,  46 
half  open,  46 
notation,  46 
open,  46 
inverse,  80 

additive,  98,  132 
function,  80 
has  an,  80 

multiplicative,  98,  132 
of  an  implication,  23 
irrational  number,  14,  39 
ISBN,  136 
isolated,  286 
isomer,  374 
isomorphic,  294-296 
labeled  trees,  375 
isomorphism,  295 
class,  297 


J 

Jarai,  Antal,  122 
joins,  286 

K 

Kempe,  Alfred  B.,  422 
Kirchhoff,  Gustav,  319,  379 
Theorem  of,  379 
knight’s  tour,  316 
Koch,  J„  422 

Konigsberg  Bridge  Problem,  277 
Kruskal,  J.  B„  383 
algorithm  of,  383 
complexity,  384 
Kuratowski,  Kazimierz,  417 
Theorem  of,  418 
Kwan,  M.  K„  337 

L 

labeled  tree,  375 
labeling 

canonical,  392 
lattice,  68,  110 

lcm,  see  least  common  multiple 

leaf,  372 

least 

common  multiple,  110 
upper  bound,  67 
length 

of  walk,  303 
level 

of  a  vertex,  370 
lexicographic  order,  65,  271 
line-of-sight  graph,  432 
linear 

combination,  108,  161 
search,  260 
time,  253 

logarithmic  time,  253 
logically  equivalent,  20,  21 
loop,  242,  286 
lub,  see  least  upper  bound 
Lucas-Lehmer  Test,  1 1 5 

M 

map,  see  function 
Marriage  Theorem,  461 


matching,  459 
perfect,  459 

mathematical  induction,  see 
induction 

matrix 

adjacency,  318 
identity,  319 
permutation,  321 
transpose,  322 
Matrix-Tree  Theorem,  379 
maximal,  67 
maximum,  66 
flow,  443 
Menger,  Karl,  456 
theorem  of,  456 
merge  sort,  267 
complexity,  268 
merging  ordered  lists,  265 
Mersenne,  Marin,  119 
prime,  1 19 
minimal,  67 
minimum,  66 

connector  problem,  377,  382 
spanning  tree,  378 
minterm,  26 
mod,  126,  129 
modulus,  126 
modus 

ponens,  31 
tollens,  31 

multiplication  rule,  197 
multiplicity,  118 
mutually  exclusive,  196 

N 

n\,  n  factorial,  155,  163,  211 

n -coloring,  423 

n -cycle,  299,  303 

n-gon,  162 

n -tuple,  48 

natural  numbers,  37 

necessary,  9 

and  sufficient,  10 
negation,  5,  17 
double,  22 
net,  430 
network 

directed,  360,  441 
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node,  430 

NP-complete,  254,  325,  424 

O 

octahedron,  415 
octal,  102 
odd  cycle,  303 
one-to-one,  73 

correspondence,  88,  89 
one-way  street  problem,  405 
onto,  73 
open 

interval,  46 
walk  or  trail,  303 
or,  2,  3 

exclusive,  21,  28 
order 

lexicographic,  65,  271 
partial,  64 
same,  248 
smaller,  248 
ordered  pair,  48 
ordering 

canonical,  392 
Ore,  Oystein,  314 
theorem  of,  317 
orient,  405 
orientation,  349,  405 
strongly  connected,  405 
outdegree,  343 
sequence,  343 

P 

pairwise 

disjoint,  196 
relatively  prime,  141 
palindrome,  202 
partial 

fractions,  182 
order,  64,  97 
partially  ordered  set,  64 
particular  solution,  175 
partition,  59,  126-128,  447 
Pascal,  Blaise,  231 
triangle  of,  232 
path,  303 
critical,  361 


Hamiltonian,  316,  356,  358, 
462 

shortest,  325 
paths 

edge  disjoint,  456 
Pell  sequence,  185 
perfect 

matching,  459 
square,  125,  170 
permutation,  213 
matrix,  321 
r-,  213 
permutations 

enumerating,  272 
complexity,  274 
ordering,  272 

PERT,  see  project  evaluation  and 
review 

Petersen,  Julius,  312 
graph  of,  312,  419 
Pigeon-Hole  Principle,  204 
strong  form,  206 
planar,  278,  413 
plane  graph,  413 
Platonic  solid,  317,  410,  415 
polyhedron 

convex  regular,  415 
polynomial  time,  253 
poset,  64 
power  set,  41 
powers  of  a  function,  172 
premise,  28 
Prim,  R.  C.,  383 
algorithm  of,  386 
complexity,  387 
prime,  1 14 

decomposition,  118 
divisor,  1 1 8 
factor,  118 
Fermat,  119,  163 
Mersenne,  1 1 9 
number,  1 14 
relatively,  108 

Prime  Number  Theorem,  121 
primes 
twin,  122 

Principle  of  Mathematical 

Induction,  see  induction 


product 

Cartesian,  48 
direct,  48 

project  evaluation  and  review,  361 
proof 

by  contradiction,  see 
contradiction 
cases,  see  cases 
direct,  see  direct 
proper 

cycle,  3 1 1 
subset,  40 
proposition,  22 
pseudograph,  287 

Eulerian,  304,  306,  339 

Q 

Q.E.D.,  12 
quadratic  time,  253 
quadrilateral,  299 
quantifier,  6 
existential,  6 
universal,  6 
quaternions,  313 
quotient,  98,  99 
set,  57 

R 

r-combination,  218 
r-permutation,  213 
Redei,  L.,  356 
theorem  of,  356 
range,  73 

rational  number,  14,  38 
recurrence  relation,  164 
homogeneous,  173 
second  order  linear,  173 
solve,  164 

recursive  definition,  163 
reduce  modulo  n,  129 
reflexive,  52,  296 
relation 
binary,  51 

equivalence,  56,  251,  297 
relationship  graph,  433 
relationships 
feasible,  433 
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relatively  prime,  108 
pairwise,  141 
remainder,  98,  99 
resolution,  31 
restrict  the  domain,  75 
ribonucleic  acid,  350 
Rivest,  Ronald,  143 
RNA,  see  ribonucleic  acid 
root,  370,  393 
rooted  tree,  393 
RSA  Algorithm,  143 
rules  of  inference,  30 
Russian  peasant  multiplication, 
240,  256 

S 

(s,  0-cut,  444 
0,  0-flow,  442 
same  cardinality,  89 
saturated 
arc,  446 
vertex,  459 

Schroder  Bernstein  Theorem,  93 

score,  355 

search 

binary,  261 
breadth-first,  404 
depth-first,  399 
linear,  260 
ternary,  270 

second  order  linear  recurrence 
relation,  173 
sequence,  163 
degree,  290 

Fibonacci,  168,  232,  237, 

257 

indegree,  343 
outdegree,  343 
Pell,  185 
set 

builder  notation,  38 
difference,  45 
empty,  39 
finite,  88 
infinite,  88 
partially  ordered,  64 
power,  41 
totally  ordered,  65 


Shamir,  Adi,  143 
Shimura,  Goro,  122 
shortest  path,  325 
Sieve  of  Eratosthenes,  116 
sink,  441 
slack,  363,  449 
Slowinski,  David,  114 
solve  (a  recurrence  relation),  164, 
173 

sort 

bubble,  263 
merge,  267 
source,  44 1 ,  442 
spanning  tree,  378 
Spence,  Gordon,  114 
statement,  2,  17 
compound,  2,  17 
Stirling,  James,  155 
formula  of,  155 
strongly  connected,  343,  405 
orientation,  405 
subgraph,  280,  287 
subset,  40 
proper,  40 

substitution  theorem,  30 

sufficient,  9 

sum 

arithmetic  sequence,  166 
geometric  sequence,  167 
superset,  40 
surjective,  73 
swap,  263 
symmetric,  52,  296 
difference,  47,  161 
matrix,  319 


T 

Taniyama,  Yutaka,  122 

target,  73 

tautology,  20 

Taylor,  Richard,  122 

tens  digit,  101 

term  (of  a  sequence),  164 

ternary  search,  270 

tetrahedron,  415 

theorem 

Schroder-Bernstein,  93 

substitution,  30 


there  exists,  6 
thousands  digit,  101 
Three  Houses — Three  Utilities, 
278,  413,  416 

time 

exponential,  253 
linear,  253 
logarithmic,  253 
polynomial,  253 
quadratic,  253 
total  order,  65 
totally  ordered,  65 
tournament,  355 
transitive,  357 
Towers  of  Hanoi,  177 
trail,  303 
closed,  303 
Eulerian,  307 
open,  303 

transitive,  54,  296,  357 
transpose 

of  matrix,  322 

Traveling  Salesman’s  Problem, 
271,  325,  368, 

388 

tree,  369 
labeled,  375 

minimum  spanning,  378 
rooted,  370,  393 
spanning,  378 
triangle,  289,  293,  299 
inequality,  248,  250 
triangulate,  421 
truth 

table,  17 
value,  2 

twin  prime  conjecture,  123 
twin  primes,  122 

U 

uncountable,  90 

union,  43 

unit  flow,  445 

units  digit,  101 

universal  product  code,  137 

unsplittable  fragment,  352 

UPC,  see  universal  product  code 
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V 

valid  argument,  28 
value  of  a  flow,  443 
vector,  137 
Venn  diagram,  43 
vertex,  286 
delete,  288 
even,  286 
isolated,  286 


odd,  286 
saturated,  459 

Vinogradov,  I.  M.,  123 

W 

walk,  303 
closed,  303 
length,  303 
open,  303 


Warshall,  S„  330 
Wassing,  Heinz-Georg,  122 
weight 

of  a  subgraph  graph,  325 
of  an  edge,  325 
weighted,  325 

Well-Ordering  Principle,  98,  158 

Wiles,  Andrew,  121 

without  loss  of  generality,  14 


Page  Symbol 


Meaning 


Complexity/Order  of  Functions 


247 

O 

Big  Oh 

248 

f  <g 

/  has  smaller  order  than  g 

248 

f  ~g 

functions  /  and  g  have  the  same  order 

Number 

Theory 

58 

nZ 

the  set  of  multiples  of  the  integer  n 

58 

nZ  +  r 

the  set  of  integers  of  the  form  via  +  r,  equivalently,  the  set  of  integers 
congruent  to  r  (mod  n) 

76 

M 

ceiling  of  x,  least  integer  >  x 

76 

UJ 

floor  of  x,  greatest  integer  <  x 

104 

| 

divides 

121 

7t(x) 

number  of  primes  p  <  x 

126 

a  =  b  (mod  n) 

a  is  congruent  to  b  mod  n,  i.e.,  n  |  (a  —  b ) 

Combinatorial 


155 

n ! 

n  factorial 

212 

P(n,  r) 

the  number  of  permutations  of  n  objects  taken  r  at  a  time 

217 

0 

a  binomial  coefficient;  read  “n  choose  r”  it  equals  the  number  of 
combinations  of  n  objects  taken  r  at  a  time;  i.e.,  the  number  of  ways 
to  choose  r  objects  from  n 

228 

Dn 

number  of  derangements  of  n  symbols 

Graph  r 

rheoretical 

286 

Q{V,S) 

Q  is  a  graph  with  vertex  set  V  and  edge  set  £ 

286 

degu 

the  degree  of  vertex  v 

288 

Q  \  {<?} 

the  graph  whose  vertices  are  those  of  Q  and  whose  edges  consist  of 
those  of  Q  except  for  e 

288 

G  \  M 

the  graph  whose  vertices  are  those  of  Q  except  for  v  and  whose  edges 
are  those  edges  of  Q  not  incident  with  v 

288 

ICn 

complete  graph  on  n  vertices 

288 

m,n 

complete  bipartite  graph  on  vertex  sets  of  m  and  n  elements 

295 

G\=Gi 

295  and  =  and  isomorphic 

325 

w(e) 

weight  of  edge  e 

355 

s(v) 

score  of  vertex  v  in  a  tournament 

423 

y  m 

chromatic  number  of  the  graph  G 

Page  Symbol 


Meaning 


Set  Theoretical 


38 

e 

belongs  to,  is  an  element  of 

39 

0 

the  empty  set 

40 

c 

is  a  subset  of 

40 

c 

is  a  proper  subset  of 

40 

D 

is  a  superset  of,  contains 

41 

V(A) 

the  power  set  (the  set  of  all  subsets)  of  A 

43 

u 

union 

43 

n 

intersection 

45 

\ 

set  difference 

45 

Ac 

the  complement  of  A 

46 

[a,  b] 

{jc  e  R  |  a  <  x  <  b\,  where  a,b  e  R 

46 

(a,  b] 

(jc  e  R  |  a  <  x  <  b],  where  a,  b  e  R 

46 

(— oo,  b ] 

{x  e  R  |  .r  <  b],  where  b  e  R 

46 

(a,  oo) 

{*  e  R  |  jc  >  a},  where  a  e  R 

47 

® 

symmetric  difference 

48 

X 

Cartesian  (or  direct)  product 

88 

|A| 

the  cardinality  of  A  (if  A  is  finite,  this  is  just  the  number  of  elements 
in  A) 

Equivalence  Relations 

57 

A/~ 

quotient  set  of  A  (the  set  of  equivalence  classes  of  an  equivalence 
relation  on  a  set  A) 

57,  126 

a 

equivalence  class  of  a\  also  used  to  denote  the  congruence  class  of  an 
integer  a  (mod  n) 

Partially  Ordered  Sets 

63 

< 

a  partial  order 

66 

V 

least  upper  bound 

66 

A 

greatest  lower  bound 

Functions 

72 

f-A^B 

/  is  a  function  from  A  to  B 

73 

1-1 

one-to-one 

77 

t 

the  identity  function 

77 

tA 

the  identity  function  on  the  set  A 

80 

rl 

inverse  of  the  function  / 

82 

o 

composition  (of  functions) 
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